The afternoon sessions started with the awarding of the first Everett Brenner Award award for the Best Contribution to Knowledge at the 2006 Search Engine Meeting. The winner was Stavros Macrakis formerly of Lycos and now with FAST and who ironically is scheduled to be the last speaker this afternoon and of the conference.
The sessions this afternoon dealt with web and intelligent tools. The first speaker was Paul Thompson of Dartmouth College. His talk “Search and Misinformation in Intelligence and Security Informatics” was quite interesting in that it deals with a relative little researched area. He said a what was needed was a new science along the lines of bioinformatics. Fraud was increasing and he cited one prominent journal which reported that at least 20% of accepted manuscripts, let alone those not accepted, contained at least one occurrence of fraud. He went into some detail of his research done over the last few years. His paper will be online Friday at the conference web site for those interested in this subject.
(more…)
No Comments »
After the morning break we reconvened for the panel discussion, Search: The Next Decade. Participants included;
- Susan Feldman (Moderator), IDC
- Suranga Chandratillake, blinkx
- Josh Jacobs, X1
- Andrew McKay, FAST
The panel discussion lasted an hour and half. However it was not so much a panel but rather a forum for the speakers to posit their ideas on the future. They spent nearly an hour and 15 minutes on this leaving barely 15 minutes from other panel members or the audience for questions.
One recurring theme is that managing information for the average user is time consuming.
To illustrate this a slide was shown which listed a breakdown of much time we spend on different tasks. Here’s the breakdown for businesses.
- We spend an average 14.5 hours a week on email which costs the company $21k a year.
- We spend an average 13.3 hours a week creating documents costing $19K a year
- We spend an average 9.9 hours a week analizing docs costing $17k a year
- We spend an average of 9.5 hours a week searching costing $14k a year
and there was much more.
Another item mentioned is that Google is not the end of search, there’s lot’s of innovation and growth to come.
(more…)
No Comments »
An interesting post but I don’t think so. SEO’s will adapt to the shifting market as they always do.
Could vertical search supplant SEO?: “I’m at The Search Engine Meeting in Boston listening to Vivisimo’s Raul Valdes-Perez promoting vertical search. In his vision, companies will aggregate their own information universes, discreetly putting their own information first. This can turn them into go-to sites for…”
(Via BusinessWeek Online — Blogspotting.)
No Comments »
Well we’re in a break right now and the three morning talks were all interesting, especially Stephen Arnold of AIT. Arnold is a long time information knowledge technophile and pulls no punches when discussing the industry, and that’s a good thing, as we need perspectives from all angles on search. So his talk “Google: The Erosion of Relevance” is one I’ve been anticipating. And he did not disappoint.
He started by calling Google, Googzilla. This drew some laughter from the crowd. He pointed out that all the talks from the previous day touched upon Google in some way. We’re obsessed with Google, we love Google. And since everyone loves Google we seem to be missing something. And that something is that search relevance is being eroded. And he wasn’t just critical of Google. He said all the major search engines, Microsoft and Yahoo included, and everyone else is eroding relevance. So exactly what does he mean by this?
His basic premise is that content is being steered, thus its relevance is being eroded. Search engine results are being skewed as people learn how to manipulate them. This erodes relevance. He cites emerging social tools, that, while people find cool, actually are doing us all a disservice by skewing the results. Some examples include del.icio.us which he says poses big problems, as we are overlooking what happens when humans use random terms to classify links. He cited Flickr who use “word” tags which erode relevance. He also cited digg, and how it appeared to be more popular then Slashdot but has recently run into problems in that some users had figured out to skew the results so their posts ranked at the top.
What about the search engine themselves? Are they allowing relevance to be eroded? His answer is not a simple one. A part of him says yes while another says no. Are they doing it intentionnally, no. However users are learning how to manipulate the results. To me it’s an ongoing game between the search engines and the users who want to rank high. He also notes that current search compnies are not focusing on the problems of search. Not everyone would agree with him on this point.
(more…)
No Comments »
A last note about the morning sessions. Mike Moran of IBM has put his talk, “Don’t just change the search engine (powerpoint)” on his blog.
The afternoon sessions turned out to be less interesing and useful to me than I had hoped except for a couple of talks.
The first talk was by Claude Vogel of Convera. Convera, formely Excalibur, has been providing enterprise class search for many year now and their products have matured well over that time. Claude spoke about speeding search using faceting, and as later speaker defined it, “a facet is a certain classifiable characteristic of the resource — a way to classify something.” To me the talk seemed more of a marketing speech for their product Excalibur which is not surprising.
Next up was Abe Lederman of Deep Web Technologies who spoke of challenges in scaling federated (metasearch) searches. He characterized the challenges in searching thousands of sources as follows:
- Determining the sources to search
- Retrieving searches from cache and how often to update the cache
- Performing many searches in parallel
- The need to bring the best documents back
And of course how to rank the results. He touched upon the methods they used including;
- Multi-user Relevance Ranking which includes QuickRank - occurence of search terms, MetaRank - custom algorithm applied to metadata, DeepRank - indexing fulltext documents
- User-driven Ranking
- Clustering
(more…)
No Comments »