Popularity Metrics
Search November 18th, 2003
In Tim Bray’s latest essay on search he points out what I feel is an often overlooked aspect of Google’s PageRank when it is applied to enterprise search:
“[PageRank] Won’t Work for You If you’re writing or deploying a search engine for your Intranet or product catalogue or portal, Google’s PageRank trick probably won’t work, because most Intranet and catalogue and portal pages don’t point at each other. The Web is unique in that it has millions of authors independently making decisions about what’s important; aggregating those decisions is what makes PageRank so powerful.”
“The Real Lesson of Google PageRank works on the basis of guessing that whats popular is whats important, which turns out to be a good guess. The technique relies on the Webs linkage network, and while non-Web deployments cant use that, we shouldnt give up on the notion of using a popularity metric.”(via Ongoing)
Indeed, and lowering the barriers to collaboration via corporate blogs and wikis will provide a rich corpus of popularity metrics.
In addition, I believe there is untapped potential in mining the ongoing dialogs found in email, IM and messages boards for relevancy references. I’m not saying however, that you use the dialogs themselves in search results; I’m simply suggesting that digital conversational data can be used to determine the popularity and thus importance of an object within a corporation.
Although I’m sure privacy advocates will fault with this notion.
Related Posts
- PageRank within the Enterprise -- Tim Bray’s latest On Search essay alludes to the effectiveness of Google’s PageRank within the Enterprise: “…even if it turns out that popularity [PageRank] is the key thing for Internet search, the Internet is a very special place, and its...
- Precision and Recall -- Tim Bray posts his third in the series on Search. This one is on Precision and Recall. Here are a few good quotes: “While precision and recall are very helpful in talking about how good search systems are, they are...
- Search is Commoditized -- A quote from Tim Bray’s first in a series on search technology: “All search engines work more or less the same, and offer more or less the same APIs, and provide more or less the same quality of result.” Interesting....
- The Search for Intelligent Search -- The latest in Tim Brays series on search I think this one is the best yet. Here’s a quote: “Consider what a really intelligent search engine would have to do. It would have to read an arbitrary selection of documents...
- Trust Metrics -- Mike Sugarbaker: “It’s very simple: there are some friends you’d trust with your seat at Starbucks, but far fewer you’d trust with your car keys, and even fewer with, say, your sister in a tube top.” ...
- links for 2006-02-05 -- Daisy 1.4 released “Daisy is an open source that support book-quality publishing, … software manual production and knowledge management applications” (tags: CMS opensource Open_Source Java XML Cocoon KM) The PageRank Algorithm “The PageRank algorithm is in fact elegantly simple...
- Nobody Uses Advanced Search -- Tim Bray’s second installment on search… “Every search engine has an advanced search screen, and nobody (quantitatively, less than 0.5% of users) ever goes there. This drove us nuts back at Open Text, because our engine was very structurally savvy...
About
Doh! Thanks to the “related entries” feature, I should re-read my own pervious posts. I basically said the same thing back in July in reference to another of Tim’s essays.
http://www.hatch.org/blog/2003/07/30/pagerank_within_the_enterprise.php