Difference between revisions of "Basics of Search Engines and their History"

Line 3: Line 3:
 
Presented by Mike Cafarella
 
  
:*J. Kleinberg, and S. Lawrence. The structure of the Web, Science 294, 1849-1850, November 2001. The paper which addresses the overall structure of the web with "core", "in", "out", and "other" sections. http://www.cs.washington.edu/education/courses/cse522/CurrentQtr/kleinberg_structure_of_the_web.pdf
+
'''Before Class''':
:*[http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO1&Sect2=HITOFF&d=PALL&p=1&u=/netahtml/srchnum.htm&r=1&f=G&l=50&s1=6,285,999.WKU.&OS=PN/6,285,999&RS=PN/6,285,999 PageRank patent].
+
:* Read
 +
::J. Kleinberg, and S. Lawrence. The structure of the Web, Science 294, 1849-1850, November 2001. The paper which addresses the overall structure of the web with "core", "in", "out", and "other" sections. http://www.cs.washington.edu/education/courses/cse522/CurrentQtr/kleinberg_structure_of_the_web.pdf
 +
::*http://www.wiley.com/legacy/compbooks/sonnenreich/history.html.  This is a useful history of search engines.
 +
::The introduction to *Jon Kleinberg. <i>Authoritative sources in a hyperlinked environment</i>. 1999. Journal of the ACM v. 46(5). http://www.cs.cornell.edu/home/kleinber/auth.pdf.
 +
 
 +
 
 +
 
 +
'''Other Resources''':
 +
:*Sergey Brin and Lawrence Page. 1998. <i>The anatomy of a large-scale hypertextual Web search engine</i>. Computer Networks and ISDN Systems v. 30. http://www-db.stanford.edu/pub/papers/google.pdf. The original paper describing PageRank.
 
:*http://www.iprcom.com/papers/pagerank/. Another explanation of PageRank.
 
 +
:*A. Altman and M. Tennenholtz. Ranking Systems: The PageRank Axioms, ACM Conference on Electronic Commerce, 1-8, 2005. A paper which shows that under certain assumptions PageRank is the only ranking allowed. http://www.cs.washington.edu/education/courses/cse522/CurrentQtr/pagerankaxioms.pdf
 
:*"A History of Online Information Services, 1963-1976" by Bourne and Hahn.  I (Mike Cafarella) have this book and can lend it out if you're interested in primordial search engines.
 
:*http://www.wiley.com/legacy/compbooks/sonnenreich/history.html.  This is a useful history of search engines.
+
:*[http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO1&Sect2=HITOFF&d=PALL&p=1&u=/netahtml/srchnum.htm&r=1&f=G&l=50&s1=6,285,999.WKU.&OS=PN/6,285,999&RS=PN/6,285,999 PageRank patent].
:'''For the more technically inclined''':
 
:*Sergey Brin and Lawrence Page. 1998. <i>The anatomy of a large-scale hypertextual Web search engine</i>. Computer Networks and ISDN Systems v. 30. http://www-db.stanford.edu/pub/papers/google.pdf. The original paper describing PageRank.
 
:*Jon Kleinberg. <i>Authoritative sources in a hyperlinked environment</i>. 1999. Journal of the ACM v. 46(5). http://www.cs.cornell.edu/home/kleinber/auth.pdf. The introduction is a good read if you don't want to read all of it.
 
:*A. Altman and M. Tennenholtz. Ranking Systems: The PageRank Axioms, ACM Conference on Electronic Commerce, 1-8, 2005. A paper which shows that under certain assumptions PageRank is the only ranking allowed. http://www.cs.washington.edu/education/courses/cse522/CurrentQtr/pagerankaxioms.pdf
 

Revision as of 00:21, 8 April 2006

Basics of search engine algorithms (web crawling, building indexes, etc.). How has the field evolved? How have the services that search engines provide changed through the years, from WebCrawler to Inktomi to Yahoo!? How does Google's PageRank work? How can the link structure of the WWW be used to identify authoritative sources for user queries?

Presented by Mike Cafarella

Before Class:

  • Read
J. Kleinberg and S. Lawrence. The structure of the Web, Science 294, 1849-1850, November 2001. This paper addresses the overall structure of the web, which it divides into "core", "in", "out", and "other" sections. http://www.cs.washington.edu/education/courses/cse522/CurrentQtr/kleinberg_structure_of_the_web.pdf
The introduction to Jon Kleinberg. Authoritative sources in a hyperlinked environment. 1999. Journal of the ACM v. 46(5). http://www.cs.cornell.edu/home/kleinber/auth.pdf.


Other Resources: