Difference between revisions of "Basics of Search Engines and their History"

From PublicWiki
 
(6 intermediate revisions by the same user not shown)
Line 1: Line 1:
 
Basics of search engine algorithms (web crawling, building indexes, etc.). How has the field evolved? How have the services that search engines provide changed through the years? From WebCrawler to Inktomi to Yahoo!. How does Google's PageRank work? The use of WWW link structure to identify authoritative sources for user queries.
 
 
    
 
    
Presented by Mike Cafarella
+
'''Presented by Mike Cafarella''' [http://www.cs.washington.edu/homes/mjc/ homepage]
 +
 
  
 
'''Before Class''':
 
Line 7: Line 8:
 
::*A useful history of search engines: http://www.wiley.com/legacy/compbooks/sonnenreich/history.html   
 
 
::*Wikipedia's article on search engines: http://en.wikipedia.org/wiki/Search_engines
 
::*(2 pages) J. Kleinberg, and S. Lawrence. The structure of the Web, Science 294, 1849-1850, November 2001. The paper which addresses the overall structure of the web with "core", "in", "out", and "other" sections. http://www.cs.washington.edu/education/courses/cse522/CurrentQtr/kleinberg_structure_of_the_web.pdf
+
::*(~4 pages) The '''introduction''' to Jon Kleinberg. <i>Authoritative sources in a hyperlinked environment</i>. 1999. Journal of the ACM v. 46(5). http://www.cs.cornell.edu/home/kleinber/auth.pdf.
::*(~4 pages) The introduction to Jon Kleinberg. <i>Authoritative sources in a hyperlinked environment</i>. 1999. Journal of the ACM v. 46(5). http://www.cs.cornell.edu/home/kleinber/auth.pdf.
+
::*(2 pages) Addresses the overall structure of the web: J. Kleinberg, and S. Lawrence. The structure of the Web, Science 294, 1849-1850, November 2001. http://www.cs.washington.edu/education/courses/cse522/CurrentQtr/kleinberg_structure_of_the_web.pdf
 
:* '''If you're a CSE student OR technically brave then read'''
 
::*Sergey Brin and Lawrence Page. 1998. <i>The anatomy of a large-scale hypertextual Web search engine</i>. Computer Networks and ISDN Systems v. 30. http://www-db.stanford.edu/pub/papers/google.pdf. The original paper describing PageRank.
+
::*The original paper describing PageRank: Sergey Brin and Lawrence Page. 1998. <i>The anatomy of a large-scale hypertextual Web search engine</i>. Computer Networks and ISDN Systems v. 30. http://www-db.stanford.edu/pub/papers/google.pdf.  
:::::OR
+
::::::::'''OR'''
 
::*Finish the Kleinberg article
 
  

Latest revision as of 19:12, 9 April 2006

Basics of search engine algorithms (web crawling, building indexes, etc.). How has the field evolved? How have the services that search engines provide changed through the years? From WebCrawler to Inktomi to Yahoo!. How does Google's PageRank work? The use of WWW link structure to identify authoritative sources for user queries.

Presented by Mike Cafarella (homepage: http://www.cs.washington.edu/homes/mjc/)


Before Class:

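  • Read (suggested in this order)
      • A useful history of search engines: http://www.wiley.com/legacy/compbooks/sonnenreich/history.html
      • Wikipedia's article on search engines: http://en.wikipedia.org/wiki/Search_engines
      • (~4 pages) The introduction to Jon Kleinberg. Authoritative sources in a hyperlinked environment. 1999. Journal of the ACM v. 46(5). http://www.cs.cornell.edu/home/kleinber/auth.pdf
      • (2 pages) Addresses the overall structure of the web: J. Kleinberg and S. Lawrence. The structure of the Web, Science 294, 1849-1850, November 2001. http://www.cs.washington.edu/education/courses/cse522/CurrentQtr/kleinberg_structure_of_the_web.pdf
  • If you're a CSE student OR technically brave then read
      • The original paper describing PageRank: Sergey Brin and Lawrence Page. 1998. The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems v. 30. http://www-db.stanford.edu/pub/papers/google.pdf
        OR
      • Finish the Kleinberg article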
  • Otherwise, read...

Other Resources: