1 / 22

Searching the Deep Web

Searching the Deep Web. LEMA, February 2011. Deep Web Video. Surface Web: accessible via general-purpose search engines such as Google and Yahoo!. 25%. 1 trillion + Pages. 500 trillion + Pages!!. Deep Web: Not accessible via typical search engines; primarily databases. 75%.

thanos
Download Presentation

Searching the Deep Web

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Searching the Deep Web LEMA, February 2011 Deep Web Video

  2. Surface Web: accessible via general-purpose search engines such as Google and Yahoo! 25% 1 trillion + Pages 500 trillion + Pages!! Deep Web: Not accessible via typical search engines; primarily databases 75% AKA visible vs. invisible web Image from express.howstuffworks.com, 14 Feb 11

  3. The “deep web” contains … • Databases which use dynamic or temporary links • Often ?, &, CGI, other elements in the URL • Websites which aren’t indexed, by design or because there are no links to it • Deep web sites • Google limits the amount of a web site it indexes, an unpublished factor in its secret algorithm • At one point, only 110K • Formats that aren’t currently supported • Google now shows results for .pdf, .doc, .ppt  Boundary between surface and deep web always in flux as search engines incorporate more of the deep web at the same time more is being added to the deep web

  4. Deep Web: Why important? • Studies show that students’ searching habits are fairly ingrained by college • Use Google for everything • Only look at the 1st page of results • Assume trustworthiness of web sites • Rich source of in-depth material not accessible through a typical Google search  Expose students now to richer and more authoritative resources.

  5. Students need to understand …. • The best results are NOT in the top 10 • Everything’s NOT on the web • Google does NOT search the whole web • Everything’s NOT free • Everything’s NOT trustworthy • Searching/Research is NOT always easy

  6. How can we help our students be better searchers? • Introduce them to the idea that Google isn’t everything & why • Reinforce the idea of evaluating resources • Make them better “surface” searchers • Many information needs can be met with the surface web • Easy yet “advanced” Google searching techniques • Better alternatives to the “surface web” & how to effectively search these alternatives • Databases! • Familiarity with “deep” sites on a particular topic • Example: Primary materials available at Library of Congress • Example: Legislative info at thomas.loc.gov • Familiarity with portals and directories

  7. Three simple techniques to being a better Google searcher …. • Phrase searching • “xxx xxxx” • Searching the title of web pages • intitle: xxx or intitle:”xxx xxxx” • Example: intitle:”climate change” • Example: intitle:unicorn • Specifying a site • site:.xxx or site:xxx.com • egyptsite:washingtonpost.com • “climate change” site:.gov NOTE: No space after colon Lowercase commands

  8. Let’s try a site: search …. • Look for a Washington Post article on the B-52s

  9. Now let’s try a phrase search… • First, try Howard Morris as a simple keyword search -- How many hits?

  10. Now try it as a phrase “Howard Morris” • How many hits?

  11. Now let’s try an intitle: search • First, just search for “climate change” – how many hits?

  12. An intitle: search • Now try searching for “climate change” in the title of the web page – how many hits?

  13. Searching the Deep Web • LVHS Library Web Page – Deep Web link on the left • Google search for your topic and add keyword database • Ex: Plane crashes database

  14. The Deep Web: A Comparison • Using Google, search on the term metabolism • Open a separate tab, go to www.science.gov and search metabolism again • Looking at the top ten results of each, which provided generally “better” information? • How difficult/easy is it to pursue your search in related fields?

  15. Directories/Portals of Interest • Ipl2 • January 2010 • Merge of Internet Public Library and Librarians’ Internet Index • Librarians and Information Science Professionals • Hosted by Drexel University’s College of Information Science & Technology • Infomine • University-level scholarly resources • Librarian built and maintained • University of California • Virtual Private Library

  16. Other Resources • LVHS Library Web Page – Deep Web link on the left • Going Beyond Google: The Invisible Web in Learning and Teaching by Jane Devine and Francine Egger-Sider, 2009 • Not as up-to-date as web resources, but • Very focused on teaching

  17. Any questions?

More Related