1 / 32

“ Leveraging SharePoint 2010 Search Technologies ”

“ Leveraging SharePoint 2010 Search Technologies ”. With: Ivan Neganov. Sponsors. Agenda. Open Discussion Topic of the day QA. Leveraging SharePoint 2010 Search Technologies. Mississauga SharePoint User Group, October 19, 2010. About the Speaker. Ivan Neganov

fynn
Download Presentation

“ Leveraging SharePoint 2010 Search Technologies ”

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. “Leveraging SharePoint 2010 Search Technologies” With: Ivan Neganov

  2. Sponsors

  3. Agenda • Open Discussion • Topic of the day • QA

  4. Leveraging SharePoint 2010 Search Technologies Mississauga SharePoint User Group, October 19, 2010

  5. About the Speaker Ivan Neganov Founder of SoftForte, Inc. 11 years of experience in developing WCM solutions based on ASP.NET and SharePoint platforms. Focusing on SharePoint since 2007. Blog: neganov.blogspot.com the Science of Quality

  6. Agenda • Enterprise Search defined • Common search concepts and terms • Search architecture • SharePoint search technologies

  7. What is Enterprise Search • Why not use Google Appliance aka “Google Box”? • Why not use open source engine like Lucene? • Why SharePoint search isn’t enough? • Do I need taxonomy & faceted search? • Can users just go ahead and tag everything?

  8. Enterprise is not just a large Intranet • Large volumes of data • Usually there exists a “right” or highly relevant document • Security is critical • Taxonomies and vocabularies are important • Dates are important • Corporate data does have structure • Search is convenient for surfacing content • Search is promising for future BI applications

  9. Search Scenarios • Two types of scenarios in an enterprise: • Productivity search • Intranet/team collaboration search • People search/Social computing • Site search • Search applications • Parts search (fuzzy search requirement) • Intelligence & Investigation (heavy use of entity extraction) • IP protection • Compliance/Records management • E-commerce • Knowledge management & Support • BI applications

  10. Microsoft Search Technologies • Desktop search, successor of Index Server • SQL Server Search – Full Text Search (FTS) • Exchange Search – uses same iFilters as SharePoint • Bing (formerly live search) • Bing + Yahoo = 9.5% • SharePoint & FAST Search

  11. SharePoint 2010 Search Technologies • Microsoft SharePoint Foundation (Free) • Single site collection, 10 million items • No external search • Automatic configuration • Microsoft Search Server 2010 Express (Free) • Enterprise-level search, 10 million items but single search server only • No people search • Microsoft Search Server 2010 • Enterprise-level, redundancy support, 100 million items • No people search • Microsoft SharePoint Server 2010 • 100 million items, added people search, tagging • Microsoft FAST Search Server for SharePoint • Over 200 million items • Improved and flexible relevancy • Entity extraction • Microsoft FAST ESP Server • Advanced entity extraction • Standalone product

  12. Relevancy • Google: PageRank algorithm • Same approach is used in FAST and SharePoint 2010 • FAST provides ability to dynamically boost rank

  13. Index

  14. Linguistics • Word stemming • Word lemmatization • Word morphology • Collapsing indices

  15. Other Common Search Concepts • Crawling • Querying • Crawled & Managed Properties • Best Bets • Refiners aka Facets • Linguistics: Stemma & Lemma • Entity Extraction

  16. High Level Search Architecture

  17. Demo: Search Experience

  18. FAST Search Server 2010 for SharePoint • Advanced scalability & performance • Advanced content processing • Extensibility FAST Content Processing Pipeline:

  19. FAST ESP • Essentially re-packaged FAST ESP 5.3 • Planned two SKUs (according to SPC 2009) • FAST Search Server for Internet Sites • Fast Search Server for Internal Applications • Updates?

  20. Planning Enterprise Search • Search is redundant and scalable

  21. Planning FAST Search

  22. Which Search Technology Is Appropriate? • FAST Search Server requires enterprise CALs

  23. Estimating Costs

  24. Search UI • Search Web Parts • Search Center • Thick clients

  25. Extending Search • Federation - OpenSearch • Query Object Model • BCS Connectors • RANK & XRANK • Tapping in Document Processing Pipeline

  26. Federation

  27. Demo: Search Federation

  28. Connector Framework • Leverage tooling (SPD, VS2010)

  29. Entity Extraction in FAST • Automatically create crawled properties for a given vocabulary • Useful for advanced scenarios: for example 1. Extract property at crawl time, 2. Enrich a property 3. Index enriched property

  30. Search in the Enterprise: Future • Amount of content will continue to grow • Search will integrate with Business Intelligence applications • Entity, Sentiment and Fact extraction • Search as navigation • Search visualization • Search as a service • Many more custom applications leveraging search

  31. Resources • Microsoft Technet, MSDN • Professional Microsoft Search 2010

  32. Questions

More Related