Introduction to Apache Lucene/Solr

CSCI 572: Information Retrieval and Search Engines

Summer 2010

Outline
  • What is Lucene/Solr?
  • Where did it come from?
  • What are the current versions of Lucene/Solr?
  • What can it do?
Apache Lucene
  • The brainchild of Doug Cutting
  • Free-text indexing library that implements most of the functionality I’ve talked to you about
    • Query Models, Ranking, Indexing
  • Core API is implemented in Java
    • C/C++, Ruby, and Python ports exist as well, but they have small communities or are automatically generated
  • Initially hosted on SourceForge, moved to Apache in 2001
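
To make "Query Models, Ranking, Indexing" concrete, here is a minimal sketch of an inverted index with TF-IDF ranking. It is written in Python for brevity and is not Lucene's API or its actual scoring formula; `ToyIndex`, `tokenize`, and the sample documents are hypothetical names and data made up for illustration.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive analyzer)."""
    return text.lower().split()


class ToyIndex:
    """Toy inverted index with TF-IDF scoring; NOT Lucene's actual API."""

    def __init__(self):
        self.postings = defaultdict(dict)  # term -> {doc_id: term frequency}
        self.doc_count = 0

    def add(self, doc_id, text):
        """Indexing: record how often each term occurs in the document."""
        self.doc_count += 1
        for term, tf in Counter(tokenize(text)).items():
            self.postings[term][doc_id] = tf

    def search(self, query):
        """Querying and ranking: sum tf * idf over the query terms."""
        scores = defaultdict(float)
        for term in tokenize(query):
            docs = self.postings.get(term, {})
            if not docs:
                continue
            idf = math.log(1 + self.doc_count / len(docs))
            for doc_id, tf in docs.items():
                scores[doc_id] += tf * idf
        # Highest score first; ties are broken by doc_id for stable output.
        return sorted(scores, key=lambda d: (-scores[d], d))


index = ToyIndex()
index.add("d1", "Lucene is a search library")
index.add("d2", "Solr is a search server built on Lucene")
index.add("d3", "Tomcat is a servlet container")
ranked = index.search("lucene search")
print(ranked)
```

Lucene itself adds much more on top of this idea (analyzers, field-level indexing, a richer query language, on-disk segments), so treat the sketch only as a picture of the core data structure.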
Apache Solr
  • Originally developed at CNET
  • Web service layer built on top of the Lucene library
  • Provides a schema and an understanding of field types, with conversion to and from their internal representation
  • Scales to very large deployments; runs on top of an application server like Tomcat or Jetty
  • Programming-language-independent APIs
  • Sharding, replication, faceting, highlighting, explain, More Like This, and other functionality are provided out of the box
How to get started
  • Lucene (2.9.2 and 3.0.1 stable)
    • Put your Java hat on
    • Have Eclipse ready or your favorite IDE
    • Download lucene-core-<version>.jar from
    • Download src and build from
    • Check out some example Java code that demonstrates indexing and querying from Otis Gospodnetic
How to get started
  • Solr
    • Grab a release of Solr (1.4.0 stable)
    • Unpack into e.g., /usr/local/solr
    • Deploy onto Tomcat
      • Install Tomcat into /usr/local/tomcat
      • Create a solr.xml context file and drop it into /usr/local/tomcat/conf/Catalina/localhost/
        • Define the solr/home JNDI entry in it and point it to /usr/local/solr/solr
      • Start Tomcat
    • Head over to $solr/example/exampledocs and post a document to the update handler
      • curl http://localhost:8983/solr/update -H 'Content-type:text/xml; charset=utf-8' --data-binary @artists.xml
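
The same post can be scripted. Below is a minimal Python sketch that builds the equivalent HTTP request with the standard library. The URL and Content-type header are copied from the curl command; the helper name `build_update_request` and the sample `<add>` document are assumptions made up for illustration, and the port should be changed to whatever your container listens on. Posted documents generally only become searchable after a commit, so a `<commit/>` request is built as well.

```python
import urllib.request

# URL and header are taken from the curl command above; adjust the port
# to match your own container (this value is not verified for Tomcat).
SOLR_UPDATE_URL = "http://localhost:8983/solr/update"


def build_update_request(xml_bytes, url=SOLR_UPDATE_URL):
    """Build (but do not send) a POST of XML to Solr's update handler."""
    return urllib.request.Request(
        url,
        data=xml_bytes,
        headers={"Content-type": "text/xml; charset=utf-8"},
        method="POST",
    )


# Hypothetical document body; a real one must match the fields in schema.xml.
add_doc = b'<add><doc><field name="id">artist-1</field></doc></add>'
add_request = build_update_request(add_doc)
commit_request = build_update_request(b"<commit/>")

# To send them against a running Solr instance, uncomment:
# urllib.request.urlopen(add_request)
# urllib.request.urlopen(commit_request)
```

The requests are only constructed here, so the snippet runs without a Solr server; sending them is left commented out.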
Modifying your schema.xml
  • Field Types
  • Analyzers
  • Tokenizers
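
In schema.xml, a field type names an analyzer, which is typically a tokenizer followed by a chain of token filters. The Python sketch below only illustrates that pipeline idea; it is not Solr's implementation, and every function name and the stop-word list are hypothetical.

```python
import re

STOP_WORDS = {"a", "an", "and", "of", "the", "to"}  # illustrative list only


def whitespace_tokenizer(text):
    """Tokenizer: split raw field text into tokens."""
    return text.split()


def lowercase_filter(tokens):
    """Filter: normalize case so 'Lucene' and 'lucene' match."""
    return [t.lower() for t in tokens]


def strip_punctuation_filter(tokens):
    """Filter: trim leading/trailing non-word characters, drop empty tokens."""
    cleaned = [re.sub(r"^\W+|\W+$", "", t) for t in tokens]
    return [t for t in cleaned if t]


def stop_filter(tokens):
    """Filter: drop very common words that carry little meaning."""
    return [t for t in tokens if t not in STOP_WORDS]


def analyze(text, tokenizer, filters):
    """Analyzer: run the tokenizer, then each filter in order."""
    tokens = tokenizer(text)
    for token_filter in filters:
        tokens = token_filter(tokens)
    return tokens


terms = analyze(
    "The Brainchild of Doug Cutting.",
    whitespace_tokenizer,
    [lowercase_filter, strip_punctuation_filter, stop_filter],
)
print(terms)
```

The order of the filters matters: lowercasing before the stop filter is what lets "The" match the lowercase stop word.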

Solr Faceting
  • facet=on&facet.field=&facet.field=…
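
A faceted request is an ordinary search URL with `facet=on` and one `facet.field` parameter per field to count. The Python sketch below builds such a URL; the base URL, the match-all query `*:*`, and the field names `genre` and `country` are assumptions for illustration, not fields from any schema in this talk.

```python
from urllib.parse import parse_qs, urlencode, urlsplit


def build_facet_url(base_url, query, facet_fields):
    """Build a Solr select URL with one facet.field parameter per field."""
    params = [("q", query), ("facet", "on")]
    params += [("facet.field", field) for field in facet_fields]
    return base_url + "?" + urlencode(params)


# Hypothetical field names; use fields defined in your own schema.xml.
url = build_facet_url(
    "http://localhost:8983/solr/select",
    "*:*",
    ["genre", "country"],
)
print(url)

# Round-trip the query string to show the repeated facet.field parameter.
parsed = parse_qs(urlsplit(url).query)
print(parsed["facet.field"])
```

A list of tuples is passed to `urlencode` (rather than a dict) so the `facet.field` key can appear more than once.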
Advanced Topics
  • Standing up cores
  • Sharding
  • Replication
  • Zookeeper and Cloud
Development currently in flux
  • Stick with release versions
  • Depending on trunk won’t really help
  • Lucene and Solr have merged
  • Lots more information at
  • Possible projects
    • Geospatial search
      • Improving existing code and contributing back to Apache SIS and to Apache Solr
    • Improving date faceting
    • Rewriting the ResponseWriter framework
  • Material inspired by discussions and talks on the Apache mailing lists for Solr and Lucene, and by discussions with the rest of the Lucene community