Mitch Pronschinske is a Senior Content Analyst at DZone. That means he writes and searches for the finest developer content in the land so that you don't have to. He often eats peanut butter and bananas, likes to make his own ringtones, enjoys card and board games, and is married to an underwear model. Mitch is a DZone Zone Leader and has posted 2573 posts at DZone.

Solr as an Analytics Engine - A Key Theme at this Week's Lucene Revolution

The de facto Lucene and Solr conference, Lucene Revolution, is underway today in Boston, Massachusetts. I believe it's the third annual US edition of the conference. A European edition, called Lucene EuroCon, was held for the first time last October. The conferences have been very popular among the growing number of developers who are being asked to build Lucene/Solr into their companies' software. If you're missing it right now, there's no need to worry: videos from the sessions should be shared soon enough.

Some of the most promising sessions cover:

  • A walkthrough of using Wikipedia's data dump for testing via the wikipedia_Solr and wikipedia_elasticsearch repositories
  • How SolrCloud Changes the User Experience in a Sharded Environment
  • An overview of finite-state technology and how it's used today in Lucene: synonym filtering, fuzzy queries, respelling/suggesting, the terms dictionary, the in-memory postings format (MemoryPostingsFormat), and Japanese analysis (Kuromoji analyzer)
  • Integrating Lucene into a transactional XML database (xDB)
  • Solr for Analytics: integrating text analytics components into Solr, adjusting the Solr schema, and making UI-level changes that support the integration of structured and unstructured data from several sources
  • Building Query Auto-Completion Systems with Lucene 4.0
  • Plenty of case studies of Solr/Lucene in use at innovative software companies

It's amazing how many things you can use Solr and Lucene for. I'm particularly interested in Solr as an analytics engine.