Mitch Pronschinske is a Senior Content Analyst at DZone. That means he writes and searches for the finest developer content in the land so that you don't have to. He often eats peanut butter and bananas, likes to make his own ringtones, enjoys card and board games, and is married to an underwear model. Mitch is a DZone Zone Leader and has posted 2574 posts at DZone. You can read more from them at their website. View Full User Profile

Improving Solr's Update Chain

  • submit to reddit

Improving Solr's Update Chain, Jan Høydahl, Cominvent AS, Eurocon 2011 from Lucene Revolution on Vimeo.

Solr features a little known internal document processing pipeline called the UpdateRequestProcesssorChain or simply the UpdateChain.

In this talk we'll discuss the importance of document processing, when the UpdateChain works well and what limitations it's got. We'll then go on to propose a range of possible improvements.

Topics include:

  • Examples of use with demo
  • How to write your own UpdateProcessor, best practices
  • Example: Tika as an UpdateProcessor
  • A vision for future improvements

Download session slides.