Lucene and Solr: The Merger
This article is part of the DZone Solr-Lucene Zone, which is brought to you in collaboration with the Solr/Lucene Community. Visit the Solr-Lucene Zone for additional tutorials, videos, opinions, and other resources on this topic. If you take a look on the Solr wiki to find out when the next version is going to be released (you'd assume it's version 1.5), you find a message saying that Solr 1.5 is "not likely to ever be released." Before you start crying, you need to look at the slides from Yonik Seeley's presentation at the Apache Lucene EuroCon 2010 (Seeley is the creator of Apache Solr and a co-founder at Lucid Imagination). If you do, you'll see that Solr and its partner project, Lucene, are merging development. It's possible that the next version of Solr will be named 3.1 or 4.0 to match Lucene.
The two pieces of software will remain separate entities, but they will be developed together. Both projects currently sit under the same Subversion trunk and Solr will actually run off the latest Lucene code at all times. One of the reasons that precipitated this change was the adding of certain features to Solr, for example, while they arguably "belonged" to Lucene. Moving this code to Lucene would bar some Solr committers from having access to it. Another fear is diverging, yet similar code because the pace of releases is beginning to change for both projects.
With a merged development, there is a single set of committers that can access any Lucene or Solr code. The two projects will have releases at the same time. Mark Miller of Lucid Imagination lists several more advantages:
Other benefits of the merger:
About Solr
Solr takes Lucene and packages it up as an HTTP server. It runs as a web application inside a servlet container such as Tomcat or Jetty, providing the functionality of Lucene as well as other search capabilities such as faceting. The Lucene functionalities handle distributed search capabilities along with replication for failover and load balancing. Solr also provides easy configuration through XML.
Grant Ingersoll, a Solr and Lucene committer, blogged about the exciting new features planned for the next version of Solr.
Search functionality is not something you should take lightly when building a website or application. The poorly-performing search utilities on various websites are a constant reminder of this. However, search is not just Google's game anymore. When a Java library called Lucene was introduced into the Apache ecosystem, and then Solr was built on top of that, open source developers began to wield some serious power when it came to customizing search features. It's time to enter the Solr-Lucene Microzone, supported by Lucid Imagination, to learn how to harness the full potential of search. Lucid Imagination's enterprise Solr development platform—LucidWorks Enterprise—can make your production process simpler and more cost-efficient with open source configurability.
The two pieces of software will remain separate entities, but they will be developed together. Both projects currently sit under the same Subversion trunk and Solr will actually run off the latest Lucene code at all times. One of the reasons that precipitated this change was the adding of certain features to Solr, for example, while they arguably "belonged" to Lucene. Moving this code to Lucene would bar some Solr committers from having access to it. Another fear is diverging, yet similar code because the pace of releases is beginning to change for both projects.
With a merged development, there is a single set of committers that can access any Lucene or Solr code. The two projects will have releases at the same time. Mark Miller of Lucid Imagination lists several more advantages:
Lucene will likely benefit from Analyzers and QueryParsers that were only available to Solr users in the past. Lucene will also benefit from greater test coverage, as now you can make a single change in Lucene and run tests for both projects – getting immediate feedback on the change by testing an application that extensively uses the Lucene libraries. Both projects will also gain from a wider development community, as this change will foster more cross pollination between Lucene and Solr devs (now just Lucene/Solr devs).
Other benefits of the merger:
- R
eleases will be streamlined - Solr users get the latest improvements from Lucene faster, and vice versa
- Lucene users will get access to Solr's faceting feature
- Minimized duplication of features
- Combined Solr/Lucene test suit should kill a lot of bugs
- More committers means more ideas and development
About Solr
Solr takes Lucene and packages it up as an HTTP server. It runs as a web application inside a servlet container such as Tomcat or Jetty, providing the functionality of Lucene as well as other search capabilities such as faceting. The Lucene functionalities handle distributed search capabilities along with replication for failover and load balancing. Solr also provides easy configuration through XML.
Grant Ingersoll, a Solr and Lucene committer, blogged about the exciting new features planned for the next version of Solr.
- Login or register to post comments
- 10564 reads
- Printer-friendly version
(Note: Opinions expressed in this article and its replies are the opinions of their respective authors and not those of DZone, Inc.)




Comments
Jon Davis replied on Thu, 2010/08/12 - 11:33pm