Mark Miller is a member of the Lucene PMC, a Lucene Committer and a Solr Committer. He's most recently been leading work on Solr Cloud and per-segment searching. Mark has posted 3 posts at DZone. You can read more from them at their website. View Full User Profile

Lucene/Solr 3.5 Released! See the Highlights

11.28.2011
| 5860 views |
  • submit to reddit

The Lucene PMC is pleased to announce the release of Apache Lucene 3.5.0.

 

Apache Lucene™ 3.5.0 available

Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires full-text search, especially cross-platform.

 

This release contains numerous bug fixes, optimizations, and

improvements, some of which are highlighted below.  The release

is available for immediate download at:

 

http://www.apache.org/dyn/closer.cgi/lucene/java (see note below).

 

See the CHANGES.txt file included with the release for a full list of

details.

 

Lucene 3.5.0 Release Highlights:

 

* Added a very substantial (3-5X) RAM reduction required to hold the

terms index on opening an IndexReader. (LUCENE-2205)

 

* Added IndexSearcher.searchAfter which returns results after a

specified ScoreDoc (e.g. last document on the previous page) to

support deep paging use cases. (LUCENE-2215)

 

* Added SearcherManager to manage sharing and reopening IndexSearchers

across multiple search threads. Underlying IndexReader instances are

safely closed if not referenced anymore. (LUCENE-3445, LUCENE-3558)

 

* Added SearcherLifetimeManager which safely provides a consistent

view of the index across multiple requests (e.g. paging/drilldown).

(LUCENE-3558, LUCENE-3486)

 

* Renamed IndexWriter.optimize to forceMerge to discourage use of

this method since it is horribly costly and rarely justified

anymore. (LUCENE-3439)

 

* Added NGramPhraseQuery that speeds up phrase queries 30-50%

when n-gram analysis is used. (LUCENE-3426)

 

* Added a new reopen API (IndexReader.openIfChanged) that

returns null instead of the old reader if there are no changes

in the index. (LUCENE-3464)

 

* Improvements to vector highlighting: support for more queries

such as wildcards and boundary analysis for generated snippets

(LUCENE-1824, LUCENE-1889)

 

* IndexSearcher and IndexReader now perform additional checks to

throw AlreadyClosedExceptions if searches are performed on a

closed IndexReader. Performing searches on already closed reader

can cause JVM crashes when invalid memory mapped files are

referenced.

 

* Several bugfixes, including a bug where closing an NRT reader

after the writer was closed was incorrectly invoking the

DeletionPolicy. See CHANGES.txt entries for full details.

 

 

Apache Solr™ 3.5.0 available

The Lucene PMC is pleased to announce the release of Apache Solr 3.5.0.

 

Solr is the popular, blazing fast open source enterprise search platform from

the Apache Lucene project. Its major features include powerful full-text

search, hit highlighting, faceted search, dynamic clustering, database

integration, rich document (e.g., Word, PDF) handling, and geospatial search.

Solr is highly scalable, providing distributed search and index replication,

and it powers the search and navigation features of many of the world’s

largest internet sites.

 

This release contains numerous bug fixes, optimizations, and

improvements, some of which are highlighted below.  The release

is available for immediate download at:

http://www.apache.org/dyn/closer.cgi/lucene/solr (see note below).

 

See the CHANGES.txt file included with the release for a full list of

details.

 

Solr 3.5.0 Release Highlights:

 

* Bug fixes and improvements from Apache Lucene 3.5.0, including a

very substantial (3-5X) RAM reduction required to hold the terms

index on opening an IndexReader. (LUCENE-2205)

 

* Added support for distributed result grouping. (SOLR-2066,

SOLR-2776)

 

* Added support for Hunspell stemmer TokenFilter supporting stemming

for 99 languages. (SOLR-2769)

 

* A new contrib module “langid” adds language identification

capabilities as an Update Processor, using Tika’s

LanguageIdentifier or Cybozu language-detection library (SOLR-1979)

 

* Numeric types including Trie and date types now support

sortMissingFirst/Last. (SOLR-2881)

 

* Added hl.q parameter. It is optional and if it is specified, it overrides

q parameter in Highlighter. (SOLR-1926)

 

* Several minor bugfixes like date parsing for years from 0001-1000, ignored

configurations when using QueryAnalyzer with SpellCheckComponent

and many more.

See CHANGES.txt entries for full details.

 

 

Note: The Apache Software Foundation uses an extensive mirroring network for

distributing releases.  It is possible that the mirror you are using may not

have replicated the release yet.  If that is the case, please try another

mirror.  This also goes for Maven access.

 

Happy searching,

 

Apache Lucene/Solr Developers

References
Published at DZone with permission of its author, Mark Miller. (source)

(Note: Opinions expressed in this article and its replies are the opinions of their respective authors and not those of DZone, Inc.)

Tags:

Comments

John David replied on Wed, 2012/01/25 - 7:08pm

Wow lot of new things are offered in the Lucene 3.5.

What Is Apache Lucene?

The Apache Lucene™ project develops open-source search software, including:

    Apache Lucene Core™ (formerly named Lucene Java), our flagship sub-project, provides a Java-based indexing and

search implementation, as well as spellchecking, hit highlighting and advanced analysis/tokenization capabilities.
    Apache Solr™ is our high performance enterprise search server, with XML/HTTP and JSON/Python/Ruby APIs, hit

highlighting, faceted search, caching, replication, distributed search, database integration, web admin and search

interfaces.
    Apache PyLucene™ is a Python port of the the Lucene Core project.
    Apache Open Relevance Project™ is a subproject with the aim of collecting and distributing free materials for

relevance testing and performance.

new java

Carla Brian replied on Sat, 2012/06/02 - 11:37am

This software is is a high-performance, full-featured text search library. Solr is a standalone search server that uses Lucene at its core for indexing and search. - James Stuckey

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.