
Apache Tika's all grown up! A fledgling sub-project of Lucene for two years after emerging from the incubator in 2008, Tika is spreading its wings and...
0 replies - 4663 views - 11/16/11 by David Pell in Articles

Simon Willnauer, the new PMC chair of the Apache Lucene project, recently presented at ApacheCon 2011 on the status of Lucene 4.0. So when is Lucene 4.0...
0 replies - 3492 views - 11/12/11 by Mitch Pronschinske in News

We are pleased to announce that Eric Pugh and David Smiley have published the second Solr book in publication, Apache Solr 3 Enterprise Search Server. It is an...
0 replies - 6348 views - 11/11/11 by Jason Hull in News

In the past, search indices were usually very static: you built them once, called optimize at the end and shipped them off, and didn't change them very...
0 replies - 3046 views - 11/10/11 by Michael McCandless in Articles

The 1.0 release of Apache Tika, a collection of Java libraries for the detection and extraction of structured text and metadata, has been 5 years in the making...
0 replies - 5634 views - 11/09/11 by Mitch Pronschinske in News

This post explains text processing and analytics techniques used at the startup Yieldbot. Their technology uses open source tools including Cascalog, Lucene,...
0 replies - 5693 views - 11/09/11 by Mitch Pronschinske in Articles

This talk by Andrzej Bialecki (Lucid Imagination) will present a design and implementation of a flexible, version-independent serialization format for Lucene...
0 replies - 3579 views - 11/08/11 by Mitch Pronschinske in Articles

As we wrote a few days ago, we are back from this year's Lucene Eurocon, which took place in Barcelona. Despite the fact that the videos will be available...
3 replies - 4828 views - 11/07/11 by Rafał Kuć in Articles

Last time, I described the useful SearcherManager class, coming in the next (3.5.0) Lucene release, to periodically reopen your IndexSearcher when multiple...
1 reply - 4754 views - 11/05/11 by Michael McCandless in Articles

Some have probably been expecting it for a long time, and this week it finally happened. Apache Harmony, an open source cleanroom implementation of Java, was...
4 replies - 9343 views - 11/04/11 by Mitch Pronschinske in News

Another Lucene Eurocon is history. It was a very intense two days, where the basic problem was which of the three lectures to go to. Sometimes the choice...
0 replies - 3481 views - 10/31/11 by Marek Rogoziński in News

Andrzej Bialecki from Lucid Imagination gave a great talk at Apache Lucene Eurocon 2011. Below is the summary, and I've attached the slides, which are in PDF...
0 replies - 3201 views - 10/29/11 by Mitch Pronschinske in News

Thanks in no small part to Lucene, quality keyword search is easily obtainable. Likewise, tools like Apache Hadoop and its ecosystem have made it easier to...
0 replies - 4252 views - 10/29/11 by Grant Ingersoll in Articles

This month’s Wired Magazine features a story on the roots of Hadoop at Yahoo and the three companies vying to drive its commercial frontiers farther...
0 replies - 7224 views - 10/28/11 by David Fishman in Articles

This talk on using Hadoop and Solr together for a NoSQL-like result was given by Ken Krugler, a friend of DZone who wrote the amazingly popular article, Solr +...
0 replies - 5692 views - 10/28/11 by Mitch Pronschinske in Videos