Recently we had a change to help with a non-commercial project which
included search as its part. One of the assumptions, although not the
key ones, was...
1 replies - 6382 views - 02/21/12 by Rafał Kuć in Articles
Two popular methods of indexing existing data are the Data Import
Handler (DIH) and Tika (Solr Cell)/ExtractingRequestHandler. These can
be used to index...
1 replies - 9745 views - 02/15/12 by Erick Erickson in Articles
One of the features of the latest Solr version (3.5)
is the ability to identify the language of the document during its
indexation. In today's entry we...
0 replies - 5400 views - 02/06/12 by Rafał Kuć in Articles
The 1.0 release of Apache Tika, a collection of Java libraries for the detection and extraction of structured text and metadata, has been 5 years in the making...
0 replies - 6600 views - 11/09/11 by Mitch Pronschinske in News
To get a sense of the accuracy and performance of Google's Compact
Language Detector, I ran some tests against two other packages:
1 replies - 6691 views - 10/26/11 by Michael Mccandless in Articles
In 2006, Solr was donated to the Apache Foundation and integrated into the Lucene project. Apache Solr is an enterprise search platform that powers the...
2 replies - 11850 views - 11/16/09 by Mitch Pronschinske in Articles