Recently we had a chance to help with a non-commercial project which
included search as one of its parts. One of the assumptions, although not
a key one, was...
1 reply - 6022 views - 02/21/12 by Rafał Kuć in Articles
Two popular methods of indexing existing data are the Data Import
Handler (DIH) and Tika (Solr Cell)/ExtractingRequestHandler. These can
be used to index...
1 reply - 8548 views - 02/15/12 by Erick Erickson in Articles
One of the features of the latest Solr version (3.5)
is the ability to identify the language of a document during
indexing. In today's entry we...
0 replies - 5145 views - 02/06/12 by Rafał Kuć in Articles
The 1.0 release of Apache Tika, a collection of Java libraries for the detection and extraction of structured text and metadata, has been 5 years in the making...
0 replies - 6376 views - 11/09/11 by Mitch Pronschinske in News
To get a sense of the accuracy and performance of Google's Compact
Language Detector, I ran some tests against two other packages:
1 reply - 6433 views - 10/26/11 by Michael McCandless in Articles
In 2006, Solr was donated to the Apache Software Foundation and integrated into the Lucene project. Apache Solr is an enterprise search platform that powers the...
2 replies - 11691 views - 11/16/09 by Mitch Pronschinske in Articles