Hadoop is an open source project for processing large datasets in parallel using low-end commodity machines. Hadoop
is built on two main parts. An...
0 replies - 51171 views - 03/18/12 by Carlo Scarioni in Articles
Blur is a new Apache 2.0 licensed software project that provides a
search capability built on top of Hadoop and Lucene. Elastic Search and...
0 replies - 7679 views - 03/16/12 by Scott Leberknight in Articles
Context-less, multi-term autocomplete is difficult.
Given the term "di", we can look at our index and rank terms starting
with "di" by...
0 replies - 5699 views - 03/16/12 by Kelvin Tan in Articles
In the past, Lucene recorded only the bare minimal aggregate index
statistics necessary to support its
vector space scoring...
0 replies - 7475 views - 03/15/12 by Michael Mccandless in Articles
Solr 3.5 comes with a nice autocomplete/typeahead component that is based on the SolrSpellCheckComponent.
You provide it a query and a field, and the...
0 replies - 6587 views - 03/07/12 by Kelvin Tan in Articles
Many users don't appreciate the transactional semantics of Lucene's
APIs and how this can be useful in search applications. For starters, Lucene...
1 reply - 6153 views - 03/02/12 by Michael Mccandless in Articles
Here’s a brief step-by-step guide to running more than one instance of Tomcat on a single machine.
Step 1: Install the Tomcat files
2 replies - 18784 views - 02/29/12 by Kief Morris in Articles
0 replies - 20172 views - 02/28/12 by Chris Smith in Articles
Or “How to never re-use cached filter query results even though you meant to”:
Filter queries (“fq” clauses) are a means to restrict the number of...
0 replies - 9824 views - 02/27/12 by Erick Erickson in Articles
I am trying to put together a framework for search quality evaluation for a specialist information provider.
At the moment quality is measured by counting the...
1 reply - 3656 views - 02/26/12 by Tony Russell-rose in Articles
Yonik recently wrote about “Advanced Filter Caching in Solr” where
he talked about expensive and custom filters; it was left as an
exercise to the...
0 replies - 7347 views - 02/23/12 by Erik Hatcher in Articles
The Apache Software Foundation celebrates the 17th Anniversary of the Apache HTTP Server today with the official release of v2.4. Celebrating its 17th...
0 replies - 6030 views - 02/21/12 by Chris Smith in Articles
Recently we had a chance to help with a non-commercial project which
included search as one of its parts. One of the assumptions, although not a
key one, was...
1 reply - 5954 views - 02/21/12 by Rafał Kuć in Articles
Two popular methods of indexing existing data are the Data Import
Handler (DIH) and Tika (Solr Cell)/ExtractingRequestHandler. These can
be used to index...
1 reply - 8468 views - 02/15/12 by Erick Erickson in Articles
I’ve been spending the last few days at work trying to improve our
search performance, and have been banging my head against the dismax
query target and...
0 replies - 5232 views - 02/14/12 by Geoffrey Papilion in Articles