Apache

  • submit to reddit

Moving readVInt to C

By far the hottest spot in Lucene during searching is the method (DataInput.readVInt) that decodes Lucene's variable-length integer representation (vInt). ...

1 replies - 6724 views - 06/29/11 by Michael Mccandless in News

Solr filters: PatternReplaceCharFilter

Continuing the overview of the filters included in Solr today we look at the PatternReplaceCharFilter. As you might guess the task of the filter is to change...

1 replies - 7737 views - 06/28/11 by Marek Rogoziński in News

Solr filters: KeepWordFilter

This time I decided to look at one of the unusual filters available in the standard distribution of Solr. The first one in my hands is a filter called...

0 replies - 6709 views - 06/27/11 by Rafał Kuć in News

“Car sale application” – SpellCheckComponent – did you really mean that ? (part 5)

The time has come to add another important functionality to our car sale application. It will be the spell checking mechanism with the ability to construct...

1 replies - 4892 views - 06/25/11 by Rafał Andrzejewski in News

Quick look: Solr Frange Queries

In Solr 1.4 there was a new type of query presented: the frange queries. This new type of query lets you search for a range of values. According to the...

0 replies - 12081 views - 06/24/11 by Rafał Kuć in News

“Car sale application” – Unicode Collation, sorting text in a language-sensitive way (part 4)

In the third part of our ”Car sale” application related posts we added some location data and the information about the city that is related to every...

0 replies - 4553 views - 06/22/11 by Rafał Andrzejewski in News

The Invisible Lucene bug fixed point

It turns out, the Jira issue tracking system, which we make heavy use of here at Apache, uses Lucene under the hood for searching and browsing issues. This is...

3 replies - 11145 views - 06/21/11 by Michael Mccandless in News

Indexing files like doc, pdf – Solr and Tika integration

In the previous article we have given basic information about how to enable the indexing of binary files, ie MS Word files, PDF files or LibreOffice ...

1 replies - 8207 views - 06/20/11 by Marek Rogoziński in News

Solr and Tika integration (part 1 – basics)

Indexing the so-called “rich documents”, ie files like pdf, doc, rtf, and so on (or binary files) always required some additional work on the...

2 replies - 8890 views - 06/16/11 by Rafał Kuć in News

Solr, Tomcat and HTTP/1.1 505 HTTP Version Not Supported

During today’s hacking aboot I came across the above error from our Solr query library. The error indicates that some part of Tomcat was unable to parse...

0 replies - 7217 views - 06/15/11 by Mats Lindh in News

Solr: data indexing for fun and profit

Solr is not very friendly to novice users. Preparing good schema file requires some experience. Assuming that we have prepared the configuration files,...

1 replies - 7241 views - 06/14/11 by Marek Rogoziński in News

Dynamic facet population with Solr DataImportHandler

Here's what I'm trying to do: Given this mysql table: CREATE TABLE `tag` ( `id` integer AUTO_INCREMENT NOT NULL PRIMARY KEY, `name` varchar(100) NOT...

0 replies - 5087 views - 06/13/11 by Kelvin Tan in News

”Car sale” application – WordDelimiterFilter and PatternReplaceFilter, helping to improve search results (part 2)

In the first part of our ”Car sale” application related posts we created some standard index structure by properly configuring schema.xml configuration...

0 replies - 5233 views - 06/11/11 by Rafał Andrzejewski in News

Solr Optimization – filter cache

Today’s entry is dedicated to one type of cache in the Solr – filter cache. I will try to explain what it does, how to configure it and how to use it...

0 replies - 5618 views - 06/11/11 by Rafał Kuć in News

“Car sale application” – schema.xml designing to gain what we really need (part 1)

One of the fundamental pieces of solr’s configuration file is the schema.xml file. It is a kind of connector between what we need and what solr...

0 replies - 4718 views - 06/09/11 by Rafał Andrzejewski in News