Apache Tika: 1 point Oh!

11.16.2011

Apache Tika's all grown up! A fledgling sub-project of Lucene for two years after emerging from the incubator in 2008, Tika is now spreading its wings and soaring as an ASF top-level project and a leading text extraction library and content detection framework. That celebratory tone runs through the presentation given at ApacheCon NA 2011 by Chris Mattmann, senior computer scientist at the NASA Jet Propulsion Laboratory and adjunct assistant professor at the University of Southern California.

Mattmann lists the following as evidence that Tika has reached the point where it deserves to be called a "mature community":

In November, we hope to have released Tika 1.0. This will coincide with a number of other properties that demonstrate Tika has reached the point of a mature community, including:

1. Concrete, stable features, and core interfaces.
2. Tika's use in multiple programming languages and environments.
3. Our growth in Apache, and election of new committers and PMC members (and ASF members).
4. Developer articles appearing quite frequently on Tika.
5. The culmination of a wealth of knowledge in the form of a book that will be published on Tika at the time of the ApacheCon meeting.


I can't say for sure whether the book has been published, but Tika 1.0 was indeed released on November 7, just in time for ApacheCon. Congratulations are in order for Tika reaching yet another goal.

To learn more about how Tika has achieved its success and what comes next for the community, give Mattmann's presentation a listen!

Published at DZone with permission of its author, David Pell.
