Tuesday 19 April 2011

Magnolia Apache Solr integration

Last week I finally released the magnolia-solr-module on Magnolia's Forge. The module aims to bring Apache Solr's outstanding search features to Magnolia. For those who don't know Solr:

"Solr is the popular, blazing fast open source enterprise search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting, faceted search, dynamic clustering, database integration, rich document (e.g., Word, PDF) handling, and geospatial search. Solr is highly scalable, providing distributed search and index replication, and it powers the search and navigation features of many of the world's largest internet sites."

Solr is used by some of the largest companies in the world, such as Apple, eBay, Zappos, Getty Images and Salesforce, to name just a few. The Guardian (which has the second-highest readership of any online news site after the New York Times) recently chose Solr for its content API.

The reason for integrating Magnolia and Solr is a very simple one: let the best open source tool for each task do the job. In my case, I like to manage and publish content with Magnolia CMS (of course, I am biased towards Magnolia ;)) and its intuitive, easy-to-use interface, and then index and search that content with Solr.

The step-by-step tutorial accompanying the module explains how to achieve this, and how to customize the module in case its default behavior does not suit your needs.

So, if you need blazing fast search for your Magnolia-based website, give the module a go and enjoy Magnolia+Solr integration!