This repository has been archived by the owner on Jul 10, 2019. It is now read-only.

Sandbox

jnioche edited this page Dec 6, 2010 · 2 revisions

THIS WILL BE UPDATED SOON AND IS NO LONGER RELEVANT SINCE THE MERGE WITH THE DEV BRANCH

The sandbox contains custom applications based on Behemoth’s core resources.

SOLR-Indexer

The SOLR Indexer sandbox contains a single job class. It takes a sequence file containing BehemothDocuments, generates a number of fields in a SOLRDocument, and sends that document to SOLR for indexing.

This sandbox application illustrates how to build a custom Behemoth application. Its Ant build script can be reused for that purpose.

The SOLR Indexer can be compiled using:

ant -f $BEHEMOTH_HOME/sandbox/solr-indexer/build.xml

The main job is launched with:

./hadoop jar $BEHEMOTH_HOME/sandbox/solr-indexer/build/behemoth-SOLR-0.1-snapshot.job com.digitalpebble.solr.SOLRIndexerJob -conf $BEHEMOTH_HOME/sandbox/solr-indexer/conf/behemoth-site.xml inputAS SOLR_URL

See $BEHEMOTH_HOME/sandbox/solr-indexer/conf/behemoth-site.xml for more details about the parameters used by the SOLR Indexer.
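
The launch command above is long and repeats the same base path twice, so it may help to assemble it in a small shell function. The sketch below only prints the command instead of running it. The jar path, class name and -conf file are taken from the text above; the function name solr_indexer_cmd and its two positional arguments (the input path and the SOLR URL) are assumptions made for illustration.

```shell
#!/bin/sh
# Hypothetical helper (not part of Behemoth): prints the launch command
# for the SOLR Indexer job so it can be inspected before being run.
# Expects BEHEMOTH_HOME to be set in the environment.
solr_indexer_cmd() {
    input="$1"      # assumed: the sequence file containing BehemothDocuments
    solr_url="$2"   # assumed: URL of the SOLR instance to index into
    base="$BEHEMOTH_HOME/sandbox/solr-indexer"
    printf '%s\n' "./hadoop jar $base/build/behemoth-SOLR-0.1-snapshot.job com.digitalpebble.solr.SOLRIndexerJob -conf $base/conf/behemoth-site.xml $input $solr_url"
}
```

With BEHEMOTH_HOME set, a call such as solr_indexer_cmd my-input http://localhost:8983/solr (both values are placeholders) prints the full command line, which can then be run from the Hadoop bin directory.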
