Friday, 25 April 2014
All info about Nutch, Solr, and Hadoop integration
Posted by Unknown at 09:52
Apache Nutch is an open source Web crawler written in Java. With it,
we can discover the hyperlinks on Web pages automatically, cut down on
maintenance work such as checking for broken links, and keep a copy of
every visited page to search over. That is where Apache Solr comes in.
Solr is an open source full-text search platform, and with it we can
search the pages that Nutch has visited. Luckily, integration between
Nutch and Solr is pretty straightforward, as explained below.
Read more here: http://wiki.apache.org/nutch/NutchTutorial
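
To give a rough idea of what "searching the visited pages" looks like once Nutch has pushed its crawl into Solr, here is a minimal sketch that only builds a query URL for Solr's HTTP select handler. The host, port, and path (http://localhost:8983/solr/select) and the field names (content, url, title) are assumptions based on a default single-core setup with a Nutch-style schema, and build_search_url is a made-up helper name, so adjust all of them to your installation.

```python
from urllib.parse import urlencode

# Assumed values: a local Solr instance on its default port. Change this
# to match your own installation.
SOLR_SELECT_URL = "http://localhost:8983/solr/select"


def build_search_url(term, rows=10, base_url=SOLR_SELECT_URL):
    """Return a Solr select URL that searches crawled page text for `term`.

    Hypothetical helper for illustration; the field names below are
    assumptions about the schema used for the Nutch crawl.
    """
    params = [
        ("q", f"content:{term}"),  # field:term query against the page text
        ("fl", "url,title"),       # only return these fields for each hit
        ("rows", rows),            # maximum number of hits to return
        ("wt", "json"),            # ask Solr for a JSON response
    ]
    # urlencode percent-escapes reserved characters such as ':' and ','
    return base_url + "?" + urlencode(params)


if __name__ == "__main__":
    print(build_search_url("hadoop"))
```

Opening the printed URL in a browser (or fetching it with any HTTP client) should list the matching pages, provided Solr is running and the crawl has actually been indexed.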
Nutch 1.3 and Solr Integration
Configure Apache Solr 1.4 with MySQL
http://www.params.me/2011/07/apache-nutch-13-setup.html