I am new to Nutch and Elasticsearch. My requirement is to search parent website as well as other links inside that website, documents for a text/phrase search and display the result. Also we need to display the crawling details for that website.
I am using Nutch 2.3, MySQL and Elasticsearch 6.4.0. I am able to complete the steps for inject, generate, fetch, parse and updateDB.
My questions are:
- Except the database, where can I see the results of the above mentioned processes. For example, if I want to know what happens when generate command runs, what is the output and where it is stored, where can I see.
- How to do indexing in Nutch and integrate the same with elasticsearch. When I execute the index command, it shows "could not find or load class Index".
Comments
Post a Comment