Donate. I desperately need donations to survive due to my health

Get paid by answering surveys Click here

Click here to donate

Remote/Work from Home jobs

Using Nutch and Elasticsearch together

I am new to Nutch and Elasticsearch. My requirement is to search parent website as well as other links inside that website, documents for a text/phrase search and display the result. Also we need to display the crawling details for that website.

I am using Nutch 2.3, MySQL and Elasticsearch 6.4.0. I am able to complete the steps for inject, generate, fetch, parse and updateDB.

My questions are:

  1. Except the database, where can I see the results of the above mentioned processes. For example, if I want to know what happens when generate command runs, what is the output and where it is stored, where can I see.
  2. How to do indexing in Nutch and integrate the same with elasticsearch. When I execute the index command, it shows "could not find or load class Index".

Comments