Skip to content

data import command on page 149 does not complete #24

@pauljabernathy

Description

@pauljabernathy

I followed the directions on the github main page to download the code, ran mvn package, and went to the bin directory and ran ./start-solr.sh solr-clustering &. So far so good.

But when I went to http://localhost:8983/solr/dataimport?command=full-import, but the data import could not complete. The text in the status message was "Indexing failed. Rolled back all changes." In the console, I found this error message:

SEVERE: Exception thrown while getting data
java.io.IOException: Server returned HTTP response code: 403 for URL: http://www.startribune.com/sports/index.rss2
at sun.net.www.protocol.http.HttpURLConnection.getInputStream(HttpURLConnection.java:1626)
...

For whatever reason, the script seems unable to get to the url http://www.startribune.com/sports/index.rss2. However, I can get to the url from the browser window. Is there a known issue with getting to this page from the solr example?

I am trying to do Mahout clustering and have been getting errors when trying to cluster some other documents (the docs that came with solr 4.6.1) so now I am trying to follow the book's examples exactly so I could guarantee that the clustering process will run correctly.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions