This code is a set of runnable examples for the blog series: Fundamentals of data processing for SciFi geeks
- Install maven: https://maven.apache.org/install.html
- Clone the repo.
Run the command from the repo directory:
- Download "The Complete Works of William Shakespeare":
wget -O ./data/shakespeare.txt http://www.gutenberg.org/files/100/100-0.txt
- Run the examples console:
mvn -q clean install exec:java
You will see the prompt:
Your wish is my command.
>>>
You can run the following commands:
- iterator_topk
- iterator_word_count
- word_count_parallel
- word_count_serial
- word_count_rdd
- exit