INCA AIMS

INCA aims to provide a bundle of scraping and analysis functionalities for social scientists. The main goals are to facilitate

Data collection from websites and social media.
Basic processing, such as tokenizing, lemmatizing, POS-tagging, NER
Some analyses such as machine learning or time series analysis

INCA USAGE

INCA is currently under heavy development. We cannot guarantee that it works as expected.

For those brave enough:

Direct pip-install, because it's easy (not for development)

pip install git+https://github.com/uvacw/inca.git

Starting INCA using Docker containers

This is the most basic setup for inca in self-built container, without linking it to Elasticsearch.

Starting Elasticsearch for data storage

docker run -it \
  -p 9200:9200 \
  -p 9300:9300 \
  -e "discovery.type=single-node" \
  --name=inca-elastic \
  docker.elastic.co/elasticsearch/elasticsearch:6.8.5

Building the INCA container ...

cd path/to/folder/inca
docker build -t inca .

Run the container

# first time
docker run --name test-inca -it inca python
# thereafter
docker start test-inca
docker attach test-inca

Starting INCA directly on your machine ('bare metal')

Please have a look at the documentation in the inca/doc/ folder.

... and/or use the following to quickly install inca:

Make sure you have the Java Runtime environment, because Elasticsearch (see next step) cannot be installed without. On Ubuntu, you can just do sudo apt-get install default-jre.
Install Elasticsearch 6. You can find instructions here: https://www.elastic.co/guide/en/elasticsearch/reference/current/install-elasticsearch.html
Make sure you have the python3-dev package and the python3-tk package installed (sudo apt-get install python3-dev python3-tk) as well as a c compiler (sudo apt-get install g++).
Make sure you have pip3 (sudo apt install python3-pip) and setuptools installed (sudo pip3 install setuptools)
Then:

pip3 install git+https://github.com/uvacw/inca.git

If you want to use pattern for preprocessing, also do:

sudo apt-get install libmysqlclient-dev
sudo pip3 install pattern

Name		Name	Last commit message	Last commit date
Latest commit History 1,157 Commits
doc		doc
inca		inca
scripts		scripts
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
REDOWNLOADED.LOG		REDOWNLOADED.LOG
Requirements		Requirements
Requirements-loose		Requirements-loose
setup.py		setup.py
setup_fresh_ubuntu_machine.sh		setup_fresh_ubuntu_machine.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

INCA AIMS

INCA USAGE

For those brave enough:

Direct pip-install, because it's easy (not for development)

Starting INCA using Docker containers

Starting INCA directly on your machine ('bare metal')

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 21

Uh oh!

Languages

uvacw/inca

Folders and files

Latest commit

History

Repository files navigation

INCA AIMS

INCA USAGE

For those brave enough:

Direct pip-install, because it's easy (not for development)

Starting INCA using Docker containers

Starting INCA directly on your machine ('bare metal')

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 21

Uh oh!

Languages

Packages