OpenMovieData is a project that aims to provide a comprehensive dataset of movies and their associated data. The dataset is intended to be used for data analysis and machine learning projects. Furthermore, these "raw" datasets will be integrated into a Neo4j Property Graph Database to provide a more structured and queryable dataset.
Further documentation on the graph, sources, and licenses can be found on the webpage.
If you use this dataset in your research, please cite this GitHub repository with the following citation:
@misc{OpenMovieData,
author = {Luka van den Boogaard and Marlou Gielen and Sander Moonemans and Luc Siecker},
title = {OpenMovieData},
year = {2023},
publisher = {GitHub},
journal = {GitHub repository},
howpublished = {\url{https://sandermoon.github.io/OpenMovieData/}}
}
Furthermore, please notify the maintainers of this project of your publication, so that we can add it to the list of publications that use this dataset in the citations folder. This will allow us to track any changes in the dataset and how the dataset is used.
Any contributions to the project are welcome. If you have any suggestions or ideas, please open an issue. If you want to contribute to the code or data, please open a pull request. If pull requests are not reviewed in a timely fashion, please contact the maintainers of this project through their GitHub profiles.
The data and code in this repository, except for the contents of the raw_data directory, are licensed under the Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) license. The licensing of the datasets in the raw_data folder in the root of this repository can be found in the Sources section on the webpage.