This project uses Reinforcement Learning (PPO) to optimize stock portfolio allocation and combines it with sentiment analysis from real-world financial news.
All environment requirements are listed in `env.yaml`:

```
python >= 3.10
gymnasium
pandas
numpy
yfinance
stable-baselines3
tqdm
pip
transformers
datasets
kaggle
```
Developers used the following devices:
- Lenovo Thinkpad T14s Gen2
- Macbook
No GPUs were used during training. The sentiment-analysis step currently takes around 30 minutes on CPU and would be faster on a GPU. Training with 100,000 timesteps takes around 2 minutes.
The following optional arguments can be passed when running `main.py`.
| Argument | Type | Default | Description |
|---|---|---|---|
| `--num_portfolio_stocks` | int | 20 | Number of stocks to include in the portfolio. |
| `--start_date` | str | "2019-05-01" | Starting date for the portfolio's timeframe (format: YYYY-MM-DD). |
| `--end_date` | str | "2020-03-25" | Ending date for the portfolio's timeframe (format: YYYY-MM-DD). |
| `--stock_index` | str | "nasdaq" | The stock index to fetch tickers from. Options: "nasdaq", "nyse", or "all". |
| `--random_seed` | int | 42 | Random seed for reproducibility. |
| `--cache_dir` | str | "./cache/" | Directory path to store cached content. |
| `--use_sentiment` | int | 0 | Whether to include news sentiment in the optimization strategy. Set to 1 to use sentiment, or 0 to ignore. |
| `--best_model_path` | str | "./cache/best_model" | Directory path to store the best-performing model. |
| `--eval_dir` | str | "./cache/eval" | Directory path to store evaluation callback results. |
```
python main.py --num_portfolio_stocks 25 --start_date 2020-01-01 --end_date 2021-01-01 --stock_index nyse --use_sentiment 1
```

- Fetches the current list of tickers available on NASDAQ, NYSE, or both
- Verifies each ticker's presence via the yfinance API
- All verified tickers are treated as `valid_tickers`; both valid and invalid results are cached to speed up development
- Samples `n` tickers from `valid_tickers` and treats them as the `portfolio_stocks`
- Sampled tickers are re-validated, because some tickers get past the first round of verification
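The validate-and-cache step above could be sketched as follows. This is illustrative, not the project's actual code: the function and cache-file names are made up, and the yfinance check is passed in as `validate_fn` so it can be swapped or stubbed.

```python
import json
import os

def load_or_validate(tickers, cache_path, validate_fn):
    """Return (valid, invalid) ticker lists, caching results as JSON
    so repeated runs skip the slow per-ticker API checks."""
    if os.path.exists(cache_path):
        with open(cache_path) as f:
            cached = json.load(f)
        return cached["valid"], cached["invalid"]
    valid = [t for t in tickers if validate_fn(t)]
    invalid = [t for t in tickers if t not in set(valid)]
    with open(cache_path, "w") as f:
        json.dump({"valid": valid, "invalid": invalid}, f)
    return valid, invalid
```

In the real pipeline, `validate_fn` would query yfinance (e.g. checking that a ticker returns non-empty price history), and sampled tickers are run through it a second time.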
The environment is built on Gymnasium (the maintained fork of OpenAI Gym) and uses stable-baselines3 for logging, evaluation, and model implementation.
The environment models both the stock portfolio and the market, considering asset holdings as well as market conditions.
The agent uses a Multi-Layer Perceptron (MLP) policy with the PPO algorithm to optimize asset allocation.
We use FinBERT (a financial-domain BERT model) to classify the sentiment of stock-related news articles.
The sentiment signal is then used to inform portfolio decisions, reflecting the daily tone of the market based on recent news coverage.
The pipeline:
- Downloads a large dataset of historical stock news headlines
- Filters to the busiest month
- Applies FinBERT sentiment analysis
- Aggregates into a daily sentiment time series
Output:
- `daily_sentiment.csv`: one row per day with an average sentiment score (a float between -1 and 1)
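The core of the sentiment steps might look like this sketch. The `ProsusAI/finbert` checkpoint and the column names are assumptions, and the label-to-score mapping is one plausible choice; the project's actual code may differ.

```python
import pandas as pd
from transformers import pipeline

# Load a FinBERT checkpoint from the Hugging Face Hub (assumed checkpoint).
clf = pipeline("text-classification", model="ProsusAI/finbert")

headlines = pd.DataFrame({
    "date": ["2019-05-01", "2019-05-01", "2019-05-02"],
    "headline": [
        "Shares surge on strong quarterly earnings",
        "Regulator opens probe into lender",
        "Markets flat ahead of Fed decision",
    ],
})

# Map FinBERT's label/confidence pairs to a signed score in [-1, 1],
# then average per day to get the daily sentiment time series.
sign = {"positive": 1.0, "negative": -1.0, "neutral": 0.0}
results = clf(headlines["headline"].tolist())
headlines["score"] = [sign[r["label"]] * r["score"] for r in results]
daily = headlines.groupby("date")["score"].mean()
daily.to_csv("daily_sentiment.csv")
```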
Run these commands from the project root:
- `make`: show available commands
- `make get-data`: download the stock news dataset from Kaggle
- `make filter-data n=10000`: filter to the busiest month and keep 10,000 random rows
- `make get-sentiments`: run the FinBERT sentiment pipeline on the filtered (or full) dataset
- `make sentiment n=10000`: run the full pipeline: download, filter, analyze
See env.yaml to set up the Conda environment:
conda env create -f env.yaml
conda activate RL

Also make sure to configure your Kaggle API credentials:
- Go to https://www.kaggle.com/account
- Create a new API token
- Place the downloaded `kaggle.json` file into `~/.kaggle/kaggle.json`

Or set the following environment variables:
export KAGGLE_USERNAME=your_username
export KAGGLE_KEY=your_key

Note: You must download the data from Kaggle through the API and run the sentiment pipeline separately from the market data pipeline. After the `daily_sentiment.csv` file has been created in the data directory, you can run `main.py`.
| Sentiment | Period | Cum. Return | Avg Return | Volatility | Sharpe (Simple) | Sharpe (Log) |
|---|---|---|---|---|---|---|
| No | 2019-05-01 to 2020-03-25 | 0.8391 | -0.0097 | 0.0617 | -0.6289 | -0.7474 |
| Yes | 2019-05-01 to 2020-03-25 | 0.8574 | -0.0084 | 0.0598 | -0.5619 | -0.6804 |
| No | 2019-05-01 to 2019-12-31 | 1.0140 | 0.0024 | 0.0090 | 0.6957 | 0.6839 |
| Yes | 2019-05-01 to 2019-12-31 | 1.0302 | 0.0050 | 0.0088 | 1.5024 | 1.4964 |
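A sketch of how the table's metrics could be computed from a series of simple daily returns. The risk-free rate is assumed to be 0 and no annualization is applied, since the table does not specify either.

```python
import numpy as np

def evaluate(returns):
    """Compute Cum. Return, Avg Return, Volatility, and both Sharpe
    variants from simple (arithmetic) daily returns."""
    returns = np.asarray(returns, dtype=float)
    cum_return = float(np.prod(1 + returns))   # growth of 1 unit of capital
    avg_return = returns.mean()
    volatility = returns.std(ddof=1)
    sharpe_simple = avg_return / volatility    # risk-free rate assumed 0
    log_returns = np.log1p(returns)            # Sharpe computed on log returns
    sharpe_log = log_returns.mean() / log_returns.std(ddof=1)
    return cum_return, avg_return, volatility, sharpe_simple, sharpe_log
```

Under this reading, a cumulative return below 1.0 (as in the full-period rows, which include the March 2020 crash) means the portfolio lost value over the window.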
- More algorithms beyond PPO
- Our own implementation of PPO (not relying on stable-baselines3)
- A larger range of sentiment data
- Storing sentiment and cached data in a database
- A more robust evaluation pipeline
- Better logging
- Use of technical indicators in the data