Popular repositories
-
lunar-lander-dqn-agent (Public)
Deep Q-Learning agent for the OpenAI LunarLander-v3 environment.
Python
-
Doctor_Chatbot (Public)
Instruction-tuned LLaMA 2 chatbot fine-tuned with LoRA on real medical Q&A data. Built for conversational health-related queries using Transformers and PEFT.
Python
-
pacman-deepq-learning (Public)
Deep Q-Learning agent trained to play Ms. Pac-Man using a convolutional neural network and experience replay. Built with PyTorch and Gymnasium.
Python
-
kungfu-a3c-agent (Public)
A3C-style parallel actor-critic agent trained to play Kung Fu Master (Atari) using PyTorch and Gymnasium. Includes parallel environments, shared network, and video playback.
Python