Curated RSS feeds for data platform engineers, backend engineers transitioning to data, and infrastructure-focused data engineers.
Maintained by a Staff Software Engineer with 15+ years in backend engineering, distributed systems, and platform engineering, now focused on data platforms. This list powers an AI-powered reading assistant that processes ~1,000 articles daily.
Why this list is different: Most data engineering lists focus purely on analytics tools. This list covers the full stack:
- ✅ Data pipelines (Python, Airflow, Spark, Databricks)
- ✅ Backend systems (Python, Java, microservices, distributed systems)
- ✅ Infrastructure (Kubernetes, databases, service mesh)
- ✅ ML operations (MLOps, model deployment)
Perfect for:
- Backend engineers transitioning to data engineering
- Data engineers who manage infrastructure
- Platform engineers building data platforms
- Anyone who needs to understand data systems end-to-end
The data engineering landscape produces overwhelming content. This curated collection focuses on signal over noise—feeds that provide technical depth, practical insights, and keep you current without drowning in marketing fluff.
- Quick Start
- Data Engineering
- Backend Engineering & Distributed Systems
- Databases & Storage
- MLOps & AI Engineering
- Platform Engineering & Cloud Native
- Streaming & Real-Time Data
- Infrastructure as Code
- Company Engineering Blogs
- Individual Bloggers
- Newsletters & Curated Content
- Learning Resources
- How to Use This List
- Contributing
Import all feeds at once: Download the OPML file and import into your RSS reader (Inoreader, Feedly, NewsBlur, etc.)
Or browse by category below and subscribe to individual feeds that interest you.
Feeds focused on data pipelines, data warehouses, analytics engineering, and modern data platforms.
-
Databricks Blog - Apache Spark, Delta Lake, Unity Catalog, and data lakehouse architecture. Essential for modern data engineering.
- 📅 Updates: 2-3x/week | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://www.databricks.com/blog/rss.xml
-
dbt Blog - Analytics engineering, data transformation, and the metrics layer. Pioneering the analytics engineer role.
- 📅 Updates: Weekly | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://www.getdbt.com/blog/rss.xml
-
Locally Optimistic - Community blog on data in production. Real-world experiences from data leaders across industries.
- 📅 Updates: Monthly | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://locallyoptimistic.com/rss/
-
The Data Engineering Podcast - In-depth interviews with data engineering practitioners building production systems.
- 📅 Updates: Weekly | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://www.dataengineeringpodcast.com/feed/
-
Seattle Data Guy - Practical tutorials, career advice, and data engineering fundamentals. Great for learning.
- 📅 Updates: 2x/week | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://seattledataguy.substack.com/feed
-
Modern Data 101 - Modern data stack trends, tools, and practices.
- 📅 Updates: Weekly | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://moderndata101.substack.com/feed
-
MotherDuck Blog - DuckDB, serverless analytics, and modern OLAP database architecture.
- 📅 Updates: 2x/month | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://motherduck.com/blog/rss.xml
-
ClickHouse Blog - OLAP databases, real-time analytics, and columnar storage at scale.
- 📅 Updates: Weekly | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://clickhouse.com/blog/rss
-
Snowflake Blog - Data warehouse architecture, data sharing, and cloud data platforms.
- 📅 Updates: 3x/week | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://www.snowflake.com/blog/rss/
-
Trino Blog - Distributed SQL query engine, data federation, and query optimization.
- 📅 Updates: Monthly | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://trino.io/blog/rss.xml
-
Data Engineering Weekly - Curated newsletter highlighting the best data engineering articles each week.
- 📅 Updates: Weekly | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://www.dataengineeringweekly.com/feed.xml
Feeds on backend architectures, microservices, distributed systems theory, and the infrastructure that data pipelines run on. Essential for data platform engineers.
Python is the primary language for modern data engineering. These feeds cover Python fundamentals, best practices, and data-focused applications.
-
Real Python - Comprehensive Python tutorials, best practices, and deep dives. Essential for Python developers at all levels.
- 📅 Updates: 2-3x/week | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://realpython.com/atom.xml
-
Talk Python Podcast - Weekly Python podcast with industry practitioners. Deep technical discussions on Python in production.
- 📅 Updates: Weekly | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://talkpython.fm/episodes/rss
-
Python Bytes Podcast - Weekly Python news and headlines. Quick 30-minute format covering what's new in Python.
- 📅 Updates: Weekly | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://pythonbytes.fm/episodes/rss
-
Planet Python - Aggregator of Python blogs from the community. Curated feed of quality Python content.
- 📅 Updates: Daily | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://planetpython.org/rss20.xml
-
Full Stack Python - Python web development, deployment, and best practices. Great for building data APIs.
- 📅 Updates: Monthly | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://www.fullstackpython.com/feed
-
ArjanCodes - Python software design, architecture patterns, and clean code. Excellent for writing maintainable data pipelines.
- 📅 Updates: Weekly | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://www.arjancodes.com/blog/rss.xml
-
Towards Data Science - Data science, ML, and analytics with Python. Large community publication on Medium.
- 📅 Updates: Daily | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://towardsdatascience.com/feed
-
KDnuggets - Data science, ML, and AI news. One of the oldest and most respected data science communities.
- 📅 Updates: Daily | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://www.kdnuggets.com/feed
-
PyBites - Python challenges, code patterns, and best practices. Strong focus on practical Python skills.
- 📅 Updates: Weekly | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://pybit.es/feeds/all.rss.xml
-
Mouse Vs Python - Python tutorials, book reviews, and library deep dives. Great for discovering Python tools.
- 📅 Updates: 2x/week | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://www.blog.pythonlibrary.org/feed/
-
Baeldung - Comprehensive Java tutorials, Spring Framework, microservices patterns, and REST APIs. Practical and example-driven.
- 📅 Updates: Daily | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://www.baeldung.com/feed/
-
Inside Java - Official Java news from Oracle. JEPs, performance improvements, new language features, and JVM internals.
- 📅 Updates: Weekly | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://inside.java/rss.xml
-
Foojay - Friends of OpenJDK community. Java ecosystem news, library updates, and community perspectives.
- 📅 Updates: 2-3x/week | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://foojay.io/feed/
-
Spring Blog - Official Spring Framework updates, guides, best practices, and release announcements.
- 📅 Updates: 3x/week | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://spring.io/blog.atom
-
Vlad Mihalcea - Hibernate, JPA, JDBC, and database performance tuning for Java applications. Deep technical content.
- 📅 Updates: Weekly | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://vladmihalcea.com/feed/
-
Microservices.io - Chris Richardson's authoritative patterns for microservices architecture, event-driven systems, and SAGA pattern.
- 📅 Updates: Monthly | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://microservices.io/feed.xml
-
High Scalability - Case studies of systems at scale. How companies like Netflix, Uber, and Pinterest architect their platforms.
- 📅 Updates: Weekly | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
http://highscalability.com/rss.xml
-
Istio Blog - Service mesh patterns, microservices networking, traffic management, and observability.
- 📅 Updates: Monthly | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://istio.io/latest/blog/index.xml
-
Linkerd Blog - Lightweight service mesh alternative. Simpler than Istio, focused on reliability and performance.
- 📅 Updates: Monthly | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://linkerd.io/blog/index.xml
Note: Distributed Systems feeds moved here from separate section for better organization
-
Martin Kleppmann - Author of "Designing Data-Intensive Applications." Distributed systems research and theory.
- 📅 Updates: Quarterly | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://martin.kleppmann.com/feed.xml
-
Marc Brooker's Blog - AWS VP on distributed systems, formal methods, reliability engineering, and large-scale systems.
- 📅 Updates: Monthly | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://brooker.co.za/blog/rss.xml
-
All Things Distributed - Werner Vogels (AWS CTO) on distributed systems architecture and cloud computing.
- 📅 Updates: Quarterly | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://www.allthingsdistributed.com/atom.xml
-
Jepsen / Aphyr - Kyle Kingsbury's distributed systems testing, consistency analysis, and breaking databases.
- 📅 Updates: Occasional | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://aphyr.com/rss
-
The Morning Paper - Adrian Colyer's summaries of CS research papers. Note: Paused but archive is gold.
- 📅 Updates: Archive | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://blog.acolyer.org/feed/
-
Murat Demirbas - Distributed systems professor, paper reviews, and research insights.
- 📅 Updates: Weekly | 📊 Quality: ⭐⭐⭐⭐
- RSS:
http://muratbuffalo.blogspot.com/feeds/posts/default
Deep dives into database internals, performance tuning, query optimization, and data storage systems.
-
PostgreSQL Blog - Official Postgres news, releases, and community updates.
- 📅 Updates: Weekly | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://www.postgresql.org/rss/news.rss
-
Cybertec PostgreSQL Blog - Advanced Postgres tuning, replication, high availability, and internals.
- 📅 Updates: 2x/week | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://www.cybertec-postgresql.com/feed/
-
Crunchy Data Blog - Postgres at scale, cloud deployments, operators, and container-based Postgres.
- 📅 Updates: Weekly | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://www.crunchydata.com/blog/rss.xml
- Percona Blog - MySQL and MongoDB performance, optimization, and high availability.
- 📅 Updates: 3x/week | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://www.percona.com/blog/feed/
- Use The Index, Luke - SQL indexing, query performance, and optimization across databases.
- 📅 Updates: Quarterly | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://use-the-index-luke.com/blog/feed
Note: OLAP databases (ClickHouse, Snowflake, etc.) are in the Data Engineering section above.
Feeds covering machine learning operations, ML infrastructure, LLM applications, and production ML systems.
-
Eugene Yan - Applied ML, recommendation systems, and ML in production. Former Amazon/Lazada ML engineer.
- 📅 Updates: Monthly | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://eugeneyan.com/feed.xml
-
Chip Huyen - ML systems design, MLOps, and real-time ML. Author of "Designing Machine Learning Systems."
- 📅 Updates: Monthly | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://huyenchip.com/feed.xml
-
MLOps Community - Best practices, case studies, and tools for operationalizing ML.
- 📅 Updates: 2-3x/week | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://mlops.community/feed/
-
Made With ML - Production ML, MLOps, and building ML products that scale.
- 📅 Updates: Monthly | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://madewithml.com/rss.xml
-
Hugging Face Blog - Transformers, LLMs, diffusion models, and open-source AI.
- 📅 Updates: 2-3x/week | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://huggingface.co/blog/feed.xml
-
OpenAI Blog - AI research, GPT models, and frontier AI systems.
- 📅 Updates: Weekly | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://openai.com/blog/rss.xml
-
Latent Space - AI engineering podcast and newsletter. Practical insights on building with LLMs.
- 📅 Updates: Weekly | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://www.latent.space/feed
-
Neptune.ai Blog - Experiment tracking, model registry, and ML metadata management.
- 📅 Updates: Weekly | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://neptune.ai/blog/rss.xml
-
Weights & Biases Blog - ML experiment tracking, model versioning, and collaborative ML.
- 📅 Updates: 2x/week | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://wandb.ai/blog/rss.xml
-
Evidently AI Blog - ML observability, model monitoring, and data drift detection.
- 📅 Updates: 2x/month | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://www.evidentlyai.com/blog/rss
Feeds on Kubernetes, infrastructure automation, DevOps, SRE, and building developer platforms.
-
Kubernetes Blog - Official K8s updates, features, and best practices.
- 📅 Updates: Weekly | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://kubernetes.io/feed.xml
-
CNCF Blog - Cloud Native Computing Foundation ecosystem updates.
- 📅 Updates: Daily | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://www.cncf.io/feed/
-
The New Stack - Cloud native, containers, microservices, and platform engineering.
- 📅 Updates: Daily | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://thenewstack.io/feed/
-
Honeycomb Blog - Observability, distributed tracing, and debugging production systems. Co-founded by Charity Majors.
- 📅 Updates: 2x/week | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://www.honeycomb.io/blog/rss.xml
-
Platform Engineering - Building internal developer platforms and improving developer experience.
- 📅 Updates: Weekly | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://platformengineering.org/blog/rss.xml
-
Grafana Labs Blog - Monitoring, visualization, and observability stack.
- 📅 Updates: 2x/week | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://grafana.com/blog/index.xml
-
Last Week in AWS - AWS news, cost optimization, and cloud infrastructure with humor. By Corey Quinn.
- 📅 Updates: Weekly | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://www.lastweekinaws.com/feed/
Feeds on stream processing, event-driven architectures, Kafka, and real-time analytics.
-
Confluent Blog - Apache Kafka, stream processing, and event-driven architecture.
- 📅 Updates: 2-3x/week | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://www.confluent.io/blog/feed/
-
Apache Flink Blog - Stream processing, stateful computations, and real-time analytics.
- 📅 Updates: Monthly | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://flink.apache.org/blog/feed.xml
-
RisingWave - Streaming databases and continuous SQL queries.
- 📅 Updates: 2x/month | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://risingwave.com/blog/rss.xml
-
Materialize Blog - Streaming SQL, incremental view maintenance, and real-time data warehousing.
- 📅 Updates: 2x/month | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://materialize.com/blog/rss.xml
-
Tinybird Blog - Real-time analytics APIs, low-latency data platforms.
- 📅 Updates: Weekly | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://www.tinybird.co/blog-posts/rss.xml
Feeds on Terraform, infrastructure automation, configuration management, and cloud resource provisioning.
-
HashiCorp Blog - Terraform, Vault, Consul, and infrastructure as code best practices.
- 📅 Updates: 2-3x/week | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://www.hashicorp.com/blog/feed.xml
-
Pulumi Blog - Infrastructure as code using real programming languages (TypeScript, Python, Go).
- 📅 Updates: Weekly | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://www.pulumi.com/blog/rss.xml
-
env0 Blog - Infrastructure as code automation, Terraform management, and IaC governance.
- 📅 Updates: 2x/month | 📊 Quality: ⭐⭐⭐
- RSS:
https://www.env0.com/blog/rss.xml
Note: This section has been merged into "Backend Engineering & Distributed Systems" above for better organization.
Engineering blogs from tech companies building at scale.
-
Netflix Tech Blog - Microservices, chaos engineering, data pipelines, and streaming at scale.
- 📅 Updates: Weekly | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://netflixtechblog.com/feed
-
Uber Engineering - Large-scale data systems, real-time analytics, and platform engineering.
- 📅 Updates: 2x/week | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://www.uber.com/en-BG/blog/engineering/rss/
-
Slack Engineering - Real-time messaging infrastructure, databases, and developer tools.
- 📅 Updates: 2x/month | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://slack.engineering/feed
-
GitHub Engineering - Git infrastructure, database migrations, and developer experience.
- 📅 Updates: Weekly | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://github.blog/engineering/feed/
-
Cloudflare Blog - Edge computing, CDN, DDoS mitigation, and network performance.
- 📅 Updates: Daily | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://blog.cloudflare.com/rss/
-
AWS Compute Blog - Lambda, ECS, EC2, and serverless architecture patterns.
- 📅 Updates: 3x/week | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://aws.amazon.com/blogs/compute/feed/
-
Google Cloud Platform Blog - GCP services, BigQuery, and cloud architecture.
- 📅 Updates: Daily | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://cloudblog.withgoogle.com/rss/
-
Stripe Engineering - Payment infrastructure, API design, and reliability engineering.
- 📅 Updates: Monthly | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://stripe.com/blog/feed.rss
-
LinkedIn Engineering - Data infrastructure, AI/ML at scale, and social networks.
- 📅 Updates: 2x/week | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://engineering.linkedin.com/blog.rss.html
-
Airbnb Engineering - Data science, ML, and infrastructure at scale.
- 📅 Updates: 2x/month | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://medium.com/feed/airbnb-engineering
Personal blogs from exceptional engineers and thought leaders.
-
Dan Luu - Systems performance, latency analysis, and hardware-software interaction. Essential reading.
- 📅 Updates: Monthly | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://danluu.com/atom.xml
-
Brendan Gregg - Performance engineering, profiling, flame graphs, and eBPF.
- 📅 Updates: Monthly | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://www.brendangregg.com/blog/rss.xml
-
Julia Evans - Systems debugging, networking, and making hard things accessible. Creator of Wizard Zines.
- 📅 Updates: Monthly | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://jvns.ca/atom.xml
-
Denis Bakhvalov - CPU performance, compiler optimizations, and low-level performance tuning.
- 📅 Updates: Monthly | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://easyperf.net/feed.xml
-
Daniel Lemire - Algorithms, performance optimization, and software engineering research.
- 📅 Updates: Weekly | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://lemire.me/blog/feed/
-
Simon Willison - Python, LLMs, datasette, and AI applications. Creator of Datasette.
- 📅 Updates: Daily | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://simonwillison.net/atom/everything/
-
The Pragmatic Engineer - Gergely Orosz on engineering culture, big tech, and career growth.
- 📅 Updates: Weekly | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://newsletter.pragmaticengineer.com/feed
-
Martin Fowler - Software architecture, refactoring, and agile development.
- 📅 Updates: Monthly | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://martinfowler.com/feed.atom
-
Chelsea Troy - Software maintenance, technical leadership, and code comprehension.
- 📅 Updates: Monthly | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://chelseatroy.com/feed/
-
Coding Horror - Jeff Atwood (Stack Overflow co-founder) on programming and software design.
- 📅 Updates: Occasional | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://blog.codinghorror.com/rss/
-
Jessie Frazelle - Containers, security, and systems programming.
- 📅 Updates: Occasional | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://blog.jessfraz.com/index.xml
-
Cindy Sridharan - Distributed systems, observability, and testing in production.
- 📅 Updates: Quarterly | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://copyconstruct.medium.com/feed
High-quality newsletters and curated content aggregators.
-
Hacker News - Tech news and discussion from the startup community.
- 📅 Updates: Continuous | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://news.ycombinator.com/rssorhttps://hnrss.org/best(top stories only)
-
TLDR Newsletter - Daily tech news digest in 5 minutes.
- 📅 Updates: Daily | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://www.tldrnewsletter.com/rss
-
ByteByteGo - Alex Xu's system design newsletter and diagrams.
- 📅 Updates: Weekly | 📊 Quality: ⭐⭐⭐⭐⭐
- RSS:
https://blog.bytebytego.com/feed
-
Quastor - System design case studies from big tech companies.
- 📅 Updates: 3x/week | 📊 Quality: ⭐⭐⭐⭐
- RSS:
https://rss.beehiiv.com/feeds/nczRb4PQ6t.xml
-
Techpresso - Daily tech news summary.
- 📅 Updates: Daily | 📊 Quality: ⭐⭐⭐
- RSS:
https://www.dupple.com/techpresso-archives/rss.xml
While this list focuses on RSS feeds for staying current with ongoing developments, here are exceptional university courses and resources for deep technical learning:
Data Systems & Databases:
-
CMU 15-445: Intro to Database Systems - Andy Pavlo's legendary database fundamentals course. Query processing, storage, indexing, and transactions. Start here for DB internals.
-
CMU 15-721: Advanced Database Systems - Advanced topics in database internals. OLAP systems, in-memory databases, query optimization. Essential for understanding modern data warehouses.
-
Stanford CS246: Mining Massive Datasets - MapReduce, Spark, large-scale data processing algorithms. Link analysis, clustering, and dimensionality reduction.
Distributed Systems:
-
MIT 6.824: Distributed Systems - Robert Morris's legendary course. Raft consensus, fault tolerance, distributed transactions. Required reading for serious distributed systems work.
-
Martin Kleppmann's Distributed Systems Course - Author of "Designing Data-Intensive Applications." Consistency, replication, consensus algorithms.
AI & ML Systems:
-
UC Berkeley CS294: AI Agents - LLM agents, agentic systems, and autonomous AI. Cutting-edge research from 2024-2025.
-
Stanford CS329S: Machine Learning Systems Design - ML in production, deployment patterns, monitoring, and MLOps. Chip Huyen's course.
-
Full Stack Deep Learning - End-to-end ML systems. From problem framing to deployment and monitoring.
Data Engineering:
- DataTalks.Club Data Engineering Zoomcamp - Free hands-on course covering modern data stack: dbt, Airflow, Spark, cloud warehouses.
- Designing Data-Intensive Applications - Martin Kleppmann (The data engineering bible)
- Database Internals - Alex Petrov (How databases work under the hood)
- Fundamentals of Data Engineering - Joe Reis & Matt Housley (Modern data eng practices)
- Streaming Systems - Tyler Akidau et al. (Stream processing concepts)
- The Morning Paper Archive - 1,000+ CS paper summaries by Adrian Colyer
- Papers We Love - Community repository of academic CS papers
- Google Research Publications - MapReduce, Spanner, Bigtable, etc.
Note: These are learning resources, not RSS feeds. Bookmark for structured deep dives alongside the RSS feeds above for staying current.
Browse categories above and subscribe to individual feeds in your RSS reader.
- Download
feeds.opml - Import into your RSS reader:
- Inoreader: Settings → Import/Export → Import from OPML
- Feedly: Add Content → Import OPML
- NewsBlur: Import → Upload OPML file
- Any RSS reader: Look for Import/OPML option
Use my Content Intelligence Platform that:
- Analyzes all these feeds (~1,000 articles/day)
- Scores each by personal relevance (0-100)
- Assesses quality signal (validated vs emerging)
- Surfaces top 10-20 articles daily
- Provides conversational interface to ask "What should I read about Kubernetes?"
Time saved: 30 minutes/day → 5 minutes/day
Feeds are included based on:
✅ Technical Depth - Deep technical content, not marketing fluff
✅ Regular Updates - Active within last 3 months
✅ Practitioner Focus - Written by people building real systems
✅ Signal-to-Noise - High value per article, not quantity over quality
✅ Relevance - Directly applicable to data/platform/ML engineering
Feeds are removed if they:
❌ Become inactive (no posts in 6+ months)
❌ Shift to marketing/promotional content
❌ Decrease in quality or technical depth
❌ No longer relevant to data engineering
Suggestions welcome! Know a great feed that's missing?
- Check that it meets the curation criteria
- Verify it's been active in the last 3 months
- Open a pull request or issue with:
- Feed name and URL
- RSS feed URL
- Why it's valuable (1-2 sentences)
- Suggested category
See CONTRIBUTING.md for detailed guidelines.
- awesome-data-engineering - Tools and resources for data engineering
- awesome-mlops - MLOps tools and practices
- awesome-kubernetes - Kubernetes resources
Maintained by: @yourusername
Experience: Staff Software Engineer, 15+ years in backend and distributed systems
Portfolio: Content Intelligence Platform
Blog: Medium @random.droid
This list powers my daily workflow. Every morning, my AI assistant processes these ~1,000 articles and surfaces the most relevant content. I'm sharing the input sources to help others discover quality content.
Last Updated: January 2026
- Total Feeds: 106 RSS feeds
- Categories: 11
- Articles/Day: ~1,200+
- Curated Output: 10-20/day via AI
- Time Saved: 25 minutes/day
- Languages Covered: Python, Java, SQL, and more
To the extent possible under law, the author has waived all copyright and related rights to this work.
Found this useful? Give it a ⭐ and share with your team!
