Learn MCP Development by Example A complete, production-ready Model Context Protocol (MCP) server that demonstrates best practices for building intelligent document processing tools.
This is a Model Context Protocol (MCP) server that intelligently splits Markdown documents into hierarchical sections while preserving parent-child relationships and sibling connections. Built with FastMCP and FastAPI, it serves as both a useful tool and an educational resource for developers learning to create their own MCP servers. The project includes automated testing and deployment via GitHub Actions workflows.
The Model Context Protocol enables AI assistants to access external tools and data sources securely and efficiently. This server demonstrates how to:
- Build MCP-compliant tools that AI models can use
- Handle structured document processing
- Maintain relationships in hierarchical data
- Integrate with modern Python web frameworks
- Hierarchical Splitting: Maintains header relationships (H1 → H2 → H3...)
- Sibling Detection: Identifies headers at the same level for navigation
- Code-Aware Processing: Ignores
#comments in fenced code blocks - Rich Metadata: Includes parent references, sibling lists, and optional token counts
- MCP Compliant: Full Model Context Protocol interface support
- FastAPI Powered: High-performance async server with automatic OpenAPI docs
- Well Tested: Core splitting functionality covered with pytest
- Type Safety: Complete type hints with Pydantic models
- Docker Ready: Containerized deployment support
- CI/CD: Automated testing and deployment via GitHub Actions
- Python >=3.12, <3.13
This project uses Poetry for dependency management. To get started:
git clone https://github.com/Gal-Gilor/markdown-mcp.git
cd markdown-mcp
python -m venv .venv && source .venv/bin/activate
poetry install
python src/main.pyTo run the MCP server (if the environment is activated):
python src/main.pyIf the environment is not activated, you can run using Poetry:
poetry run python src/main.py-
The server will start on
http://0.0.0.0:8080. -
The MCP server is mounted at
/server. -
The MCP server is accessible at
/server/mcp. -
Available tools:
-
split_text(text: str) -> list[Section]: Splits the input Markdowntextinto a list ofSectionobjects.{ "section_header": "Getting Started", "section_text": "Welcome to the guide...", "header_level": 2, "metadata": { "parents": {"h1": "Introduction"}, "siblings": ["Advanced Topics", "FAQ"] } }Example Request:
curl -X POST http://localhost:8080/server/mcp/tools/call \ -H "Content-Type: application/json" \ -H "Accept: application/json, text/event-stream" \ -d '{ "jsonrpc": "2.0", "id": 1, "method": "tools/call", "params": { "name": "split_text", "arguments": { "text": "# Header 1\n\nSome content here.\n\n## Header 2\n\nMore content." } } }'
-
src/
├── main.py # FastAPI app and MCP server setup
├── models.py # Pydantic data models
└── splitter.py # Core splitting logic
tests/
├── conftest.py # Test fixtures
└── test_splitter.py # Splitter functionality tests
- FastMCP Integration: Uses
@mcp.tooldecorator to expose functions as MCP tools - Pydantic Models: Type-safe data structures with validation
- Hierarchical Processing: Two-pass algorithm for building document structure
- Code Block Protection: Regex-based approach to ignore comments in code
- Code Block Processing: Protects
#comments in fenced code blocks - Header Detection: Uses regex pattern
^(#+)\s+(.+)$to find headers - Hierarchy Building: Maintains parent-child relationships using a stack
- Sibling Detection: Groups headers at same level with same parent
- Section Creation: Converts hierarchical structure to flat list of sections
python -m venv .venv && source .venv/bin/activate
poetry installmake lint: Run ruff for code formatting and lintingmake test: Run pytest test suitepython src/main.py: Start the development server
# Run all tests
poetry run pytest
# Run with verbose output
poetry run pytest -v
# Run specific test file
poetry run pytest tests/test_splitter.pyThe project includes GitHub Actions workflows for automated testing and deployment:
- Execute Tests (
.github/workflows/execute_main_tests.yaml): Runs pytest test suite on push/PR to main branch - Deploy to Cloud Run (
.github/workflows/deploy_main.yaml): Automatically deploys to Google Cloud Run on push to main branch