Commit 6f0846c

Re-add MkDocs documentation (#31)
* docs: re-add MkDocs documentation
1 parent 5f4a16b commit 6f0846c

18 files changed: +199 -4 lines changed


contributing/DEVELOPMENT.md

Lines changed: 21 additions & 4 deletions
@@ -111,10 +111,27 @@ pre-commit run --all-files
 
 ### Building Documentation
 
-The project previously used MkDocs for documentation. Documentation now lives primarily in:
-- README.md - Main documentation
-- Docstrings in source code
-- Contributing guides in /contributing
+MkDocs (Material theme) powers the public documentation site hosted at `https://allisonwang-db.github.io/pyspark-data-sources/`.
+
+#### Preview Locally
+
+Run the live preview server (restarts on save):
+
+```bash
+poetry run mkdocs serve
+```
+
+The site is served at `http://127.0.0.1:8000/` by default.
+
+#### Build for Verification
+
+Before sending a PR, ensure the static build succeeds and address any warnings:
+
+```bash
+poetry run mkdocs build
+```
+
+Common warnings include missing navigation entries or broken links—update `mkdocs.yml` or the relevant Markdown files to resolve them.
 
 ### Writing Docstrings
 
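
The "Build for Verification" step added in the hunk above asks contributors to address any warnings. A minimal sketch of how that check could be automated follows, assuming MkDocs' `--strict` flag is acceptable for this project (it makes the build abort on warnings). The helper name `build_docs` and its `run` parameter are hypothetical and not part of this commit.

```python
import subprocess
from typing import Callable, List


def build_docs(strict: bool = True, run: Callable = subprocess.run) -> int:
    """Run the static MkDocs build and return its exit code.

    `run` is injectable so the command can be inspected without
    actually invoking poetry (hypothetical design choice).
    """
    cmd: List[str] = ["poetry", "run", "mkdocs", "build"]
    if strict:
        # --strict makes MkDocs abort on warnings instead of only printing them.
        cmd.append("--strict")
    completed = run(cmd, check=False)
    return completed.returncode
```

Calling `build_docs()` from a local script or pre-push hook would return a non-zero code whenever the build fails, which with `--strict` includes builds that emit warnings.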

contributing/RELEASE.md

Lines changed: 15 additions & 0 deletions
@@ -173,6 +173,21 @@ gh workflow run docs.yml
 # Go to Actions tab → Deploy MkDocs to GitHub Pages → Run workflow
 ```
 
+### Releasing the Documentation Site
+
+Follow these steps when you want to publish documentation updates:
+
+1. Verify the docs build locally:
+```bash
+poetry run mkdocs build
+```
+2. Commit any updated Markdown or configuration files and push to the default branch. This triggers the `docs.yml` workflow, which rebuilds and publishes the site to GitHub Pages.
+3. (Optional) If you need to deploy immediately without waiting for CI, run:
+```bash
+poetry run mkdocs gh-deploy
+```
+This command builds the site and pushes it to the `gh-pages` branch directly.
+
 ### Documentation URLs
 
 - **Live Docs**: https://allisonwang-db.github.io/pyspark-data-sources

docs/datasources/arrow.md

Lines changed: 6 additions & 0 deletions
@@ -0,0 +1,6 @@
+# ArrowDataSource
+
+> Requires the [`PyArrow`](https://arrow.apache.org/docs/python/) library. You can install it manually: `pip install pyarrow`
+> or use `pip install pyspark-data-sources[arrow]`.
+
+::: pyspark_datasources.arrow.ArrowDataSource
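
Several of the new pages note that a data source requires an optional library. As a hedged illustration only, the sketch below shows one way to detect a missing optional dependency before use, using the standard library's `importlib.util.find_spec`. The helpers `is_installed` and `require_optional` are hypothetical names, not functions from `pyspark_datasources`. The extra name is taken from the install hint in the page above.

```python
from importlib.util import find_spec


def is_installed(module_name: str) -> bool:
    """Return True if a top-level module can be located without importing it."""
    return find_spec(module_name) is not None


def require_optional(module_name: str, extra: str) -> None:
    """Raise a helpful ImportError when an optional dependency is missing.

    Hypothetical helper: the message mirrors the install hint in the docs page.
    """
    if not is_installed(module_name):
        raise ImportError(
            f"'{module_name}' is not installed; "
            f"try `pip install pyspark-data-sources[{extra}]`"
        )
```

For example, `require_optional("pyarrow", "arrow")` would return silently when PyArrow is available and raise an `ImportError` otherwise.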

docs/datasources/fake.md

Lines changed: 6 additions & 0 deletions
@@ -0,0 +1,6 @@
+# FakeDataSource
+
+> Requires the [`Faker`](https://github.com/joke2k/faker) library. You can install it manually: `pip install faker`
+> or use `pip install pyspark-data-sources[faker]`.
+
+::: pyspark_datasources.fake.FakeDataSource

docs/datasources/github.md

Lines changed: 3 additions & 0 deletions
@@ -0,0 +1,3 @@
+# GithubDataSource
+
+::: pyspark_datasources.github.GithubDataSource

docs/datasources/googlesheets.md

Lines changed: 3 additions & 0 deletions
@@ -0,0 +1,3 @@
+# GoogleSheetsDataSource
+
+::: pyspark_datasources.googlesheets.GoogleSheetsDataSource

docs/datasources/huggingface.md

Lines changed: 5 additions & 0 deletions
@@ -0,0 +1,5 @@
+# HuggingFaceDatasets
+
+> Requires the [`datasets`](https://huggingface.co/docs/datasets/en/index) library.
+
+::: pyspark_datasources.huggingface.HuggingFaceDatasets
Lines changed: 3 additions & 0 deletions
@@ -0,0 +1,3 @@
+# JSONPlaceholderDataSource
+
+::: pyspark_datasources.jsonplaceholder.JSONPlaceholderDataSource

docs/datasources/kaggle.md

Lines changed: 5 additions & 0 deletions
@@ -0,0 +1,5 @@
+# KaggleDataSource
+
+> Requires the [`kagglehub`](https://github.com/Kaggle/kagglehub) library.
+
+::: pyspark_datasources.kaggle.KaggleDataSource

docs/datasources/lance.md

Lines changed: 6 additions & 0 deletions
@@ -0,0 +1,6 @@
+# LanceSink
+
+> Requires the [`Lance`](https://lancedb.github.io/lance/) library. You can install it manually: `pip install lance`
+> or use `pip install pyspark-data-sources[lance]`.
+
+::: pyspark_datasources.lance.LanceSink
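
Each new page under `docs/datasources/` ends with a `::: dotted.path` line, which appears to be a mkdocstrings-style autodoc directive. A small sketch of how a reviewer might list those identifiers from a page's text follows, for example to cross-check them against the package layout. The names `find_identifiers` and `split_identifier` are hypothetical, and the regular expression assumes the directive sits at the start of a line, as it does in the pages above.

```python
import re
from typing import List, Tuple

# Matches a line of the form "::: package.module.ClassName".
DIRECTIVE = re.compile(r"^::: +(\S+)\s*$", re.MULTILINE)


def find_identifiers(markdown: str) -> List[str]:
    """Return every dotted identifier named by a ':::' directive line."""
    return DIRECTIVE.findall(markdown)


def split_identifier(identifier: str) -> Tuple[str, str]:
    """Split 'pkg.module.Name' into ('pkg.module', 'Name')."""
    module, _, name = identifier.rpartition(".")
    return module, name
```

Applied to the text of `docs/datasources/lance.md`, `find_identifiers` would yield the single identifier `pyspark_datasources.lance.LanceSink`, which `split_identifier` separates into module path and class name.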
