-
Notifications
You must be signed in to change notification settings - Fork 2
Chore/code modernization #21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
61 commits
Select commit
Hold shift + click to select a range
af0e1ec
refactor: switch from Conda to uv for environment management
Essmaw 8d730ef
refactor: use click and pathlib.Path for CLI
Essmaw 81194b0
fix: update LangChain imports to restore compatibility
Essmaw c4b528b
refactor: move prompt to txt file in markdown format for clarity
Essmaw 492c57b
feat: add prompt_path option to CLI to query the chatbot
Essmaw a74c62d
refactor: streamline document processing when creating the database.
Essmaw ec52352
chore: add pre-commit configuration and dev dependency for prek
Essmaw 1b73bf9
chore: remove unnecessary scripts
Essmaw 1e9590b
feat: extend header renumbering to support level 5 headers in Markdow…
Essmaw 0af9c61
doc(parse_clean_markdown): add comments and improve docstrings
Essmaw e1e5a12
feat: enhance `create_database` script to support model and provider …
Essmaw 2628212
feat: add logger module with configurable logging to file and stdout
Essmaw a43861b
feat: improve logging with an example of chunk metadata for each mark…
Essmaw 538a039
refactor: remove useless constants
Essmaw c43ff00
docs: fix logs
Essmaw d575452
chore: update pre-commit
Essmaw bb76583
docs: update log
Essmaw 79ac42a
chore: uodate dependencies
Essmaw 809c54d
docs: update logs
Essmaw f1505c6
docs: improve documentation formatting with ruff
Essmaw 7203587
chore: adding dependency
93b2087
feat: adding pydantic model settings to configure CLI of BioPyAssista…
27702ce
refactor: fix the add of file_name in chunk metadata
0a7fa9b
chore: fix some docstrings and misspelling
eee5c7b
[WIP] - feat: adding the new version of streamlit IU
5e9c73e
feat: adding prompt path to the setting model.
Essmaw b1faabd
feat: simplify footer, adding the llm and prompt settings and improve…
Essmaw 83e5eaa
feat: create YAML file listing all Python chapters and levels
cb0b263
refactor(parse_clean_markdown): load chapters and output paths from Y…
6a04d81
docs: change log level to success for chapter loading message
ff1bdf0
refactor(create_database): update argument structure to use YAML for …
71de1da
feat(create_database): add chapter IDs and separate file_path and fi…
be07be0
chore: remove unuseful files
2f1ca08
chore: remove unused gradio dependency
d25335b
chore(app): remove model config settings (not part of this PR)
dd4f61a
chore(pre-commit): update ruff version and format configuration
abdbb0b
feat(create_database): enable embeddings with all OpenRouter models a…
adc182d
feat(query_chatbot): add level-filtered retrieval, vector DB and LLM…
01db3e7
chore(prompts): rename prompt directory to prompts
508e123
fix(parse_clean_markdown): validate processed_file_path before creat…
87397f9
chore(chapters_and_levels): update chapter IDs to string format
ebc0872
chore(readme): update setup instructions to use uv for environment ma…
1b64888
docs: update scripts to use 'uv' for execution
6c33371
chore: revert Streamlit app to previous version for history
55ba0fb
chore: remove deprecated config_app.toml file
a4c7d37
chore: update logging messages and argument names
1787b52
fixes(query_chatbot): rename ai_api_key to api_key for ChatOpenAI ini…
7200618
refactor(prompts): rename prompt folder and files for consistency
1838eb9
refactor(yml file): add quotes to values, replace source_file_path an…
d8c2ef8
chore(gitignore): ignore raw and processed course data paths
f19eb97
refactor(parse_and_clean_markdown): build chapter source and destinat…
2b33578
refactor(yml): update prompt_file to prompt_path for consistency
b66f18c
feat(query_chatbot): answer to the user question even though it is no…
8c39644
Refactor(create_database): update retrieval to use processed_file_path
f838700
refactor(query_chatbot): log answer sentence by sentence for clarity
3fd7939
chore: added .gitkeep files to ensure data directories are tracked
418cb3e
chore: update readme
340ce00
refactor: add checks to avoid AttributeError/UnboundLocalError
9bd746b
refactor(create_database): remove short-form options (-c, -s...) on CLI
911c274
refactor(prompt): translate the title in french and remove useless `<<<`
5f1fe1c
refactor(create_database): rename arguments for consistency
File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,36 @@ | ||
| # Install pre-commit hooks with: | ||
| # prek install | ||
| exclude: "|tmp/*|" | ||
| repos: | ||
| - repo: https://github.com/pre-commit/pre-commit-hooks | ||
| rev: v6.0.0 | ||
| hooks: | ||
| - id: end-of-file-fixer | ||
| - id: mixed-line-ending | ||
| - id: trailing-whitespace | ||
| - id: check-json | ||
| - id: check-yaml | ||
| - id: check-added-large-files | ||
| args: ["--maxkb=5000"] | ||
|
|
||
| - repo: https://github.com/asottile/pyupgrade | ||
| rev: v3.21.2 | ||
| hooks: | ||
| - id: pyupgrade | ||
|
|
||
| - repo: https://github.com/astral-sh/ruff-pre-commit | ||
| # Ruff version. | ||
| rev: v0.14.14 | ||
| hooks: | ||
| # Run the linter. | ||
| - id: ruff-check | ||
| types_or: [python, pyi] | ||
| args: [--fix] | ||
| # Run the formatter. | ||
| - id: ruff-format | ||
| types_or: [python, pyi] | ||
|
|
||
| - repo: https://github.com/PyCQA/bandit | ||
| rev: "1.9.2" | ||
| hooks: | ||
| - id: bandit | ||
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The top-level
exclude: "|tmp/*|"is a regex that matches the empty string (because of leading/trailing|alternations), which effectively excludes all files from running hooks. Use a proper regex that only matches the tmp directory (e.g.(^|/)tmp/).