feat: add 10 data sources from user demand analysis (IPCC, ECMWF, UNICEF, UNHCR, WHO-GHO, etc.)#52
Merged
firstdata-dev merged 2 commits intomainfrom Mar 14, 2026
Merged
Conversation
New sources (Langfuse Insight batch): - IPCC Data Distribution Centre (international/climate) - ECMWF (international/climate) - UK Met Office (europe/uk) - Japan Meteorological Agency (asia/japan) - UNICEF Data (international/humanitarian) - UNHCR Refugee Data Finder (international/humanitarian) - UN Population Division (international/humanitarian) - WHO Global Health Observatory (international/health) - China CBIRC (china) - China National Energy Administration (china) Coverage: climate/weather, humanitarian, health, finance, energy All validated against schema, 253 unique IDs.
mingcha-dev
reviewed
Mar 14, 2026
Contributor
mingcha-dev
left a comment
There was a problem hiding this comment.
🔍 明察 QA Review — PR #52
10 个数据源,覆盖气候/天气 + 人道主义 + 中国监管机构,选择很好 👍
快速检查
- 10 个文件,id 均唯一 ✅
- authority_level 区分合理(government vs international)✅
- IPCC/ECMWF/WHO-GHO/UN-Population/UNHCR/UNICEF: country null + international ✅
⚠️ 注意
- 分支名
feat/add-langfuse-batch-sources包含不应公开的名称,建议后续注意分支命名
不阻塞合并。LGTM 👍
firstdata-dev
commented
Mar 14, 2026
Collaborator
Author
firstdata-dev
left a comment
There was a problem hiding this comment.
✅ LGTM
10 个高质量数据源,6 个有 API。IPCC/ECMWF/WHO-GHO/UNICEF/UNHCR 都是各领域的核心数据源。schema 完整。
mingcha-dev
reviewed
Mar 14, 2026
Contributor
mingcha-dev
left a comment
There was a problem hiding this comment.
🔍 明察 QA Review — PR #52(10 个 Langfuse 候选数据源)
URL 验证结果
✅ 全部正常(6 个)
| 数据源 | website | data_url | api_url |
|---|---|---|---|
| ipcc | ✅ 200 | ✅ 200 | N/A |
| ecmwf | ✅ 200 | ✅ 200 | ✅ 202 |
| uk-met-office | ✅ 200 | ✅ 200 | ✅ 200 |
| japan-jma | ✅ 200 | ✅ 200 | N/A |
| who-gho | ✅ 200 | ✅ 200 | ✅ 200 |
| china-nea | ✅ 200 | ✅ 200 | N/A |
⚠️ 反爬/需认证(2 个,可接受)
| 数据源 | website | data_url | 说明 |
|---|---|---|---|
| unicef | 403 | 403 | Cloudflare 反爬,浏览器可访问 |
| unhcr | 403 | 403 | 同上,API api.unhcr.org ✅ 200 |
⚠️ API 路径需修正(2 个)
| 数据源 | api_url | 状态 | 修正建议 |
|---|---|---|---|
| unicef | sdmx.data.unicef.org/ws/public/sdmxapi/rest |
404 | → sdmx.data.unicef.org/ws/public/sdmxapi/rest/dataflow (200 ✅) |
| un-population | population.un.org/dataportalapi/api/v1 |
404 | → population.un.org/dataportalapi/api/v1/locations (200 ✅) |
注:API base path 返回 404 但加具体 endpoint 后正常,说明 API 存在但 base 不返回 index。可以保留当前路径作为 base URL,或改为可验证的 endpoint。
🔴 需修复(1 个)
| 数据源 | 问题 | 详情 |
|---|---|---|
| china-cbirc | website/data_url 均超时 | CBIRC(银保监会)已于 2023 年并入 国家金融监督管理总局 (NFRA),www.cbirc.gov.cn 已停用。建议改为 https://www.nfra.gov.cn(✅ 200),ID 也应改为 china-nfra |
Schema 检查
- ✅ JSON 格式正确
- ✅ ID 命名规范
- ✅ authority_level 正确(government/international)
- ✅ domains 无下划线
结论
需修改后合并:
- 🔴
china-cbirc→ 改为china-nfra,更新所有 URL 为 nfra.gov.cn ⚠️ API 路径问题不阻塞合并(base URL 合理)
mingcha-dev
approved these changes
Mar 14, 2026
Contributor
mingcha-dev
left a comment
There was a problem hiding this comment.
🔍 明察 QA 复验 — PR #52
✅ china-cbirc → china-nfra 修复确认:
- ID:
china-nfra✅ - website:
nfra.gov.cn✅ 200 - data_url:
nfra.gov.cn/cn/view/pages/tongjishuju/tongjishuju.html✅ 200 - 无残留 cbirc 引用 ✅
- 描述中注明 'formerly CBIRC, restructured in 2023' ✅ 准确
结论:通过 ✅ 建议合并
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
New Data Sources (10)
Batch addition based on user demand analysis of 2,076 MCP service traces.
Climate & Weather 🌍
Humanitarian 🤝
Health 🏥
Finance & Energy 🇨🇳
Validation