Skip to content

feat: add comprehensive proptest fuzzing with property tests#8

Merged
Berektassuly merged 1 commit intomainfrom
claude/add-proptest-fuzzing-j5ElW
Dec 27, 2025
Merged

feat: add comprehensive proptest fuzzing with property tests#8
Berektassuly merged 1 commit intomainfrom
claude/add-proptest-fuzzing-j5ElW

Conversation

@Berektassuly
Copy link
Copy Markdown
Owner

Expand property-based testing coverage from 13 to 76 test cases covering:

  • Telegram parsing: extract_telegram_text, parse_unix_timestamp, parse_telegram_message with various JSON structures
  • WhatsApp parsing: detect_whatsapp_format (5 date formats), is_whatsapp_system_message, parse_whatsapp_timestamp
  • Instagram parsing: fix_mojibake_encoding, parse_instagram_message
  • Discord parsing: parse_discord_message with attachments/stickers
  • Filter system: date parsing, combined filters, edge cases
  • Output formats: CSV, JSON, JSONL validity and special char handling
  • Unicode: ZWJ sequences, combining diacritics, Cyrillic, CJK, Arabic, emoji
  • Robustness: no-panic guarantees, idempotency, monotonicity

Add comprehensive strategies for generating test data:

  • arb_telegram_text_value() - JSON with strings, arrays, nested objects
  • arb_whatsapp_line() - lines for each WhatsApp date format
  • arb_discord_raw_message() - full Discord message structures
  • arb_instagram_raw_message() - Instagram with shares
  • arb_date_string() - valid and invalid date formats

Expand property-based testing coverage from 13 to 76 test cases covering:

- Telegram parsing: extract_telegram_text, parse_unix_timestamp,
  parse_telegram_message with various JSON structures
- WhatsApp parsing: detect_whatsapp_format (5 date formats),
  is_whatsapp_system_message, parse_whatsapp_timestamp
- Instagram parsing: fix_mojibake_encoding, parse_instagram_message
- Discord parsing: parse_discord_message with attachments/stickers
- Filter system: date parsing, combined filters, edge cases
- Output formats: CSV, JSON, JSONL validity and special char handling
- Unicode: ZWJ sequences, combining diacritics, Cyrillic, CJK, Arabic, emoji
- Robustness: no-panic guarantees, idempotency, monotonicity

Add comprehensive strategies for generating test data:
- arb_telegram_text_value() - JSON with strings, arrays, nested objects
- arb_whatsapp_line() - lines for each WhatsApp date format
- arb_discord_raw_message() - full Discord message structures
- arb_instagram_raw_message() - Instagram with shares
- arb_date_string() - valid and invalid date formats
@Berektassuly Berektassuly merged commit 97165cf into main Dec 27, 2025
7 of 8 checks passed
@Berektassuly Berektassuly deleted the claude/add-proptest-fuzzing-j5ElW branch December 27, 2025 17:01
@codecov-commenter
Copy link
Copy Markdown

⚠️ Please install the 'codecov app svg image' to ensure uploads and comments are reliably processed by Codecov.

Codecov Report

✅ All modified and coverable lines are covered by tests.

📢 Thoughts on this report? Let us know!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants