Issue/gemnai#188

Open
sathwik13198 wants to merge 130 commits into sohzm:master from sathwik13198:issue/gemnai

Conversation

@sathwik13198 sathwik13198 commented Dec 27, 2025

Pseudo-Live Interview Assistant

A near-real-time interview assistant that listens to spoken interview questions, understands them, and generates concise, accurate answers using Gemini — without relying on gated or unsupported Live APIs.


📌 Overview

This project is designed to assist users during interviews by:

  • Listening to interview questions spoken aloud (Zoom / Google Meet / Microsoft Teams)
  • Understanding the question context
  • Generating correct and concise answers in near real time
  • Supporting screenshots along with text when required

The system follows a pseudo-live architecture that is production-safe, scalable, and compatible with currently available Gemini models.


❗ Original Issue (Gemini Live API)

The initial implementation attempted to use Gemini Live API (bidiGenerateContent) for streaming audio + reasoning.

Root Causes

1. Unsupported Schema

The Gemini Live API does not support:

  • Speaker diarization
  • enableSpeakerDiarization
  • minSpeakerCount

This caused early JSON validation failures.

2. Live API Access Is Gated

Inspection via GET /v1beta/models revealed:

  • None of the models available to the API key supported bidiGenerateContent
  • No *-live-* models were enabled
  • Live API access is granted per project via an allow-list

➡️ This was an account / project limitation, not a code bug.


💡 Key Insight

Gemini is fundamentally a text-first reasoning model.

It does not require raw audio input to:

  • Understand interview questions
  • Generate accurate answers

Audio therefore only needs to be transcribed to text before reasoning, which is the same approach Copilot-style tools use.


✅ Implemented Solution: Pseudo-Live Architecture

Instead of:

Audio → Gemini Live API

The system uses:

Audio → Speech-to-Text → Text → Gemini

This approach:

  • Avoids gated Live APIs
  • Is production-safe
  • Provides near-real-time performance
  • Works with currently available Gemini models
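The handoff above can be sketched as a tiny orchestration function. This is illustrative only: `stt` and `askGemini` are injected stand-ins for the streaming STT service and the Gemini `generateContent` call, not the project's actual function names.

```javascript
// Pseudo-live handoff: transcribe first, then reason over plain text.
// `stt` and `askGemini` are hypothetical injected dependencies.
async function answerQuestion(audioSegment, stt, askGemini) {
    // Audio → Speech-to-Text
    const question = await stt(audioSegment);

    // Guard against empty/whitespace transcripts (e.g. background noise)
    if (!question || !question.trim()) return null;

    // Text → Gemini (plain generateContent, no Live API required)
    return askGemini(question.trim());
}
```

Because Gemini only ever sees final text, any generally available model works here and the gated bidiGenerateContent surface is never touched.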

🧠 Final Architecture

Interviewer speaks (Zoom / Meet / Teams)
        ↓
System audio capture
        ↓
Voice Activity Detection (VAD)
        ↓
Streaming Speech-to-Text
        ↓
Clean interview question (text)
        ↓
Gemini 2.5 Flash (generateContent)
        ↓
Answer displayed to user

⏱️ Latency Characteristics

Typical timings:

  • Audio capture + VAD: ~10–50 ms
  • Streaming STT: ~100–300 ms
  • Gemini 2.5 Flash response: ~200–500 ms

Total end-to-end latency: ~400–800 ms
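One way to keep per-stage numbers like these honest at runtime is to record timing samples per stage and sum the averages. This is a sketch in the spirit of the PR's PerformanceMetrics class; the class and method names here are illustrative, not the real implementation.

```javascript
// Tracks per-stage latency samples and reports averages.
class StageTimings {
    constructor() {
        this.samples = new Map(); // stage name → array of ms samples
    }
    record(stage, ms) {
        if (!this.samples.has(stage)) this.samples.set(stage, []);
        this.samples.get(stage).push(ms);
    }
    average(stage) {
        const s = this.samples.get(stage) || [];
        return s.length ? s.reduce((a, b) => a + b, 0) / s.length : 0;
    }
    // Sum of per-stage averages approximates expected end-to-end latency.
    endToEnd() {
        let total = 0;
        for (const stage of this.samples.keys()) total += this.average(stage);
        return total;
    }
}
```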


⚙️ Important Implementation Details

  • Partial transcripts are not sent to Gemini
  • Text is submitted only when:
    • Silence > ~600 ms
    • Sentence ends (?, .)
    • VAD commit event
  • Screenshots and text can be sent together in a single request
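The commit policy above reduces to a single predicate. This is a sketch with illustrative names and the ~600 ms threshold from the description; the real logic lives in the VAD/orchestrator code and may differ in detail.

```javascript
// Decide whether a buffered transcript should be sent to Gemini.
// Partial transcripts stay buffered until one of three conditions holds.
function shouldCommitTranscript({ silenceMs, text, vadCommit }, silenceThresholdMs = 600) {
    if (vadCommit) return true;                      // explicit VAD commit event
    if (silenceMs > silenceThresholdMs) return true; // long pause → question finished
    if (/[.?]$/.test(text.trim())) return true;      // sentence-final punctuation
    return false;                                    // keep buffering partials
}
```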

🏁 Final Verdict

This pseudo-live architecture:

  • Fully resolves the original Gemini API issue
  • Avoids unsupported and gated APIs
  • Is stable, scalable, and production-ready
  • Can later upgrade to the Live API with minimal changes

🙌 Credits & Acknowledgment

This repository is forked from Sushma1969.

All credit for the original idea, base architecture, and implementation goes to Sushma.
This fork only introduces a targeted fix and architectural adjustment required due to Gemini API limitations.


📄 License

This project follows the same license as the original repository.

Summary by CodeRabbit

  • New Features

    • Voice Activity Detection with Automatic and Manual modes, manual mic toggle, and keyboard toggle
    • Pseudo-live streaming assistant with real-time STT (multiple providers) and improved model selection/badge
    • Ollama optional chat provider and provider test in Settings
    • Enhanced code syntax highlighting and LaTeX rendering; copy-response/code shortcut
  • Documentation

    • Added architecture, integration, quick-start, testing, and STT comparison guides
  • Refactor

    • Removed History view; updated defaults (exam profile, compact layout) and window/stealth behavior
  • Tests

    • Expanded test suites for languages, prompts, VAD, timer, token tracking, and STT features

✏️ Tip: You can customize this high-level summary in your review settings.

amantiwari57 and others added 30 commits July 20, 2025 09:22
* Added Highlight.js version 11.9.0 for enhanced code highlighting capabilities.
* Introduced a new dark theme CSS for VSCode to improve code readability in dark mode.
* Updated Electron version to ^30.0.5 for improved stability.
* Added new dependencies including @modelcontextprotocol/sdk, accepts, ajv, body-parser, and others to enhance functionality.
* Updated various existing dependencies to their latest versions for better performance and security.
* Removed unnecessary dev flags from some dependencies.
* Updated the response container's CSS for improved code block appearance, including a new dark background and enhanced padding.
* Implemented detailed syntax highlighting for various programming languages using Highlight.js, improving readability and user experience.
* Added functionality to automatically detect and display the programming language in code blocks.
* Introduced a method to enhance code blocks with language-specific classes for better styling consistency.
* Increased default window height to improve user experience.
* Adjusted dynamic window height settings for various views to provide better layout consistency.
* Added Highlight.js and VSCode dark theme CSS for improved code block styling and syntax highlighting.
…est-cases-in-github-actions

Replace Jest with Vitest
* build: improve Windows CI workflow

* ci: run tests on ubuntu and windows
- Add global Ctrl+G keyboard shortcut to clear session and automatically restart
- Implement handleClearAndRestart method in CheatingDaddyApp for session management
- Add "Restart session" button in AppHeader with Ctrl+G styling matching Hide button
- Add visual shortcut hints across MainView, AssistantView, and AppHeader
- Ensure Ctrl+G works globally across all application views
- Automatically start new session after clearing (100ms delay)
- Added @ricky0123/vad-node dependency to original project
- Integrated VAD toggle in CustomizeView with persistent settings
- Enhanced renderer.js with VAD audio processing for both Linux and Windows
- Added VAD IPC handlers in main process (send-vad-audio-segment, update-vad-setting)
- VAD processor conditionally initialized based on user settings
- Maintains backward compatibility - works with or without VAD enabled
- All tests passing (10/10) including new VAD test suite
- Reduced CustomizeView padding from 12px to 8px
- Decreased settings container gaps from 12px to 8px
- Optimized form section margins and padding
- Reduced checkbox group margin-bottom from 10px to 6px
- Decreased window height for customize view (720px normal, 620px compact)
- Maintained all functionality while making UI more compact
- All tests passing (10/10)
- Main window: 650x450 → 800x450 (wider for better usability)
- Compact layout: 500x350 → 650x350 (proportionally wider)
- Customize view max-width: 600px → 750px (matches new width)
- Final dimensions provide perfect balance:
  * Height: Optimal for screen real estate (much shorter than original)
  * Width: Comfortable for content readability and interaction
- Maintains responsive design and all functionality
- All tests passing (10/10)
- Removed CLEANUP_SUMMARY.md, CONTRIBUTING.md, FEATURES.md
- Removed FRESH_FORK_SETUP.md, GITHUB_CONTRIBUTION_GUIDE.md
- Removed temporary UI comparison images (New UI.jpg, Old UI.jpg, Width Issue.jpg)
- Local repository now matches intended final state
- Ready for clean pull request
Kanishk1420 and others added 26 commits November 10, 2025 14:34
coderabbitai bot commented Dec 27, 2025

📝 Walkthrough

Adds a production-ready pseudo-live pipeline: VAD-driven audio capture, streaming STT (Gemini/Google), a Pseudo-Live Orchestrator with circuit-breaker and metrics, expanded IPC/preload surface, UI updates for VAD/mic/provider, many new STT/orchestrator modules, large docs, and extensive tests.
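For readers skimming the diff, the circuit-breaker idea can be reduced to a few lines. This is a simplified sketch of the pattern, not the actual class in src/utils/pseudoLiveOrchestrator.js; thresholds and state handling there will differ.

```javascript
// Minimal circuit breaker: after `failureThreshold` consecutive failures,
// block requests for `cooldownMs`, then allow a probe (half-open) request.
class CircuitBreaker {
    constructor(failureThreshold = 3, cooldownMs = 5000) {
        this.failureThreshold = failureThreshold;
        this.cooldownMs = cooldownMs;
        this.failures = 0;
        this.openedAt = null; // non-null while the breaker is open
    }
    canRequest(now = Date.now()) {
        if (this.openedAt === null) return true;
        if (now - this.openedAt >= this.cooldownMs) {
            // Cooldown elapsed: close the breaker and allow a probe.
            this.openedAt = null;
            this.failures = 0;
            return true;
        }
        return false;
    }
    recordSuccess() {
        this.failures = 0;
        this.openedAt = null;
    }
    recordFailure(now = Date.now()) {
        this.failures += 1;
        if (this.failures >= this.failureThreshold) this.openedAt = now;
    }
}
```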

Changes

  • Orchestrator & Resilience (src/utils/pseudoLiveOrchestrator.js): New production-grade PseudoLiveOrchestrator, CircuitBreaker, PerformanceMetrics; VAD/STT coordination, transcript deduplication, Gemini submission, telemetry and lifecycle APIs.
  • STT Implementations (src/utils/geminiSTT.js, src/utils/googleSpeechSTT.js, src/utils/speechToText.js): New streaming STT services: GeminiSTT (streaming, buffering, question detection), GoogleSpeechSTT (Google Cloud streaming), and a SpeechToText scaffold.
  • VAD & Audio (src/utils/vad.js, src/utils/gemini.js (macOS VAD parts), src/utils/googleSpeechSTT.js): New VADProcessor (modes, state machine, buffering), VAD integration in gemini flows, PCM/Float32 helpers and VAD-aware audio routing.
  • Gemini Integration & IPC (src/utils/gemini.js, src/utils/renderer.js, src/preload.js, src/index.js): Expanded Gemini routing (live/regular), pseudo-live toggles, many new IPC handlers (initialize/enable/stop/status/metrics/update/toggle), macOS audio hooks, and wider renderer API surface.
  • UI Components & Views (src/components/app/AppHeader.js, src/components/app/CheatingDaddyApp.js, src/components/views/AssistantView.js, src/components/views/CustomizeView.js, src/components/views/MainView.js, src/components/views/HelpView.js): New/updated public properties and handlers for mode/model, mic/VAD, chat provider, restart flow, localized greetings, LaTeX/code rendering, and shortcut wiring.
  • New/Removed Components (src/components/common/CustomDropdown.js, src/components/views/HistoryView.js, src/components/index.js): Added CustomDropdown component; removed HistoryView and its export from the barrel file.
  • Ollama Integration (src/utils/ollama.js, INTEGRATION_GUIDE.js, verify-ollama-integration.js, docs OLLAMA_INTEGRATION_COMPLETE.md and OLLAMA_INTEGRATION_COMPLETE_FINAL.md): New Ollama utilities, provider routing in gemini, preload exposure, settings UI wiring, and a verification script plus docs.
  • Prompts & Utilities (src/utils/prompts.js, src/utils/pseudoLiveOrchestrator.js): Prompt rewrites (profiles, coding rules, language gating) and orchestrator-linked prompt handling.
  • Window, Stealth & Native (src/utils/window.js, src/utils/stealthFeatures.js, src/utils/windowsNative.js): Window movement/drift prevention, always-on-top maintenance, new shortcuts (copyCodeBlocks, toggleMicrophone, restart), and Windows native screen-share hiding helpers.
  • Testing (src/__tests__/*): Extensive new tests: geminiConversation, languages, prompts, timer, tokenTracking, vad, etc.
  • Docs & Guides (ARCHITECTURE.md, PSEUDO_LIVE_README.md, PSEUDO_LIVE_INTEGRATION.js, IMPLEMENTATION_*, PRODUCTION_IMPLEMENTATION.md, TESTING_GUIDE.md, QUICK_START.md, STT_COMPARISON.md, DELIVERABLES.md): Large documentation additions covering architecture, integration, testing, deployment, STT comparisons, and quickstart.
  • Build & Config (forge.config.js, package.json, .gitignore): packager asar -> object unpack rule, afterCopy macOS chmod step, maker config tweaks, dependency additions/upgrades, and a new .gitignore entry for CLAUDE.md.
  • Assets & HTML (src/index.html, src/assets/katex-0.16.9.min.css): KaTeX assets added and integrated into the renderer HTML.
  • New Utilities & Services (src/utils/geminiSTT.js, src/utils/googleSpeechSTT.js, src/utils/ollama.js, src/utils/pseudoLiveOrchestrator.js, src/utils/vad.js, src/utils/speechToText.js): Multiple new service modules exported for STT, orchestrator, VAD and Ollama interactions.
  • Removed Legacy Docs (AGENTS.md): Deleted (guidelines/planning notes removed).

Sequence Diagram(s)

sequenceDiagram
    participant User as Interviewee
    participant AudioCapture as System Audio Capture
    participant VAD as VAD Processor
    participant STT as STT Service
    participant Orchestrator as Pseudo-Live Orchestrator
    participant Gemini as Gemini API
    participant UI as Renderer/UI

    User ->> AudioCapture: Speak (PCM frames)
    AudioCapture ->> VAD: Send frames
    activate VAD
    VAD ->> VAD: Detect speech / buffer
    alt Speech detected
        VAD ->> Orchestrator: Emit speech segment
    end
    deactivate VAD

    activate Orchestrator
    Orchestrator ->> STT: transcribe(segment)
    deactivate Orchestrator

    activate STT
    STT ->> STT: stream -> accumulate transcript
    STT ->> Orchestrator: onTranscriptComplete(text)
    deactivate STT

    activate Orchestrator
    Orchestrator ->> Orchestrator: circuit-breaker check & dedupe
    Orchestrator ->> Gemini: send transcript + context
    Gemini ->> Orchestrator: return response
    Orchestrator ->> UI: update transcript & response
    deactivate Orchestrator

    UI ->> User: render response
sequenceDiagram
    participant Renderer as Renderer
    participant IPC as IPC Channel
    participant Main as Main Process
    participant Orchestrator as PseudoLiveOrchestrator
    participant VAD as VADProcessor
    participant STT as GeminiSTTService

    Renderer ->> IPC: initialize-pseudo-live(config)
    IPC ->> Main: invoke initialize
    Main ->> Orchestrator: new PseudoLiveOrchestrator(config)
    Orchestrator ->> VAD: init VADProcessor
    Orchestrator ->> STT: init GeminiSTTService
    Main ->> IPC: ready
    IPC ->> Renderer: ready

    Renderer ->> IPC: send-audio-segment(pcm)
    IPC ->> Main: route to orchestrator.processAudioFrame
    Main ->> Orchestrator: processAudioFrame(pcm)
    Orchestrator ->> VAD: process frame
    alt VAD segment -> speech
        VAD ->> Orchestrator: emit segment
        Orchestrator ->> STT: transcribe(segment)
        STT ->> Orchestrator: transcriptComplete
        Orchestrator ->> Main: update UI via IPC
    end
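The "circuit-breaker check & dedupe" step in the first diagram guards against the STT service emitting the same final transcript twice. A minimal dedupe sketch (illustrative only, not the orchestrator's exact logic):

```javascript
// Returns a predicate that reports whether a transcript repeats the
// previous one after whitespace/case normalization.
function makeDeduper() {
    let last = null;
    return function isDuplicate(text) {
        const norm = text.trim().toLowerCase().replace(/\s+/g, ' ');
        if (norm === last) return true;
        last = norm;
        return false;
    };
}
```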

Estimated code review effort

🎯 5 (Critical) | ⏱️ ~120 minutes

Poem

🐇 I hopped through code to catch each voice—
VAD ears twitch, STT lends choice.
Circuits watch and metrics beam,
Pseudo-live brings the near-live dream.
Now interviews flow, swift as steam.

Pre-merge checks and finishing touches

❌ Failed checks (1 warning, 1 inconclusive)

  • Docstring Coverage (⚠️ Warning): Docstring coverage is 47.83%, which is below the required threshold of 80.00%. Resolution: run @coderabbitai generate docstrings to improve docstring coverage.
  • Title check (❓ Inconclusive): The PR title 'Issue/gemnai' is vague and does not clearly convey the substantial changes in this PR, which include pseudo-live architecture, STT integration, VAD improvements, Ollama support, and extensive documentation. Resolution: use a more descriptive title such as 'Implement pseudo-live interview assistant with STT and Ollama integration' or 'Replace Gemini Live API with pseudo-live architecture using STT'.

✅ Passed checks (1 passed)

  • Description Check (✅ Passed): Check skipped because CodeRabbit's high-level summary is enabled.
✨ Finishing touches
  • 📝 Generate docstrings
  • 🧪 Generate unit tests (beta)
    • Create PR with unit tests
    • Post copyable unit tests in a comment

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

Comment @coderabbitai help to get the list of available commands and usage tips.

@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 15

Note

Due to the large number of review comments, comments of Critical and Major severity were prioritized as inline comments.

🟡 Minor comments (14)
src/components/common/CustomDropdown.js-17-20 (1)

17-20: Unusual container width may cause layout overflow.

The .dropdown-container has width: 120%, which will extend beyond its parent container and may cause horizontal scrolling or clipping issues.

🔎 Suggested fix
         .dropdown-container {
             position: relative;
-            width: 120%;
+            width: 100%;
         }
src/components/common/CustomDropdown.js-184-189 (1)

184-189: Click-outside detection may not work correctly with shadow DOM.

this.shadowRoot.contains(e.target) will return false for clicks inside the shadow DOM when the event originates from the document level, because e.target is retargeted to the host element. This could cause the dropdown to close unexpectedly when clicking inside it.

🔎 Suggested fix using composedPath()
     _handleClickOutside(e) {
-        if (!this.shadowRoot.contains(e.target) && this.isOpen) {
+        const path = e.composedPath();
+        if (!path.includes(this) && this.isOpen) {
             this.isOpen = false;
             this.requestUpdate();
         }
     }
src/utils/speechToText.js-264-268 (1)

264-268: Language in callback metadata uses stale config value.

The commitTranscript method passes STT_CONFIG.languageCode which may not reflect the language used during streaming if startStreaming was called with a different languageCode parameter. Use an instance property set during startStreaming.

PSEUDO_LIVE_README.md-39-39 (1)

39-39: Silence threshold documentation doesn't match implementation.

The documentation states "600ms threshold" but src/utils/vad.js defines silenceThreshold: 200. This discrepancy appears in multiple places in this file (lines 39, 81, 397).

Update all references to use the correct 200ms value, or update the code if 600ms is the intended threshold.

ARCHITECTURE.md-141-150 (1)

141-150: Documentation shows incorrect silenceThreshold value.

The documentation states silenceThreshold: 600 but src/utils/vad.js defines it as 200 (line 18). This discrepancy could mislead developers tuning VAD behavior.

🔎 Proposed fix
 **Thresholds:**
 ```javascript
 {
-    silenceThreshold: 600,    // 600ms silence = question done
+    silenceThreshold: 200,    // 200ms silence = question done (OPTIMIZED for faster response)
     minRecordingTime: 200,    // 200ms minimum
     maxRecordingTime: 20000,  // 20s maximum
     preSpeechPadFrames: 2,    // 2 frames before speech
     postSpeechPadFrames: 1,   // 1 frame after speech
 }
```
ARCHITECTURE.md-408-418 (1)

408-418: Latency percentages sum to 114%, which is mathematically incorrect.

The percentage breakdown for Gemini STT shows STT Buffering at 55% and Gemini Response at 52%, totaling 114% (plus 4% + 3% = 121%). This appears to be a copy-paste error from the previous section.

🔎 Proposed fix

Recalculate the percentages based on actual latency values:

 Component          Target    Actual    % of Total
 ─────────────────────────────────────────────────
-Audio Capture      10-50ms   ~30ms     4%
-VAD Processing     10-50ms   ~20ms     3%
-STT Buffering      300-500ms ~410ms    55%
-Gemini Response    200-500ms ~387ms    52%
+Audio Capture      10-50ms   ~30ms     4%
+VAD Processing     10-50ms   ~20ms     2%
+STT Buffering      300-500ms ~410ms    48%
+Gemini Response    200-500ms ~387ms    46%
 ─────────────────────────────────────────────────
 TOTAL             600-1000ms 847ms     100% ⚠️
src/components/views/MainView.js-310-312 (1)

310-312: Malformed HTML structure and platform-inconsistent shortcut hint.

  1. The HTML indentation is broken - the <p> tag alignment is inconsistent with the surrounding markup.
  2. The shortcut text hardcodes "Ctrl+Alt+R" but Mac users would expect "Cmd+Option+R". Other shortcuts in this file (see getStartButtonText()) already handle platform detection.
🔎 Proposed fix
-                    <p class="shortcut-hint">
-            Press <strong>Ctrl+Alt+R</strong> to clear session and automatically restart
-        </p>
+            <p class="shortcut-hint">
+                Press <strong>${this.isMac() ? 'Cmd+Option+R' : 'Ctrl+Alt+R'}</strong> to clear session and automatically restart
+            </p>

You'll need to extract the platform detection logic or add a helper method:

isMac() {
    return navigator.platform.toUpperCase().indexOf('MAC') >= 0;
}

Committable suggestion skipped: line range outside the PR's diff.

src/utils/windowsNative.js-98-99 (1)

98-99: Unused import: ref-napi is imported but never used.

Line 99 imports ref-napi but it's never referenced in the ffi-napi fallback logic.

🔎 Proposed fix
             const ffi = require('ffi-napi');
-            const ref = require('ref-napi');
src/utils/geminiSTT.js-193-194 (1)

193-194: Misleading log message in constructor.

"Service initialized" is logged in the constructor, but actual initialization (API key validation, model test) happens in the async initialize() method. This can mislead debugging.

🔎 Proposed fix
         console.log('✅ [GEMINI STT] Service initialized');
+        // Note: This is constructor completion. Call initialize() to fully set up the service.

Or change the message:

-        console.log('✅ [GEMINI STT] Service initialized');
+        console.log('🔧 [GEMINI STT] Service instance created (call initialize() to start)');
PSEUDO_LIVE_INTEGRATION.js-327-329 (1)

327-329: beforeunload handler cannot await async functions.

The beforeunload event handler is marked async and awaits shutdownPseudoLiveSystem(), but browsers do not wait for promises in beforeunload handlers. The shutdown may not complete before the page unloads.

🔎 Proposed fix

For Electron apps, consider using window.addEventListener('close') via IPC or the main process will-quit event instead. For synchronous cleanup in the renderer:

-window.addEventListener('beforeunload', async (e) => {
-    await shutdownPseudoLiveSystem();
-});
+window.addEventListener('beforeunload', (e) => {
+    // Trigger async cleanup but don't await
+    // Main process should handle graceful shutdown
+    window.api.stopPseudoLive?.();
+});

Or move cleanup to Electron's main process lifecycle events.

Committable suggestion skipped: line range outside the PR's diff.

PSEUDO_LIVE_INTEGRATION.js-475-491 (1)

475-491: Duplicate orchestrator-error listener may cause issues.

A listener for orchestrator-error is already registered at line 160. Adding another at line 475 means both handlers will fire on each error, potentially triggering double error notifications and conflicting restart logic.

🔎 Proposed fix

Consolidate the error handling into a single listener:

-// Errors
-window.api.on('orchestrator-error', (data) => {
-    const { error, timestamp } = data;
-    console.error('❌ Error:', error);
-    showErrorNotification('Error processing audio. Please try again.');
-});
+// Errors with auto-restart
+window.api.on('orchestrator-error', async (data) => {
+    const { error, timestamp } = data;
+    console.error('❌ Error:', error);
+    showErrorNotification('Error processing audio. Please try again.');
+    
+    // Auto-restart logic
+    if (restartAttempts < MAX_RESTART_ATTEMPTS) {
+        restartAttempts++;
+        console.log(`Attempting auto-restart (${restartAttempts}/${MAX_RESTART_ATTEMPTS})...`);
+        await new Promise(resolve => setTimeout(resolve, 5000));
+        const success = await initializePseudoLiveSystem();
+        if (success) {
+            console.log('✅ Auto-restart successful');
+            restartAttempts = 0;
+        }
+    } else {
+        showErrorNotification('System failed to recover. Please restart the application.');
+    }
+});

Then remove the duplicate listener at lines 475-491.

Committable suggestion skipped: line range outside the PR's diff.

src/components/app/CheatingDaddyApp.js-299-310 (1)

299-310: The default model 'gemini-3-pro-preview' is valid, but the interview mode model is deprecated.

The default model 'gemini-3-pro-preview' is a valid Gemini 3 series identifier currently available via the API. However, the code also hardcodes gemini-2.0-flash-exp for interview mode (line 306), which is a legacy 2.0 variant that is being deprecated with a scheduled shutdown. Consider updating it to a current stable model like gemini-2.5-flash or gemini-3-pro.

src/components/app/CheatingDaddyApp.js-334-356 (1)

334-356: Indentation issue in handleClearAndRestart method definition.

The method definition has extra leading spaces that break indentation consistency with other methods in the class.

🔎 Fix indentation
-        handleClearAndRestart() {
+    handleClearAndRestart() {
         // Clear the current session and responses
         this.responses = [];
src/components/views/CustomizeView.js-920-948 (1)

920-948: update-vad-setting IPC has no handler on the main process

handleVADChange invokes:

await ipcRenderer.invoke('update-vad-setting', this.vadEnabled);

but the main process only defines an update-vad-mode handler in setupGeminiIpcHandlers; there is no ipcMain.handle('update-vad-setting', ...). This will cause ipcRenderer.invoke to reject with “no handler” and log an error every time the checkbox is toggled.

Either add a no‑op/logging handler on the main side or remove the IPC call and rely purely on localStorage until you need main‑process behavior.
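A minimal way to satisfy the invoke is sketched below. The handler body is factored into a plain function so it can run (and be tested) without Electron, and the registration line is shown as a comment; the channel name matches the renderer's call, but the body is an assumed placeholder, not the project's actual logic.

```javascript
// Handler body for the 'update-vad-setting' IPC channel. Keeping it a
// plain function makes it exercisable outside the Electron main process.
function handleUpdateVadSetting(_event, enabled) {
    // Main-process persistence could go here; for now just acknowledge.
    return { success: true, vadEnabled: Boolean(enabled) };
}

// Registration (in the main process, e.g. src/index.js):
//   const { ipcMain } = require('electron');
//   ipcMain.handle('update-vad-setting', handleUpdateVadSetting);
```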

🧹 Nitpick comments (36)
src/config.js (1)

38-50: Consider privacy implications of logging the config directory path.

The config directory path at line 38 includes the user's home directory path. While useful for debugging, this could expose user identifiers in logs that may be shared for support purposes.

Consider using a relative indicator or redacting part of the path in production builds.

🔎 Optional: Redact home directory in logs
 function ensureConfigDir() {
     const configDir = getConfigDir();
-    console.log('Config directory path:', configDir);
+    console.log('Config directory path:', configDir.replace(os.homedir(), '~'));
     if (!fs.existsSync(configDir)) {
src/components/common/CustomDropdown.js (1)

217-265: Add keyboard navigation for accessibility.

The dropdown lacks keyboard support (Tab focus, Enter/Space to open, Escape to close, Arrow keys to navigate options). This is an accessibility gap that prevents keyboard-only users from operating the control.

Consider adding:

  • tabindex="0" and role="listbox" on the container
  • role="option" on each option
  • aria-expanded state
  • @keydown handler for Enter, Escape, and Arrow key navigation
PRODUCTION_IMPLEMENTATION.md (1)

9-29: Minor: Add language identifier to ASCII diagram code block.

Per static analysis, fenced code blocks should have a language specified for better rendering.

🔎 Suggested fix
-```
+```text
 ┌─────────────────────────────────────────────────────────────────────┐
src/utils/window.js (2)

113-148: Consider reducing the polling interval or using event-driven approach.

The maintainTopPosition interval runs every 2 seconds indefinitely. While the cleanup on closed is correct, consider:

  1. Using a longer interval (e.g., 5 seconds) since this is a fallback mechanism
  2. Only activating when the window loses focus, rather than constant polling

The blur handler at lines 139-148 already handles immediate restoration, so the interval is a safety net that could be less frequent.


477-527: Extract inline JavaScript to a separate function for maintainability.

The 50-line inline JavaScript string passed to executeJavaScript is difficult to maintain, test, and debug. Consider extracting this logic to a function exposed via the preload script or IPC handler.

🔎 Suggested approach

In preload.js, expose a dedicated function:

// preload.js
contextBridge.exposeInMainWorld('api', {
    toggleMicrophoneFromShortcut: async () => {
        // Move the logic here, accessible from main process via IPC
    }
});

Then in window.js:

-                    mainWindow.webContents.executeJavaScript(`
-                        (async () => {
-                            // ... 50 lines of code ...
-                        })();
-                    `).then(result => {
+                    mainWindow.webContents.executeJavaScript(
+                        `window.api.toggleMicrophoneFromShortcut()`
+                    ).then(result => {
src/utils/speechToText.js (1)

130-140: Mock implementation should be clearly documented or completed.

The initializeStreamingConnection method is a placeholder. Consider either implementing proper streaming via @google-cloud/speech or throwing a "not implemented" error to fail fast during development.

src/index.html (1)

105-107: Consider using local KaTeX CSS instead of CDN for consistency.

The KaTeX JS is loaded from local assets (assets/katex-0.16.9.min.js), but the CSS is fetched from CDN. Per the AI summary, src/assets/katex-0.16.9.min.css exists locally. Using it would:

  • Ensure offline functionality
  • Maintain version consistency between CSS and JS
  • Avoid external network dependencies
🔎 Proposed fix
         <link rel="stylesheet" href="assets/highlight-vscode-dark.min.css">
-        <link rel="stylesheet" href="https://cdn.jsdelivr.net/npm/katex@0.16.9/dist/katex.min.css">
+        <link rel="stylesheet" href="assets/katex-0.16.9.min.css">
         <script src="assets/katex-0.16.9.min.js"></script>
STT_COMPARISON.md (1)

29-48: Add language specifier to code blocks for consistency.

The performance and cost analysis blocks use ASCII diagrams. Adding a language specifier (e.g., text or plaintext) improves markdown linter compliance and ensures consistent rendering. As per static analysis hints.

🔎 Example fix
 ### Performance
-```
+```text
 End-to-End Latency: 400-800ms
 ├─ Audio capture:    10-50ms
 ...
QUICK_START.md (1)

102-109: Add language specifier to expected output code block.

The expected console output block should have a language specifier for consistent formatting. As per static analysis hints.

🔎 Proposed fix
 **Expected console output:**
-```
+```text
 🎤 [ORCHESTRATOR] Received audio segment from VAD
 📝 [ORCHESTRATOR] Complete transcript: "What is machine learning?"
IMPLEMENTATION_SUMMARY.md (1)

261-278: Add language specifiers to fenced code blocks.

Several code blocks are missing language specifiers (e.g., lines 262 and 275). This affects syntax highlighting and markdown linting.

🔎 Proposed fix
 ### Built-in Metrics
-```
+```javascript
 {
     totalRequests: 47,
     averageLatency: 623,  // Within target!
 ### Access Metrics
-```
+```javascript
 const status = await window.electron.ipcRenderer.invoke('get-orchestrator-status');
 console.log('Performance:', status.metrics);
src/__tests__/vad.test.js (2)

46-57: Tests are trivial and don't validate actual behavior.

These tests only verify that string literals exist in a hardcoded array. They don't test any real VAD mode functionality or behavior.

🔎 Suggested improvement

Consider testing actual VAD mode behavior when VADProcessor can be instantiated:

describe('VAD Modes', () => {
    it('should initialize in LISTENING state for automatic mode', async () => {
        // When VAD library is available, test:
        // const processor = new VADProcessor(() => {}, null, 'automatic');
        // await processor.initializeVAD();
        // expect(processor.getState()).toBe(VADState.LISTENING);
    });

    it('should initialize in PAUSED state for manual mode', async () => {
        // const processor = new VADProcessor(() => {}, null, 'manual');
        // await processor.initializeVAD();
        // expect(processor.getState()).toBe(VADState.PAUSED);
    });
});

38-43: Test doesn't verify dual VAD mode support.

The test name says "should support dual VAD modes" but only checks that VADProcessor is defined. It doesn't validate that both modes work correctly.

src/__tests__/geminiConversation.test.js (2)

76-86: Tests are trivial and don't validate actual behavior.

Similar to the VAD mode tests, these only verify that string literals exist in a hardcoded array. They don't test any real session mode functionality.

Consider testing actual mode-specific behavior, such as different prompts or response handling for each mode.


101-113: Auto-reset behavior is documented but not tested.

The comment mentions "Auto-reset logic happens in gemini.js after 20 responses" but the test only verifies that 20 turns can be saved. Consider adding a test that verifies the auto-reset actually occurs.

🔎 Suggested improvement
it('should auto-reset session after 20 responses', () => {
    initializeNewSession();
    
    // Simulate 21 conversation turns
    for (let i = 0; i < 21; i++) {
        saveConversationTurn(`question ${i}`, `answer ${i}`);
    }
    
    const data = getCurrentSessionData();
    // Verify auto-reset occurred
    expect(data.history.length).toBeLessThan(21);
});
src/__tests__/timer.test.js (1)

3-19: Consider extracting and importing the actual function from AppHeader.js.

The test duplicates the getElapsedTime logic instead of importing it from AppHeader.js. This means if the implementation changes, tests may pass while actual behavior differs.

If the function can be extracted to a utility module, import it directly in tests:

// utils/timeFormat.js
export function getElapsedTime(startTime) { ... }

// In tests
import { getElapsedTime } from '../utils/timeFormat';
src/components/views/HelpView.js (1)

357-364: Inconsistent shortcut handling for "Clear session and restart".

The clearSession shortcut is rendered inline using a ternary on isMacOS, but it's not included in the keybinds object (lines 252-268). This means:

  1. It won't be customizable via Settings like other shortcuts
  2. It won't persist if saved to localStorage

If intentional (non-customizable shortcut), this is fine. Otherwise, add it to getDefaultKeybinds() for consistency.

🔎 Proposed fix to add clearSession to keybinds
 getDefaultKeybinds() {
     const isMac = cheddar.isMacOS || navigator.platform.includes('Mac');
     return {
         moveUp: isMac ? 'Alt+Up' : 'Ctrl+Up',
         // ... existing keybinds ...
         copyCodeBlocks: isMac ? 'Cmd+Shift+C' : 'Ctrl+Shift+C',
+        clearSession: isMac ? 'Cmd+Alt+R' : 'Ctrl+Alt+R',
     };
 }

Then update line 363:

-<div class="shortcut-keys">${this.formatKeybind(isMacOS ? 'Cmd+Alt+R' : 'Ctrl+Alt+R')}</div>
+<div class="shortcut-keys">${this.formatKeybind(this.keybinds.clearSession)}</div>
PSEUDO_LIVE_INTEGRATION.js (1)

173-207: Interval timer never cleared; potential resource leak.

The setInterval at line 175 runs indefinitely. If this integration code runs in a component that can be unmounted or reloaded, the old interval keeps firing and duplicate timers accumulate with each reload.

🔎 Proposed fix

Store the interval ID and clear it during shutdown:

+let performanceMonitoringInterval = null;
+
 function startPerformanceMonitoring() {
-    setInterval(async () => {
+    performanceMonitoringInterval = setInterval(async () => {
         // ... existing code ...
     }, 10000);
 }

+function stopPerformanceMonitoring() {
+    if (performanceMonitoringInterval) {
+        clearInterval(performanceMonitoringInterval);
+        performanceMonitoringInterval = null;
+    }
+}

Call stopPerformanceMonitoring() in shutdownPseudoLiveSystem().

INTEGRATION_GUIDE.js (1)

77-78: Variable shadowing in example code.

The function parameter is named model and line 77 also uses model for the Gemini model instance. This could cause confusion or bugs when adapting the example.

🔎 Proposed fix
-                    const model = genaiClient.getGenerativeModel({ 
+                    const textModel = genaiClient.getGenerativeModel({ 
                         model: 'gemini-2.5-flash',
                         systemInstruction: { parts: [{ text: systemPrompt }] },
                     });
                     
-                    const result = await model.generateContent(input.text);
+                    const result = await textModel.generateContent(input.text);
src/__tests__/tokenTracking.test.js (2)

3-72: Test duplicates production TokenTracker implementation.

The TokenTracker class is re-implemented in this test file instead of importing from renderer.js. If the production code changes, these tests may pass while the actual implementation is broken (or vice versa).

Consider extracting TokenTracker to a separate module that can be imported by both renderer.js and this test file:

// src/utils/tokenTracker.js
class TokenTracker { /* ... */ }
module.exports = { TokenTracker };

// In renderer.js
const { TokenTracker } = require('./tokenTracker');

// In test
const { TokenTracker } = require('../utils/tokenTracker');

121-129: Timing-dependent test may be flaky.

This test relies on Date.now() differences, which can be affected by system load or CI environment timing variations. The tolerance range (60-68 tokens for ~2 seconds) is reasonable but tight.

🔎 Consider using time mocking

For more reliable tests, use Jest's timer mocks:

it('calculates audio tokens based on elapsed time', () => {
    jest.useFakeTimers();
    const tracker = new TokenTracker();
    
    tracker.audioStartTime = Date.now();
    jest.advanceTimersByTime(2000); // Advance exactly 2 seconds
    tracker.trackAudioTokens();
    
    // 2 seconds × 32 tokens/second = 64 tokens
    const tokens = tracker.getTokensInLastMinute();
    expect(tokens).toBe(64);
    
    jest.useRealTimers();
});
TESTING_GUIDE.md (1)

71-83: Add language specifiers to "Expected Output" code blocks.

Several code blocks showing expected console output lack language specifiers (lines 72, 121, 182, 242, 301, 355). This triggers markdownlint MD040 warnings.

🔎 Proposed fix

Add a language specifier (e.g., text, console, or plaintext) to expected output blocks:

 **Expected Output:**
-```
+```text
 🧪 Testing orchestrator initialization...
 🚀 [ORCHESTRATOR] Initializing pseudo-live pipeline...

Also applies to: 120-130, 182-189, 241-250, 300-309, 354-367

src/utils/windowsNative.js (2)

26-38: Duplicated buffer-to-HWND conversion logic.

The buffer-to-handle conversion appears twice: lines 26-38 for PowerShell and lines 110-117 for ffi-napi. Consider extracting to a helper function.

🔎 Proposed refactor
function bufferToHwnd(nativeHandle) {
    if (Buffer.isBuffer(nativeHandle)) {
        if (nativeHandle.length === 8) {
            return nativeHandle.readBigUInt64LE(0);
        } else if (nativeHandle.length === 4) {
            return nativeHandle.readUInt32LE(0);
        }
        console.warn('Unexpected window handle size:', nativeHandle.length);
        return null;
    }
    return nativeHandle;
}

Then use it in both paths.

Also applies to: 110-117


80-82: Ambiguous timeout handling may mask real failures.

Logging "PowerShell command timed out, but may have succeeded" provides unclear feedback. The window style may or may not be set, making debugging difficult.

Consider verifying success after timeout by checking the window style, or at minimum distinguish this from success in logs:

         } catch (timeoutError) {
-            console.log('PowerShell command timed out, but may have succeeded');
+            console.warn('PowerShell command timed out - style change unverified');
         }
src/__tests__/prompts.test.js (1)

136-144: Minimal language support test provides limited coverage.

This test only verifies the prompt is defined. Consider testing that language instructions are actually embedded or that getSystemPrompt accepts a language parameter.

🔎 Proposed enhancement

If the prompts module supports language-specific content, add more specific tests:

it('has language instruction section', () => {
    const prompt = getSystemPrompt('interview', '', true);
    
    // Verify base prompt structure
    expect(prompt).toBeDefined();
    expect(typeof prompt).toBe('string');
    
    // If language support is dynamic, test the integration point
    // expect(prompt).toContain('LANGUAGE');
    // Or test with a language parameter if supported
});
src/utils/geminiSTT.js (1)

302-320: Queue processing lacks rate limiting for Gemini API.

The processQueue() method processes items as fast as possible without delays between API calls. Under high load (rapid speech), this could hit Gemini's rate limits.

🔎 Proposed fix to add rate limiting
     async processQueue() {
         if (this.isProcessingQueue || this.processingQueue.length === 0) {
             return;
         }
         
         this.isProcessingQueue = true;
         
         while (this.processingQueue.length > 0) {
             const item = this.processingQueue.shift();
             
             try {
                 await this.transcribeAudio(item.audio, item.duration);
+                // Rate limiting: minimum 100ms between API calls
+                await new Promise(resolve => setTimeout(resolve, 100));
             } catch (error) {
                 console.error('❌ [GEMINI STT] Error transcribing audio:', error);
             }
         }
         
         this.isProcessingQueue = false;
     }
src/index.js (1)

128-145: before-quit may fire repeatedly; guard against redundant flush attempts.

Using event.preventDefault() followed by app.exit() is correct, but if flushStorageData() fails, the error is caught and logged, then app.exit() is called. This is fine, but consider that before-quit may be called multiple times if the first quit attempt is prevented. A guard flag would prevent redundant flush attempts.

🔎 Suggested improvement with guard flag
+let isQuitting = false;
+
 app.on('before-quit', async (event) => {
     stopMacOSAudioCapture();
 
+    if (isQuitting) return;
+    isQuitting = true;
+
     // Flush localStorage and other storage to disk before quitting
     // This is CRITICAL for macOS to persist localStorage between restarts
     event.preventDefault();
src/utils/googleSpeechSTT.js (3)

122-143: Duplicate interimResults configuration.

interimResults: true is set both in STREAMING_CONFIG (line 39) and again when calling streamingRecognize (line 132). The spread operator already includes it from STREAMING_CONFIG.

🔎 Remove redundant parameter
             // Create recognize stream
             this.recognizeStream = this.client
                 .streamingRecognize({
                     config: streamConfig,
-                    interimResults: true,
                 })
                 .on('error', this.handleStreamError.bind(this))
                 .on('data', this.handleStreamData.bind(this));

194-201: Stream error handling stops streaming but doesn't notify caller.

When a stream error occurs, handleStreamError stops the stream and logs the error, but there's no callback to notify the caller about the failure. Consider adding an onError callback similar to onTranscriptComplete.

🔎 Add error callback
 class GoogleSpeechSTT {
-    constructor(onTranscriptComplete, onTranscriptPartial = null) {
+    constructor(onTranscriptComplete, onTranscriptPartial = null, onError = null) {
         this.onTranscriptComplete = onTranscriptComplete;
         this.onTranscriptPartial = onTranscriptPartial;
+        this.onError = onError;
         // ...
     }

     handleStreamError(error) {
         console.error('❌ [GOOGLE STT] Stream error:', error);
         
         // Stop streaming and notify
         this.stopStreaming();
         
-        // TODO: Implement retry logic if needed
+        if (this.onError) {
+            this.onError(error);
+        }
     }

347-357: PCM conversion logic duplicated with pseudoLiveOrchestrator.js.

This float32ToPCM16 function is nearly identical to the conversion in pseudoLiveOrchestrator.js (lines 1251-1257). Consider extracting to a shared utility module to avoid duplication.
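
A shared helper could look like the sketch below. The module path `src/utils/audioConvert.js` is a suggestion, and the clamping/scaling shown is the standard float-to-PCM16 conversion rather than a verbatim copy of either existing implementation:

```javascript
// src/utils/audioConvert.js (suggested location, hypothetical module)
// Convert Float32 samples in [-1, 1] to 16-bit signed PCM.
function float32ToPCM16(float32Array) {
    const pcm16 = new Int16Array(float32Array.length);
    for (let i = 0; i < float32Array.length; i++) {
        // Clamp, then scale asymmetrically so -1 maps to -32768 and 1 to 32767
        const s = Math.max(-1, Math.min(1, float32Array[i]));
        pcm16[i] = s < 0 ? s * 0x8000 : s * 0x7fff;
    }
    return pcm16;
}

module.exports = { float32ToPCM16 };
```

Both googleSpeechSTT.js and pseudoLiveOrchestrator.js could then require this module instead of carrying their own copies.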

README.md (1)

85-97: Consider using proper headings for VAD mode sections.

The static analysis correctly identifies that bold text is being used instead of proper Markdown headings. Using ### or #### headings would improve document structure and accessibility.

🔎 Use proper headings
 ### Voice Activity Detection (VAD) Modes
 
-**Automatic Smart Detection Mode**
+#### Automatic Smart Detection Mode
 - Continuously detects and processes speech in real-time
 - Best for natural conversations where the interviewer speaks continuously
 - Mic is always ON and listening
 
-**Manual Push-to-Talk Mode**
+#### Manual Push-to-Talk Mode
 - User-controlled mic toggle (Ctrl/Cmd + Shift + M)
src/components/app/CheatingDaddyApp.js (1)

358-370: Platform detection uses navigator.platform which is deprecated.

navigator.platform is deprecated and may be removed in future browser versions. Consider using navigator.userAgentData.platform with a fallback, or the existing cheddar.isMacOS if available in this context.

🔎 Use more reliable platform detection
     handleGlobalKeydown(e) {
         // Handle Ctrl+Alt+R (or Cmd+Option+R on Mac) for clearing and restarting session
-        const isMac = navigator.platform.toUpperCase().indexOf('MAC') >= 0;
+        const isMac = navigator.userAgentData?.platform === 'macOS' || 
+                      navigator.platform?.toUpperCase().indexOf('MAC') >= 0;
         const isClearShortcut = isMac
src/preload.js (1)

189-191: The send method lacks channel validation.

Unlike the on method which validates channels, the send method allows sending to any channel. Consider adding a whitelist for outbound channels as well.

🔎 Add channel validation for send
     send: (channel, data) => {
+        const validSendChannels = [
+            'update-keybinds',
+            'view-changed',
+            // Add other valid outbound channels
+        ];
+        
+        if (!validSendChannels.includes(channel)) {
+            console.warn(`[preload] Invalid send channel: ${channel}`);
+            return;
+        }
         ipcRenderer.send(channel, data);
     },
src/__tests__/languages.test.js (1)

503-782: Test assertions are repetitive and don't verify actual question processing.

Many tests iterate through languages and questions but only assert that the language mapping exists, which is already tested earlier. The inner loop over questions doesn't add value since the assertion is the same regardless of the question content.

Consider consolidating these tests or making them verify actual behavior:

-        it('should be ready to handle behavioral questions in all languages', () => {
-            for (const lang of SUPPORTED_LANGUAGES) {
-                const mappedLanguage = GEMINI_LANGUAGE_MAP[lang.code];
-                expect(mappedLanguage).toBeDefined();
-                
-                for (const question of REAL_INTERVIEW_QUESTIONS.interview.behavioral) {
-                    expect(mappedLanguage).toBe(lang.expectedLanguage);
-                }
-            }
-        });
+        it('should have language mappings for all supported languages', () => {
+            for (const lang of SUPPORTED_LANGUAGES) {
+                expect(GEMINI_LANGUAGE_MAP[lang.code]).toBe(lang.expectedLanguage);
+            }
+        });

The question data is valuable for documentation purposes but doesn't need to be iterated in tests unless you're verifying actual question processing behavior.

src/utils/renderer.js (1)

169-179: initializeGemini defaults prevent localStorage mode/model from ever being used

Because the function has default parameters mode = 'interview' and model = 'gemini-2.5-flash', the expressions:

const selectedMode = mode || localStorage.getItem('selectedMode') || 'interview';
const selectedModel = model || localStorage.getItem('selectedModel') || 'gemini-2.5-flash';

will always prefer the hard‑coded defaults when callers omit those arguments, ignoring selectedMode / selectedModel in localStorage.

If you intend to fall back to stored settings when the caller doesn’t specify mode/model, make the parameters nullable and only override when explicitly passed:

-async function initializeGemini(profile = 'interview', language = 'en-US', mode = 'interview', model = 'gemini-2.5-flash') {
+async function initializeGemini(profile = 'interview', language = 'en-US', mode = null, model = null) {
   const apiKey = localStorage.getItem('apiKey')?.trim();
   if (apiKey) {
     const selectedMode = mode || localStorage.getItem('selectedMode') || 'interview';
     const selectedModel = model || localStorage.getItem('selectedModel') || 'gemini-2.5-flash';
src/components/views/CustomizeView.js (1)

1129-1158: Model description text doesn’t match actual options

The Exam Assistant description says:

“uses Gemini API 2.5 Flash or 2.5 Pro …”

but the actual options are gemini-2.5-flash and gemini-3-pro-preview. Consider updating the text to match the concrete models (2.5 Flash / 3 Pro Preview) to avoid confusion.

src/utils/gemini.js (1)

1330-1393: Image/screenshot routing logic is nuanced; consider documenting mode-dependent behavior

The new send-image-content and send-screenshot-with-text handlers distinguish:

  • Interview mode + manual screenshots: wait for Live session readiness and send screenshot plus a rich text prompt.
  • Interview mode + non-manual screenshots: send raw screenshots only.
  • Coding/Exam mode: use the combined send-screenshot-with-text path that sends both media and text in one call to the coding-mode “session”.

The branching itself is sound, but the behavior is fairly subtle (e.g., why manual screenshots in interview mode add text while automated ones don’t, and why Live sends two separate messages vs a single combined one in coding mode). A short comment block or docstring outlining these invariants would make future changes safer.

Also applies to: 1413-1459
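
For illustration, the three invariants above could be summarized in a helper like this (`describeScreenshotRoute` is a hypothetical name, not part of gemini.js; the real handlers issue IPC/API calls rather than returning strings):

```javascript
// Hypothetical sketch encoding the routing invariants described above.
function describeScreenshotRoute(mode, isManualScreenshot) {
    if (mode === 'interview') {
        // Manual screenshots get an accompanying text prompt;
        // automated captures are sent as raw media only.
        return isManualScreenshot
            ? 'Live session: screenshot + text prompt (two messages)'
            : 'Live session: screenshot only';
    }
    // Coding/exam modes combine media and text in a single request.
    return 'generateContent: screenshot + text combined';
}
```

Even if the handlers stay as-is, a comment block stating these three branches in gemini.js would serve the same purpose.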

📜 Review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between ae5ff68 and 5b324a5.

⛔ Files ignored due to path filters (3)
  • package-lock.json is excluded by !**/package-lock.json
  • src/assets/katex-0.16.9.min.js is excluded by !**/*.min.js
  • src/assets/models/500px-Google_Gemini_icon_2025.svg.png is excluded by !**/*.png
📒 Files selected for processing (48)
  • .gitignore
  • AGENTS.md
  • ARCHITECTURE.md
  • IMPLEMENTATION_COMPLETE.md
  • IMPLEMENTATION_SUMMARY.md
  • INTEGRATION_GUIDE.js
  • PRODUCTION_IMPLEMENTATION.md
  • PSEUDO_LIVE_INTEGRATION.js
  • PSEUDO_LIVE_README.md
  • QUICK_START.md
  • README.md
  • STT_COMPARISON.md
  • TESTING_GUIDE.md
  • forge.config.js
  • package.json
  • src/__tests__/geminiConversation.test.js
  • src/__tests__/languages.test.js
  • src/__tests__/prompts.test.js
  • src/__tests__/timer.test.js
  • src/__tests__/tokenTracking.test.js
  • src/__tests__/vad.test.js
  • src/assets/katex-0.16.9.min.css
  • src/components/app/AppHeader.js
  • src/components/app/CheatingDaddyApp.js
  • src/components/common/CustomDropdown.js
  • src/components/index.js
  • src/components/views/AdvancedView.js
  • src/components/views/AssistantView.js
  • src/components/views/CustomizeView.js
  • src/components/views/HelpView.js
  • src/components/views/HistoryView.js
  • src/components/views/MainView.js
  • src/config.js
  • src/index.html
  • src/index.js
  • src/preload.js
  • src/script.js
  • src/utils/gemini.js
  • src/utils/geminiSTT.js
  • src/utils/googleSpeechSTT.js
  • src/utils/prompts.js
  • src/utils/pseudoLiveOrchestrator.js
  • src/utils/renderer.js
  • src/utils/speechToText.js
  • src/utils/stealthFeatures.js
  • src/utils/vad.js
  • src/utils/window.js
  • src/utils/windowsNative.js
💤 Files with no reviewable changes (3)
  • AGENTS.md
  • src/components/index.js
  • src/components/views/HistoryView.js
🧰 Additional context used
🧬 Code graph analysis (21)
src/utils/windowsNative.js (2)
src/utils/window.js (6)
  • require (1-1)
  • require (5-5)
  • require (72-72)
  • os (4-4)
  • path (2-2)
  • fs (3-3)
src/utils/stealthFeatures.js (3)
  • require (3-3)
  • require (22-22)
  • require (47-47)
src/utils/vad.js (3)
src/utils/gemini.js (12)
  • require (1-1)
  • require (2-2)
  • require (3-3)
  • require (4-4)
  • require (5-5)
  • require (6-6)
  • require (7-7)
  • require (8-8)
  • require (1053-1053)
  • i (1228-1228)
  • i (1241-1241)
  • i (1254-1254)
src/__tests__/vad.test.js (1)
  • require (1-1)
src/utils/renderer.js (4)
  • require (2-2)
  • VADProcessor (6-6)
  • i (151-151)
  • i (163-163)
src/utils/googleSpeechSTT.js (1)
src/utils/pseudoLiveOrchestrator.js (1)
  • QUESTION_DETECTION (65-70)
src/components/views/HelpView.js (2)
src/utils/window.js (2)
  • isMac (215-215)
  • isMac (546-546)
src/utils/renderer.js (1)
  • isMacOS (52-52)
src/__tests__/prompts.test.js (1)
src/utils/prompts.js (1)
  • profilePrompts (1-507)
src/preload.js (1)
src/__mocks__/electron.js (1)
  • ipcRenderer (5-5)
src/__tests__/vad.test.js (1)
src/utils/vad.js (2)
  • VAD_CONFIG (15-25)
  • VADState (7-13)
PSEUDO_LIVE_INTEGRATION.js (2)
src/utils/windowResize.js (1)
  • window (4-4)
src/utils/gemini.js (3)
  • response (258-258)
  • result (257-257)
  • result (698-698)
src/utils/geminiSTT.js (5)
src/preload.js (1)
  • require (4-4)
src/utils/gemini.js (15)
  • require (1-1)
  • require (2-2)
  • require (3-3)
  • require (4-4)
  • require (5-5)
  • require (6-6)
  • require (7-7)
  • require (8-8)
  • require (1053-1053)
  • result (257-257)
  • result (698-698)
  • i (1228-1228)
  • i (1241-1241)
  • i (1254-1254)
  • sample (1256-1256)
src/utils/pseudoLiveOrchestrator.js (2)
  • require (45-45)
  • require (46-46)
src/utils/speechToText.js (2)
  • require (14-14)
  • STT_CONFIG (17-30)
PSEUDO_LIVE_INTEGRATION.js (6)
  • apiKey (24-24)
  • result (216-216)
  • result (235-235)
  • result (257-257)
  • result (279-279)
  • result (308-308)
src/__tests__/languages.test.js (1)
src/utils/gemini.js (1)
  • selectedLanguageName (399-399)
src/utils/speechToText.js (4)
INTEGRATION_GUIDE.js (1)
  • require (21-21)
src/utils/geminiSTT.js (2)
  • require (34-34)
  • STT_CONFIG (37-59)
src/utils/pseudoLiveOrchestrator.js (2)
  • require (45-45)
  • require (46-46)
PSEUDO_LIVE_INTEGRATION.js (1)
  • apiKey (24-24)
src/utils/window.js (1)
src/utils/windowResize.js (1)
  • result (5-5)
src/script.js (1)
src/utils/windowResize.js (1)
  • window (4-4)
src/components/views/CustomizeView.js (5)
src/utils/gemini.js (1)
  • value (151-166)
src/utils/window.js (2)
  • isMac (215-215)
  • isMac (546-546)
src/utils/renderer.js (8)
  • vadEnabled (207-207)
  • vadEnabled (223-223)
  • vadEnabled (337-337)
  • vadEnabled (435-435)
  • vadMode (208-208)
  • vadMode (224-224)
  • vadMode (340-340)
  • vadMode (438-438)
src/utils/windowResize.js (1)
  • window (4-4)
src/__mocks__/electron.js (1)
  • ipcRenderer (5-5)
src/config.js (1)
forge.config.js (1)
  • fs (16-16)
src/__tests__/tokenTracking.test.js (1)
src/utils/renderer.js (10)
  • tilesX (80-80)
  • tilesY (81-81)
  • totalTiles (82-82)
  • elapsedSeconds (95-95)
  • audioTokens (98-98)
  • oneMinuteAgo (108-108)
  • maxTokensPerMin (126-126)
  • throttleAtPercent (127-127)
  • currentTokens (129-129)
  • throttleThreshold (130-130)
src/utils/renderer.js (2)
src/utils/gemini.js (9)
  • require (1-1)
  • require (2-2)
  • require (3-3)
  • require (4-4)
  • require (5-5)
  • require (6-6)
  • require (7-7)
  • require (8-8)
  • require (1053-1053)
src/__mocks__/electron.js (1)
  • ipcRenderer (5-5)
src/components/app/CheatingDaddyApp.js (2)
src/utils/renderer.js (2)
  • selectedMode (173-173)
  • selectedMode (633-633)
src/utils/window.js (2)
  • isMac (215-215)
  • isMac (546-546)
src/components/views/AssistantView.js (2)
PSEUDO_LIVE_INTEGRATION.js (1)
  • language (31-31)
src/__mocks__/electron.js (1)
  • ipcRenderer (5-5)
src/utils/gemini.js (2)
PSEUDO_LIVE_INTEGRATION.js (6)
  • result (216-216)
  • result (235-235)
  • result (257-257)
  • result (279-279)
  • result (308-308)
  • language (31-31)
INTEGRATION_GUIDE.js (1)
  • pseudoLiveOrchestrator (28-28)
src/components/app/AppHeader.js (1)
src/utils/renderer.js (1)
  • cheddar (863-896)
🪛 Biome (2.1.2)
src/assets/katex-0.16.9.min.css

[error] 1-1: Duplicate properties can lead to unexpected behavior and may override previous declarations unintentionally.

width is already defined here.

Remove or rename the duplicate property to ensure consistent styling.

(lint/suspicious/noDuplicateProperties)

src/index.js

[error] 5-5: Illegal return statement outside of a function

(parse)

🪛 LanguageTool
IMPLEMENTATION_SUMMARY.md

[grammar] ~5-~5: Ensure spelling is correct
Context: ...terview Assistant** that achieves 400-800ms end-to-end latency and works with **A...

(QB_NEW_EN_ORTHOGRAPHY_ERROR_IDS_1)


[grammar] ~108-~108: Ensure spelling is correct
Context: ...peech (recommended) - Fastest: 100-300ms - Most accurate: 90-95% - Cost: $1.44...

(QB_NEW_EN_ORTHOGRAPHY_ERROR_IDS_1)


[grammar] ~112-~112: Ensure spelling is correct
Context: ...mini STT (zero-cost) - Slower: 300-500ms - Good accuracy: 85-90% - Cost: $0 (u...

(QB_NEW_EN_ORTHOGRAPHY_ERROR_IDS_1)


[grammar] ~159-~159: Ensure spelling is correct
Context: ...ff:** Slightly slower (300-500ms vs 100-300ms) Verdict: Perfect for budget-consc...

(QB_NEW_EN_ORTHOGRAPHY_ERROR_IDS_1)


[grammar] ~337-~337: Ensure spelling is correct
Context: ...tions:** 1. Increase silence threshold (600ms → 800ms) 2. Use manual VAD mode for ful...

(QB_NEW_EN_ORTHOGRAPHY_ERROR_IDS_1)


[grammar] ~369-~369: Ensure spelling is correct
Context: ...or monitoring ✅ Optimized thresholds (600ms silence) ✅ Streaming where possible (...

(QB_NEW_EN_ORTHOGRAPHY_ERROR_IDS_1)

STT_COMPARISON.md

[grammar] ~198-~198: Ensure spelling is correct
Context: ...rovider | Monthly Cost | Per Interview (1hr) | |----------|--------------|---------...

(QB_NEW_EN_ORTHOGRAPHY_ERROR_IDS_1)


[grammar] ~225-~225: Ensure spelling is correct
Context: ... ⭐⭐⭐⭐⭐ Why? - Best performance (100-300ms STT) - True streaming with partial resu...

(QB_NEW_EN_ORTHOGRAPHY_ERROR_IDS_1)


[grammar] ~350-~350: Ensure spelling is correct
Context: ...) and the performance is excellent (400-800ms total latency). If budget is a hard co...

(QB_NEW_EN_ORTHOGRAPHY_ERROR_IDS_1)

PRODUCTION_IMPLEMENTATION.md

[grammar] ~45-~45: Ensure spelling is correct
Context: ...r Real-Time Performance ✅ - Target: 400-800ms end-to-end latency - Streaming STT for ...

(QB_NEW_EN_ORTHOGRAPHY_ERROR_IDS_1)


[grammar] ~47-~47: Ensure spelling is correct
Context: ...ive results - Optimized VAD thresholds (600ms silence) - Minimal audio buffering ###...

(QB_NEW_EN_ORTHOGRAPHY_ERROR_IDS_1)


[grammar] ~67-~67: Ensure spelling is correct
Context: ...ized thresholds: - Silence threshold: 600ms (fast question detection) - Min recor...

(QB_NEW_EN_ORTHOGRAPHY_ERROR_IDS_1)


[grammar] ~295-~295: Ensure spelling is correct
Context: ...1. Increase silence threshold (default: 600ms → 800ms) 2. Reduce VAD sensitivity 3. U...

(QB_NEW_EN_ORTHOGRAPHY_ERROR_IDS_1)


[grammar] ~436-~436: Ensure spelling is correct
Context: ... (proper error handling) 3. ✅ Fast (400-800ms end-to-end) 4. ✅ Scalable (can handle h...

(QB_NEW_EN_ORTHOGRAPHY_ERROR_IDS_1)

IMPLEMENTATION_COMPLETE.md

[grammar] ~42-~42: Use a hyphen to join words.
Context: ...* - Integration examples - Copy-paste ready code - Event handlers - UI u...

(QB_NEW_EN_HYPHEN)

ARCHITECTURE.md

[grammar] ~185-~185: Ensure spelling is correct
Context: ...l-time feedback } ``` Latency: 100-300ms (streaming) Accuracy: 90-95% **...

(QB_NEW_EN_ORTHOGRAPHY_ERROR_IDS_1)

PSEUDO_LIVE_README.md

[grammar] ~39-~39: Ensure spelling is correct
Context: ...modes - Optimized silence detection (600ms threshold) - Pre/post-speech padding...

(QB_NEW_EN_ORTHOGRAPHY_ERROR_IDS_1)


[grammar] ~81-~81: Ensure spelling is correct
Context: ...anscript when: 1. Silence detected: 600ms of silence after speech 2. **Punctuatio...

(QB_NEW_EN_ORTHOGRAPHY_ERROR_IDS_1)


[grammar] ~84-~84: Ensure spelling is correct
Context: ...ffer overflow) 4. Min duration met: 200ms minimum (filters out noise) ### 3. Err...

(QB_NEW_EN_ORTHOGRAPHY_ERROR_IDS_1)

🪛 markdownlint-cli2 (0.18.1)
QUICK_START.md

73-73: Fenced code blocks should have a language specified

(MD040, fenced-code-language)


125-125: Fenced code blocks should have a language specified

(MD040, fenced-code-language)


135-135: Fenced code blocks should have a language specified

(MD040, fenced-code-language)


200-200: Emphasis used instead of a heading

(MD036, no-emphasis-as-heading)

IMPLEMENTATION_SUMMARY.md

56-56: Fenced code blocks should have a language specified

(MD040, fenced-code-language)


269-269: Fenced code blocks should have a language specified

(MD040, fenced-code-language)


276-276: Fenced code blocks should have a language specified

(MD040, fenced-code-language)


283-283: Fenced code blocks should have a language specified

(MD040, fenced-code-language)


290-290: Fenced code blocks should have a language specified

(MD040, fenced-code-language)

STT_COMPARISON.md

30-30: Fenced code blocks should have a language specified

(MD040, fenced-code-language)


39-39: Fenced code blocks should have a language specified

(MD040, fenced-code-language)


98-98: Fenced code blocks should have a language specified

(MD040, fenced-code-language)


107-107: Fenced code blocks should have a language specified

(MD040, fenced-code-language)


162-162: Fenced code blocks should have a language specified

(MD040, fenced-code-language)

TESTING_GUIDE.md

56-56: Fenced code blocks should have a language specified

(MD040, fenced-code-language)


269-269: Fenced code blocks should have a language specified

(MD040, fenced-code-language)


276-276: Fenced code blocks should have a language specified

(MD040, fenced-code-language)


283-283: Fenced code blocks should have a language specified

(MD040, fenced-code-language)


290-290: Fenced code blocks should have a language specified

(MD040, fenced-code-language)

README.md

87-87: Emphasis used instead of a heading

(MD036, no-emphasis-as-heading)


92-92: Emphasis used instead of a heading

(MD036, no-emphasis-as-heading)

PRODUCTION_IMPLEMENTATION.md

9-9: Fenced code blocks should have a language specified

(MD040, fenced-code-language)


159-159: Bare URL used

(MD034, no-bare-urls)


190-190: Tables should be surrounded by blank lines

(MD058, blanks-around-tables)

ARCHITECTURE.md

56-56: Fenced code blocks should have a language specified

(MD040, fenced-code-language)


269-269: Fenced code blocks should have a language specified

(MD040, fenced-code-language)


276-276: Fenced code blocks should have a language specified

(MD040, fenced-code-language)


283-283: Fenced code blocks should have a language specified

(MD040, fenced-code-language)


290-290: Fenced code blocks should have a language specified

(MD040, fenced-code-language)

PSEUDO_LIVE_README.md

9-9: Fenced code blocks should have a language specified

(MD040, fenced-code-language)


17-17: Emphasis used instead of a heading

(MD036, no-emphasis-as-heading)


283-283: Fenced code blocks should have a language specified

(MD040, fenced-code-language)

Comment on lines +86 to +89
```bash
npm install @google-cloud/speech --save
```


⚠️ Potential issue | 🟠 Major

🧩 Analysis chain

🏁 Script executed:

```bash
# First, let's check the actual lines in PRODUCTION_IMPLEMENTATION.md
echo "=== Lines 86-89 of PRODUCTION_IMPLEMENTATION.md ==="
sed -n '86,89p' PRODUCTION_IMPLEMENTATION.md

echo ""
echo "=== Lines 148-156 of PRODUCTION_IMPLEMENTATION.md ==="
sed -n '148,156p' PRODUCTION_IMPLEMENTATION.md

echo ""
echo "=== Check package.json for @google-cloud/speech ==="
cat package.json | grep -i "google-cloud/speech" || echo "Not found in package.json"

echo ""
echo "=== Search codebase for @google-cloud/speech references ==="
rg "@google-cloud/speech" --type js -n || echo "No references found"

echo ""
echo "=== Search for googleSpeechSTT files ==="
fd -t f "googleSpeechSTT" || echo "No googleSpeechSTT files found"

echo ""
echo "=== Search for any speech-related imports/usage in src ==="
rg "speech" -i --type js -l | head -20
```

Repository: sohzm/cheating-daddy

Length of output: 1219


🏁 Script executed:

```bash
# Let's examine the googleSpeechSTT.js file to understand the context better
echo "=== Full content of src/utils/googleSpeechSTT.js ==="
cat -n src/utils/googleSpeechSTT.js

echo ""
echo "=== Check if it's actually being used/imported ==="
rg "googleSpeechSTT" --type js -B 2 -A 2 || echo "No imports found"

echo ""
echo "=== Check current dependencies in package.json ==="
cat package.json | head -50
```

Repository: sohzm/cheating-daddy

Length of output: 16676


Missing dependency: @google-cloud/speech is required by src/utils/googleSpeechSTT.js but not in package.json.

The documentation correctly instructs users to install @google-cloud/speech and add it to package.json, but the actual package.json is missing this dependency. Since src/utils/googleSpeechSTT.js has a direct require('@google-cloud/speech') at line 24, this must be added to the dependencies in package.json or the module will fail at runtime. Add "@google-cloud/speech": "^6.0.0" to package.json dependencies.

🤖 Prompt for AI Agents
In PRODUCTION_IMPLEMENTATION.md around lines 86 to 89: package.json is missing
the runtime dependency "@google-cloud/speech" required by
src/utils/googleSpeechSTT.js (require at line 24); add "@google-cloud/speech":
"^6.0.0" to the package.json dependencies section (or run npm install --save
@google-cloud/speech to update package.json and lockfile) and commit the updated
package.json and package-lock.json/yarn.lock so the module is present at
runtime.
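In package.json terms, the fix described above is a single entry in the dependencies section. A minimal sketch of the resulting fragment (the ^6.0.0 range follows the review's suggestion; verify it against the current release of the package before committing):

```json
{
  "dependencies": {
    "@google-cloud/speech": "^6.0.0"
  }
}
```

Running `npm install @google-cloud/speech --save` produces the same entry and also updates the lockfile, which is the safer route for keeping package-lock.json in sync.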

Comment on lines 565 to +587
```javascript
    constructor() {
        super();
        this.responses = [];
        this.currentResponseIndex = -1;
        this.selectedProfile = 'interview';
        this.selectedLanguage = 'en-US';
        this.onSendText = () => {};
        this._lastAnimatedWordCount = 0;
        this.copiedFeedback = false;
        // Microphone starts as OFF by default (for manual mode)
        this.micEnabled = false;
        // Load VAD mode from localStorage
        this.vadMode = localStorage.getItem('vadMode') || 'automatic';
        // Load saved responses from localStorage
        try {
            this.savedResponses = JSON.parse(localStorage.getItem('savedResponses') || '[]');
        } catch (e) {
            this.savedResponses = [];
        }

        // Listen for VAD mode changes
        this.setupVADModeListener();
    }
```

⚠️ Potential issue | 🟠 Major

🧩 Analysis chain

🏁 Script executed:

wc -l src/components/views/AssistantView.js

Repository: sohzm/cheating-daddy

Length of output: 106


🏁 Script executed:

cat -n src/components/views/AssistantView.js | sed -n '550,620p'

Repository: sohzm/cheating-daddy

Length of output: 2946


🏁 Script executed:

cat -n src/components/views/AssistantView.js | sed -n '1200,1250p'

Repository: sohzm/cheating-daddy

Length of output: 4375


🏁 Script executed:

cat -n src/components/views/AssistantView.js | grep -n "disconnectedCallback" -A 20

Repository: sohzm/cheating-daddy

Length of output: 1271


🏁 Script executed:

cat -n src/components/views/AssistantView.js | grep -n "setupVADModeListener" -A 30

Repository: sohzm/cheating-daddy

Length of output: 1500


🏁 Script executed:

cat -n src/components/views/AssistantView.js | grep -n "addEventListener.*storage" -B 5 -A 15

Repository: sohzm/cheating-daddy

Length of output: 984


🏁 Script executed:

find . -name "*.js" -type f | xargs grep -l "localStorage.setItem.*vadMode" | head -20

Repository: sohzm/cheating-daddy

Length of output: 132


🏁 Script executed:

find . -name "*Customize*" -o -name "*Settings*" | head -20

Repository: sohzm/cheating-daddy

Length of output: 103


🏁 Script executed:

grep -r "localStorage.setItem.*vadMode" --include="*.js" -B 2 -A 2

Repository: sohzm/cheating-daddy

Length of output: 681


🏁 Script executed:

cat -n src/components/views/CustomizeView.js | grep -n "handleVADModeChange" -B 3 -A 10

Repository: sohzm/cheating-daddy

Length of output: 2127


🏁 Script executed:

grep -r "addEventListener.*storage" --include="*.js" -B 2 -A 10

Repository: sohzm/cheating-daddy

Length of output: 1132


🏁 Script executed:

cat -n src/components/views/AssistantView.js | sed -n '986,1010p'

Repository: sohzm/cheating-daddy

Length of output: 1232


🏁 Script executed:

find . -name "*.html" | xargs grep -l "AssistantView\|CustomizeView" 2>/dev/null | head -5

Repository: sohzm/cheating-daddy

Length of output: 46


Storage listener accumulates and won't sync VAD mode within same window

The setupVADModeListener() method at lines 589–605 registers an anonymous storage event listener that is never removed. Two issues:

  1. Functional bug: The storage event only fires in other browsing contexts (tabs/windows), not in the document that calls localStorage.setItem(). If CustomizeView (which calls localStorage.setItem('vadMode', ...) at line 951) and AssistantView are in the same renderer window, the mic toggle visibility at line 1212 (this.vadMode === 'manual') won't update when settings change—the listener simply won't fire.

  2. Memory leak: The listener is never removed. disconnectedCallback() (line 986) cleans up IPC listeners but has no cleanup for the storage listener. Since the handler is an anonymous function not stored on this, it cannot be removed. Repeated mounts of this component will accumulate listeners.

Fix: Store the handler on this and remove it in disconnectedCallback():

```javascript
setupVADModeListener() {
    this._onStorageChange = (e) => {
        if (e.key === 'vadMode') {
            this.vadMode = e.newValue || 'automatic';
            if (this.vadMode === 'automatic') {
                this.micEnabled = true;
                if (window.cheddar?.toggleMicrophone) {
                    window.cheddar.toggleMicrophone(true);
                }
            }
            this.requestUpdate();
        }
    };
    window.addEventListener('storage', this._onStorageChange);
}

disconnectedCallback() {
    super.disconnectedCallback();
    window.removeEventListener('storage', this._onStorageChange);
    // ... existing cleanup
}
```

Alternatively, if CustomizeView and AssistantView share the same window, use direct method calls or custom events instead of relying on cross-window storage events.

Also applies to: 589–605, 1212–1235

🤖 Prompt for AI Agents
In src/components/views/AssistantView.js around lines 565 to 587 and related
methods at 589–605 and 1212–1235, the storage event listener is registered as an
anonymous function and never removed, causing memory leaks and also failing to
update vadMode in the same window (storage events only fire across windows). Fix
by assigning the listener to a property (e.g., this._onStorageChange) in
setupVADModeListener so it can be removed later, remove it in
disconnectedCallback via window.removeEventListener('storage',
this._onStorageChange), and ensure same-window updates are handled either by
dispatching/handling a custom event or calling the update method directly from
CustomizeView when vadMode changes instead of relying solely on the storage
event.
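Since storage events never fire in the window that called localStorage.setItem, a same-window broadcast is the more reliable channel. A minimal sketch of that alternative, assuming CustomizeView and AssistantView share one renderer window; the bus here is a bare EventTarget so the snippet also runs under Node, while in the renderer you would dispatch on window (for example with a CustomEvent):

```javascript
// Hypothetical same-window event bus; in the renderer this would be `window`.
const bus = new EventTarget();

// CustomizeView side: publish the new mode instead of relying on `storage` events.
function publishVadMode(mode) {
    const event = new Event('vad-mode-changed');
    event.detail = { mode }; // mirrors CustomEvent's `detail` payload
    bus.dispatchEvent(event);
}

// AssistantView side: keep a named handler so it can be removed on disconnect.
let vadMode = 'automatic';
const onVadModeChanged = (e) => {
    vadMode = e.detail.mode || 'automatic';
};
bus.addEventListener('vad-mode-changed', onVadModeChanged);

publishVadMode('manual');
console.log(vadMode); // manual

// Cleanup path (disconnectedCallback): same reference, so removal works.
bus.removeEventListener('vad-mode-changed', onVadModeChanged);
publishVadMode('automatic');
console.log(vadMode); // still manual, the listener is gone
```

The event name and handler names are illustrative; the point is that a dispatched event reaches listeners in the same document, which the storage event by design does not.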

Comment on lines +1 to +7
```javascript
// Handle Squirrel events manually to prevent desktop shortcuts
if (require('electron').app) {
    const squirrelCommand = process.argv[1];
    if (handleSquirrelEvent(squirrelCommand)) {
        return;
    }
}
```

⚠️ Potential issue | 🔴 Critical

Critical: Top-level return flagged as a parse error.

Biome reports the return on line 5 as illegal. Node's CommonJS module wrapper happens to tolerate a top-level return, but it is a syntax error under standard script or ES-module parsing, so this file breaks the moment it is bundled or loaded as ESM. Note also that calling handleSquirrelEvent on line 4 before its declaration on line 9 only works because function declarations are hoisted; restructuring so the definition precedes the call makes the control flow much easier to follow.

🔎 Proposed fix: Wrap in an IIFE or restructure
```diff
-// Handle Squirrel events manually to prevent desktop shortcuts
-if (require('electron').app) {
-    const squirrelCommand = process.argv[1];
-    if (handleSquirrelEvent(squirrelCommand)) {
-        return;
-    }
-}
-
-function handleSquirrelEvent(squirrelCommand) {
+// Handle Squirrel events manually to prevent desktop shortcuts
+function handleSquirrelEvent(squirrelCommand) {
     const app = require('electron').app;
     // ... rest of function
 }
+
+// Check for Squirrel events before app initialization
+if (require('electron').app) {
+    const squirrelCommand = process.argv[1];
+    if (handleSquirrelEvent(squirrelCommand)) {
+        // Use process.exit() instead of return at module level
+        process.exit(0);
+    }
+}
```

Committable suggestion skipped: line range outside the PR's diff.

🧰 Tools
🪛 Biome (2.1.2)

[error] 5-5: Illegal return statement outside of a function

(parse)

🤖 Prompt for AI Agents
In src/index.js around lines 1 to 7, there is an illegal top-level "return" and
a call to handleSquirrelEvent before its definition; to fix, either wrap the
Squirrel-handling block in an immediately-invoked async function (IIFE) so
returns are valid, or replace the top-level return with a proper early-exit
(e.g., process.exit(0) or app.quit()) and ensure handleSquirrelEvent is defined
(move its declaration above this block or make it a hoisted function
declaration) so the call happens after the function exists.

Comment on lines +263 to +289
```javascript
// VAD (Voice Activity Detection) handler
ipcMain.handle('send-vad-audio-segment', async (event, audioSegment) => {
    try {
        // Forward VAD-processed audio segment to Gemini or audio processing
        // This handler bridges VAD output to existing audio processing pipeline
        console.log('Received VAD audio segment:', audioSegment ? 'Valid segment' : 'Invalid segment');

        // You can add additional processing here if needed
        // For now, this just acknowledges receipt of the VAD segment
        return { success: true };
    } catch (error) {
        console.error('Error processing VAD audio segment:', error);
        return { success: false, error: error.message };
    }
});

// VAD settings update handler
ipcMain.handle('update-vad-setting', async (event, vadEnabled) => {
    try {
        console.log('VAD setting updated:', vadEnabled ? 'enabled' : 'disabled');
        // Store VAD setting if needed for main process
        return { success: true };
    } catch (error) {
        console.error('Error updating VAD setting:', error);
        return { success: false, error: error.message };
    }
});
```

⚠️ Potential issue | 🟠 Major

🧩 Analysis chain

🏁 Script executed:

```bash
#!/bin/bash
# Search for VAD handler references and VAD-related implementations
rg -n "send-vad-audio-segment|update-vad-setting" --type js -g '!node_modules'
```

Repository: sohzm/cheating-daddy

Length of output: 536


🏁 Script executed:

```bash
#!/bin/bash
# Search for VAD processor or orchestrator
rg -n "VAD|vad|orchestrator|Orchestrator" --type js -g '!node_modules' | head -50
```

Repository: sohzm/cheating-daddy

Length of output: 4286


🏁 Script executed:

```bash
#!/bin/bash
# Search for audio processing related code
rg -n "audio.*process|process.*audio|audio.*segment" --type js -g '!node_modules' | head -30
```

Repository: sohzm/cheating-daddy

Length of output: 2927


VAD IPC handlers lack integration with the actual orchestrator.

The send-vad-audio-segment and update-vad-setting handlers in src/index.js only log and return success without forwarding to the processing pipeline. The pseudoLiveOrchestrator contains the actual VAD handling logic (handleAudioSegmentFromVAD) which routes segments to the STT service, but these handlers do not invoke it. Wire the handlers to forward audio segments and VAD settings to the orchestrator instance.

🤖 Prompt for AI Agents
In src/index.js around lines 263 to 289, the two VAD ipc handlers only log and
return success but must forward data into the existing orchestrator; call the
orchestrator's handler for segments
(pseudoLiveOrchestrator.handleAudioSegmentFromVAD(audioSegment)) inside
send-vad-audio-segment and propagate any returned result/errors back to the
renderer, and call the orchestrator's VAD setting updater (e.g.,
pseudoLiveOrchestrator.setVADEnabled(vadEnabled) or
pseudoLiveOrchestrator.updateVADSetting(vadEnabled)) inside update-vad-setting
so the main process actually applies the setting; ensure you check
pseudoLiveOrchestrator exists before calling and return its success/failure info
instead of just a static success.
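The wiring described in the prompt above can be factored into a small testable function so the ipcMain.handle registration stays thin. A sketch under the assumption that the orchestrator exposes handleAudioSegmentFromVAD (the method name the review found in pseudoLiveOrchestrator); the guard and the result shape here are illustrative, not the current code:

```javascript
// Hedged sketch: forward a VAD segment to the orchestrator and surface its result.
async function forwardVadSegment(orchestrator, audioSegment) {
    if (!orchestrator) {
        return { success: false, error: 'Orchestrator not initialized' };
    }
    try {
        const result = await orchestrator.handleAudioSegmentFromVAD(audioSegment);
        return { success: true, result };
    } catch (error) {
        return { success: false, error: error.message };
    }
}

// In src/index.js the handler would then become a thin wrapper:
// ipcMain.handle('send-vad-audio-segment', (event, seg) =>
//     forwardVadSegment(pseudoLiveOrchestrator, seg));

// Quick check with a stub orchestrator:
const stub = { handleAudioSegmentFromVAD: async (seg) => ({ bytes: seg.length }) };
forwardVadSegment(stub, new Uint8Array(320)).then(r => console.log(r.success, r.result.bytes)); // true 320
```

Keeping the forwarding logic separate from the IPC registration also makes the null-orchestrator path easy to unit-test without Electron.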

Comment on lines +162 to +168
```javascript
off: (channel, callback) => {
    if (callback) {
        ipcRenderer.removeListener(channel, callback);
    } else {
        ipcRenderer.removeAllListeners(channel);
    }
},
```

⚠️ Potential issue | 🟠 Major

The off method cannot properly remove listeners registered via on.

The on method wraps the callback in (_, data) => callback(data), but off tries to remove the original callback. Since the wrapped function is a different reference, removeListener won't find it.

🔎 Proposed fix: Track wrapped callbacks
```diff
+// Store wrapped callbacks for proper removal
+const callbackMap = new WeakMap();
+
 window.api = {
     // ...
     on: (channel, callback) => {
         const validChannels = [/* ... */];

         if (validChannels.includes(channel)) {
-            ipcRenderer.on(channel, (_, data) => callback(data));
+            const wrappedCallback = (_, data) => callback(data);
+            callbackMap.set(callback, wrappedCallback);
+            ipcRenderer.on(channel, wrappedCallback);
         } else {
             console.warn(`[preload] Invalid event channel: ${channel}`);
         }
     },

     off: (channel, callback) => {
         if (callback) {
-            ipcRenderer.removeListener(channel, callback);
+            const wrappedCallback = callbackMap.get(callback);
+            if (wrappedCallback) {
+                ipcRenderer.removeListener(channel, wrappedCallback);
+                callbackMap.delete(callback);
+            }
         } else {
             ipcRenderer.removeAllListeners(channel);
         }
     },
```

Committable suggestion skipped: line range outside the PR's diff.

🤖 Prompt for AI Agents
In src/preload.js around lines 162-168, the off method cannot remove listeners
added via on because on wraps the original callback in a new function; modify
the implementation to store the wrapped function reference (e.g., a Map keyed by
channel and original callback -> wrappedCallback) when registering listeners in
on, use that stored wrappedCallback when calling ipcRenderer.removeListener in
off, ensure removeAllListeners also clears any stored entries for that channel,
and clean up map entries when listeners are removed to avoid memory leaks.

Comment on lines +68 to +88
```javascript
    async initialize() {
        try {
            console.log('🔧 [STT] Initializing authentication...');

            // Initialize Google Auth with API key
            this.authClient = new GoogleAuth({
                credentials: {
                    client_email: 'speech-to-text@gemini-assistant.iam.gserviceaccount.com',
                    private_key: this.apiKey, // Using API key as credential
                },
                scopes: ['https://www.googleapis.com/auth/cloud-platform'],
            });

            this.isInitialized = true;
            console.log('✅ [STT] Service initialized successfully');
            return true;
        } catch (error) {
            console.error('❌ [STT] Initialization failed:', error);
            throw new Error(`STT initialization failed: ${error.message}`);
        }
    }
```

⚠️ Potential issue | 🔴 Critical

Incorrect authentication: API key cannot be used as a service account private key.

The GoogleAuth library expects actual service account credentials (a PEM-encoded private key and client email). Passing a Gemini API key as private_key will fail authentication. The hardcoded client_email is also invalid.

For API key-based authentication with Google Cloud Speech-to-Text, you need to either:

  1. Use REST API directly with ?key=API_KEY query parameter
  2. Use proper service account credentials (JSON key file)
🔎 Suggested approach for REST API with API key
```diff
-    async initialize() {
-        try {
-            console.log('🔧 [STT] Initializing authentication...');
-
-            // Initialize Google Auth with API key
-            this.authClient = new GoogleAuth({
-                credentials: {
-                    client_email: 'speech-to-text@gemini-assistant.iam.gserviceaccount.com',
-                    private_key: this.apiKey, // Using API key as credential
-                },
-                scopes: ['https://www.googleapis.com/auth/cloud-platform'],
-            });
-
-            this.isInitialized = true;
-            console.log('✅ [STT] Service initialized successfully');
-            return true;
-        } catch (error) {
-            console.error('❌ [STT] Initialization failed:', error);
-            throw new Error(`STT initialization failed: ${error.message}`);
-        }
-    }
+    async initialize() {
+        try {
+            console.log('🔧 [STT] Initializing authentication...');
+
+            // Store API key for REST API calls
+            // Note: For production, use @google-cloud/speech with service account
+            if (!this.apiKey) {
+                throw new Error('API key is required');
+            }
+
+            this.isInitialized = true;
+            console.log('✅ [STT] Service initialized successfully');
+            return true;
+        } catch (error) {
+            console.error('❌ [STT] Initialization failed:', error);
+            throw new Error(`STT initialization failed: ${error.message}`);
+        }
+    }
```
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change (replacement code; the removed block is quoted above):

```javascript
    async initialize() {
        try {
            console.log('🔧 [STT] Initializing authentication...');

            // Store API key for REST API calls
            // Note: For production, use @google-cloud/speech with service account
            if (!this.apiKey) {
                throw new Error('API key is required');
            }

            this.isInitialized = true;
            console.log('✅ [STT] Service initialized successfully');
            return true;
        } catch (error) {
            console.error('❌ [STT] Initialization failed:', error);
            throw new Error(`STT initialization failed: ${error.message}`);
        }
    }
```
🤖 Prompt for AI Agents
In src/utils/speechToText.js around lines 68-88, the code incorrectly uses a
Gemini API key as a service-account private_key and hardcodes a client_email;
replace this by either 1) using proper service account JSON credentials: load
the service account JSON (from an env path or env var), parse client_email and
private_key and pass those credentials into new GoogleAuth({credentials: {...},
scopes: [...]}) OR 2) switch to API-key-based REST calls: remove GoogleAuth
usage and call the Speech-to-Text REST endpoint with the API key appended as
?key=API_KEY (or include the key from env), handling tokenless requests
accordingly; ensure no hardcoded client_email remains and that isInitialized is
only set true after successful auth/setup.
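For option 2 (REST with an API key), the request to the synchronous speech:recognize endpoint is small enough to sketch. The endpoint and the config/audio fields follow the public Speech-to-Text v1 REST surface; treat the encoding and sample rate below as placeholders for whatever the capture pipeline actually produces:

```javascript
// Hedged sketch: build a speech:recognize request authenticated by API key.
function buildRecognizeRequest(apiKey, audioBase64, languageCode = 'en-US') {
    if (!apiKey) throw new Error('API key is required');
    return {
        url: `https://speech.googleapis.com/v1/speech:recognize?key=${encodeURIComponent(apiKey)}`,
        method: 'POST',
        headers: { 'Content-Type': 'application/json' },
        body: JSON.stringify({
            config: {
                encoding: 'LINEAR16',   // assumption: raw 16-bit PCM segments
                sampleRateHertz: 16000, // assumption: 16 kHz capture
                languageCode,
            },
            audio: { content: audioBase64 }, // base64-encoded audio bytes
        }),
    };
}

const req = buildRecognizeRequest('MY_KEY', Buffer.from('fake').toString('base64'));
console.log(req.url.endsWith('?key=MY_KEY')); // true
```

The actual call would then be fetch(req.url, req), with the transcript read from results[0].alternatives[0].transcript in the JSON response; no GoogleAuth client or service-account email is involved on this path.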

Comment on lines +318 to +321
```javascript
updateLanguage(languageCode) {
    console.log(`[STT] Updating language to: ${languageCode}`);
    STT_CONFIG.languageCode = languageCode;
}
```

🛠️ Refactor suggestion | 🟠 Major

Mutating module-level constant creates shared state issues.

updateLanguage mutates the shared STT_CONFIG object, which affects all instances of SpeechToTextService. Store language as an instance property instead.

🔎 Proposed fix
```diff
+    constructor(apiKey, onTranscriptComplete, onTranscriptPartial = null) {
+        this.apiKey = apiKey;
+        this.onTranscriptComplete = onTranscriptComplete;
+        this.onTranscriptPartial = onTranscriptPartial;
+
+        // Instance-level language setting
+        this.languageCode = STT_CONFIG.languageCode;
         // ... rest of constructor

     updateLanguage(languageCode) {
         console.log(`[STT] Updating language to: ${languageCode}`);
-        STT_CONFIG.languageCode = languageCode;
+        this.languageCode = languageCode;
     }
```

Committable suggestion skipped: line range outside the PR's diff.

🤖 Prompt for AI Agents
In src/utils/speechToText.js around lines 318-321, updateLanguage currently
mutates the module-level STT_CONFIG.languageCode causing shared state across
instances; change the service to store languageCode as an instance property
(e.g., this.languageCode set in the constructor with a default or passed-in
value), change updateLanguage to set this.languageCode instead of STT_CONFIG,
and refactor any internal methods that read STT_CONFIG.languageCode to read
this.languageCode so no module-level constant is mutated and each
SpeechToTextService instance maintains its own language state.
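The shared-state hazard is concrete: two services configured for different languages would clobber each other through the module constant. A minimal sketch of the instance-scoped version the prompt describes, with the class trimmed down to just the language handling for illustration:

```javascript
const STT_CONFIG = { languageCode: 'en-US' }; // module-level defaults, never mutated

class SpeechToTextService {
    constructor(languageCode = STT_CONFIG.languageCode) {
        this.languageCode = languageCode; // instance-level copy
    }

    updateLanguage(languageCode) {
        this.languageCode = languageCode; // no write-through to STT_CONFIG
    }
}

const interviewStt = new SpeechToTextService();
const translatorStt = new SpeechToTextService();
translatorStt.updateLanguage('de-DE');

console.log(interviewStt.languageCode, translatorStt.languageCode); // en-US de-DE
console.log(STT_CONFIG.languageCode); // en-US
```

Each instance now carries its own language, and the module constant stays a read-only source of defaults.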

Comment on lines +130 to +144
```javascript
// AUTOMATIC MODE: Stream audio continuously without VAD processing
// Since NonRealTimeVAD doesn't support streaming, we send audio directly in chunks
if (this.mode === 'automatic' && this.state === VADState.LISTENING) {
    // Send audio chunk directly via the onCommit callback
    // This ensures continuous streaming like before VAD was added
    if (this.onCommit) {
        this.onCommit(audioFrame, {
            duration: 0,
            frameCount: 1,
            timestamp: Date.now(),
            streaming: true // Flag to indicate streaming mode
        });
    }
    return;
}
```

⚠️ Potential issue | 🔴 Critical

🧩 Analysis chain

🏁 Script executed:

rg -n "streaming" --type js src/

Repository: sohzm/cheating-daddy

Length of output: 4031


🏁 Script executed:

# Also search for orchestrator and STT implementations
fd -e js src/ | xargs grep -l "orchestrator\|stt" 2>/dev/null | head -20

Repository: sohzm/cheating-daddy

Length of output: 385


🏁 Script executed:

# Search for where this VAD module is imported and used
rg -n "from.*vad\|import.*vad\|require.*vad" --type js src/

Repository: sohzm/cheating-daddy

Length of output: 46


🏁 Script executed:

sed -n '370,410p' src/utils/pseudoLiveOrchestrator.js

Repository: sohzm/cheating-daddy

Length of output: 1659


🏁 Script executed:

# Get broader context of the handleCommit/onCommit function
sed -n '350,420p' src/utils/pseudoLiveOrchestrator.js

Repository: sohzm/cheating-daddy

Length of output: 2662


🏁 Script executed:

# Check STT implementation to see if processAudioChunk accepts metadata
rg -n "processAudioChunk" --type js src/ -A 5

Repository: sohzm/cheating-daddy

Length of output: 2034


🏁 Script executed:

# Check the actual onCommit callback setup in orchestrator
rg -n "onCommit" --type js src/utils/vad.js -B 3 -A 3

Repository: sohzm/cheating-daddy

Length of output: 1421


Streaming flag is set but lost before reaching STT service.

In automatic mode with LISTENING state, audio frames are immediately committed with streaming: true flag. However, the orchestrator receives this flag but doesn't pass it to the STT service—both streaming and non-streaming audio go through the same processAudioChunk(audioSegment) call without metadata. As a result, STT implementations have no way to differentiate between streaming frames and complete segments, defeating the intent of the flag.

To fix: Either pass metadata to processAudioChunk() so STT can handle streaming frames differently, or use the flag in the orchestrator to apply different processing logic before sending to STT.

🤖 Prompt for AI Agents
In src/utils/vad.js around lines 130 to 144, the committed audioFrame is marked
with streaming: true but that flag is not forwarded to the STT path; update the
call flow so the streaming metadata is preserved: either change the
onCommit/onOrchestrator contract to include the metadata and forward it into
processAudioChunk(audioSegment, { streaming: true }) (or similar) so STT
implementations can detect streaming frames, or detect streaming in the
orchestrator when receiving commits and branch to a separate processing path
(e.g., accumulate vs immediate streaming) before calling processAudioChunk;
ensure the metadata key name matches existing interfaces and update any callers
to accept the extra param.
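One way to preserve the flag end to end is to make metadata an explicit second parameter and branch on it in the STT layer. A sketch of that contract; the class, method, and field names mirror the review's suggestion and are not the current signatures:

```javascript
// Hedged sketch: STT side accepts metadata and routes streaming frames differently.
class SttSink {
    constructor() {
        this.pending = [];   // streaming frames accumulate here
        this.segments = [];  // complete VAD segments go straight through
    }

    processAudioChunk(audio, metadata = {}) {
        if (metadata.streaming) {
            this.pending.push(audio);   // e.g. feed a streaming recognizer
        } else {
            this.segments.push(audio);  // e.g. one-shot recognize per segment
        }
    }
}

// Orchestrator side: forward the VAD metadata instead of dropping it.
function onCommit(sink, audio, metadata) {
    sink.processAudioChunk(audio, metadata);
}

const sink = new SttSink();
onCommit(sink, 'frame-1', { streaming: true, frameCount: 1 });
onCommit(sink, 'segment-1', { streaming: false, frameCount: 42 });
console.log(sink.pending.length, sink.segments.length); // 1 1
```

The key design point is that the metadata object committed by the VAD travels unchanged to the consumer, so new flags added later also survive the hop.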

Comment on lines +336 to +348
```javascript
// Clear buffers
this.audioBuffer = [];
this.silenceStartTime = 0;
this.consecutiveSilenceFrames = 0;
this.consecutiveSpeechFrames = 0;

// Send the audio segment via callback
if (this.onCommit) {
    this.onCommit(completeAudioSegment, {
        duration: recordingDuration,
        frameCount: this.audioBuffer.length,
        timestamp: Date.now()
    });
```

⚠️ Potential issue | 🟠 Major

Bug: frameCount in metadata is always 0.

this.audioBuffer is cleared on line 337 before frameCount is read on line 346. The metadata will always show frameCount: 0.

🔎 Proposed fix
```diff
+        // Capture frame count before clearing
+        const frameCount = this.audioBuffer.length;
+
         // Clear buffers
         this.audioBuffer = [];
         this.silenceStartTime = 0;
         this.consecutiveSilenceFrames = 0;
         this.consecutiveSpeechFrames = 0;

         // Send the audio segment via callback
         if (this.onCommit) {
             this.onCommit(completeAudioSegment, {
                 duration: recordingDuration,
-                frameCount: this.audioBuffer.length,
+                frameCount: frameCount,
                 timestamp: Date.now()
             });
         }
```
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change (replacement code; the buggy block is quoted above):

```javascript
        // Capture frame count before clearing
        const frameCount = this.audioBuffer.length;

        // Clear buffers
        this.audioBuffer = [];
        this.silenceStartTime = 0;
        this.consecutiveSilenceFrames = 0;
        this.consecutiveSpeechFrames = 0;

        // Send the audio segment via callback
        if (this.onCommit) {
            this.onCommit(completeAudioSegment, {
                duration: recordingDuration,
                frameCount: frameCount,
                timestamp: Date.now()
            });
```
🤖 Prompt for AI Agents
In src/utils/vad.js around lines 336-348, the code clears this.audioBuffer
before building the metadata so frameCount is always 0; compute and store
frameCount (e.g., const frameCount = this.audioBuffer.length or use
completeAudioSegment.length) and any other metadata values (duration/timestamp)
before resetting/clearing this.audioBuffer, then call this.onCommit with the
saved frameCount in the metadata and only after that clear the buffers.

Comment on lines +423 to +427
```javascript
// Public method to update configuration
updateConfig(newConfig) {
    Object.assign(VAD_CONFIG, newConfig);
    console.log('VAD configuration updated:', VAD_CONFIG);
}
```

🛠️ Refactor suggestion | 🟠 Major

Mutating global VAD_CONFIG affects all instances.

updateConfig directly mutates the module-level VAD_CONFIG object, which means changes will affect all VADProcessor instances and persist across the application lifecycle.

🔎 Proposed fix

Store instance-specific config instead:

```diff
 constructor(onCommit, onStateChange = null, mode = 'automatic') {
     // ...
+    this.config = { ...VAD_CONFIG }; // Instance-specific copy
     // ...
 }

 // Public method to update configuration
 updateConfig(newConfig) {
-    Object.assign(VAD_CONFIG, newConfig);
-    console.log('VAD configuration updated:', VAD_CONFIG);
+    Object.assign(this.config, newConfig);
+    console.log('VAD configuration updated:', this.config);
 }
```

Then update all references from VAD_CONFIG to this.config within instance methods.

Committable suggestion skipped: line range outside the PR's diff.

🤖 Prompt for AI Agents
In src/utils/vad.js around lines 423 to 427, updateConfig currently mutates the
module-level VAD_CONFIG which affects all VADProcessor instances; change to set
an instance property (e.g., this.config = Object.assign({}, VAD_CONFIG,
newConfig) or merge into an existing this.config) and remove direct writes to
the module-level VAD_CONFIG, then update all instance methods to read from
this.config instead of VAD_CONFIG so configuration is instance-scoped and
non-global.

@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 6

♻️ Duplicate comments (4)
src/utils/renderer.js (1)

391-396: [DUPLICATE] Audio gating breaks non-VAD mode (Linux/Windows)

The past review comment remains valid. Lines 391-396 and 489-494 gate audio processing on microphoneEnabled alone, but microphoneEnabled is only set when VAD is active. When VAD is disabled or fails to load, microphoneEnabled stays false, blocking all audio processing—even for the original non-VAD path.

The condition should be:

if (isVADEnabled && !microphoneEnabled)

so that audio is only gated when VAD is actually in use.

Also applies to: 489-494

src/preload.js (1)

139-181: [DUPLICATE] off method cannot remove listeners registered via on

The past review comment remains valid. Line 164 wraps the callback in a new function (_, data) => callback(data), but line 177 tries to remove the original callback. Since the wrapper is a different reference, removeListener won't find it, and listeners leak.

The suggested fix (using a WeakMap to track wrapped callbacks) is correct and necessary to prevent memory leaks.

src/utils/gemini.js (1)

537-579: [DUPLICATE] geminiSessionRef undefined in error/close handlers; lastSessionParams cleared before use

The past review comment remains valid:

  1. Undefined geminiSessionRef: Lines 558 and 626 reference geminiSessionRef, which is not in scope within initializeGeminiSession. This will throw a ReferenceError. Use global.geminiSessionRef instead (set at line 1310), and guard against it being undefined.

  2. lastSessionParams cleared prematurely: Lines 551-552 (onerror) set lastSessionParams = null and then check if (lastSessionParams) at line 564, which will always be false. Similarly, lines 619-620 (onclose) clear it before using it. Capture the params in a local variable first (as correctly done at line 616 in onclose), then clear the global.

🔎 Proposed fix
```diff
 // In onerror (lines 551-564):
+const sessionParams = lastSessionParams ? { ...lastSessionParams } : null;
 lastSessionParams = null;
 reconnectionAttempts = maxReconnectionAttempts;

 usePseudoLive = true;
-if (!pseudoLiveOrchestrator) {
+if (!pseudoLiveOrchestrator && global.geminiSessionRef) {
     pseudoLiveOrchestrator = new PseudoLiveOrchestrator(
-        geminiSessionRef,
+        global.geminiSessionRef,
         sendToRenderer
     );
 }

-if (lastSessionParams) {
+if (sessionParams && sessionParams.apiKey) {
     pseudoLiveOrchestrator.initialize(
-        lastSessionParams.apiKey,
+        sessionParams.apiKey,
         'automatic',
-        lastSessionParams.language
+        sessionParams.language || 'en-US'
     // ...
```

Apply the same pattern in onclose (lines 595-641).

Also applies to: 595-641

src/components/views/AssistantView.js (1)

619-635: Storage listener accumulates and is never removed.

This issue was flagged in a previous review. The anonymous storage event listener in setupVADModeListener is never removed in disconnectedCallback, causing memory leaks on repeated component mounts. Additionally, the storage event only fires in other browsing contexts, so same-window changes from CustomizeView won't trigger this listener.

🧹 Nitpick comments (9)
src/utils/renderer.js (2)

169-183: Parameters overridden by localStorage lookups

Lines 173-174 consult localStorage to resolve mode and model, overriding the function arguments. The expression mode || localStorage.getItem(...) falls back to localStorage whenever mode is falsy (including an explicit empty string), and because the parameters already have default values in the signature, callers passing falsy values are silently ignored.

Consider:

-const selectedMode = mode || localStorage.getItem('selectedMode') || 'interview';
-const selectedModel = model || localStorage.getItem('selectedModel') || 'gemini-2.5-flash';
+const selectedMode = mode ?? (localStorage.getItem('selectedMode') || 'interview');
+const selectedModel = model ?? (localStorage.getItem('selectedModel') || 'gemini-2.5-flash');

Using ?? (nullish coalescing) ensures only null/undefined trigger the fallback, preserving explicit empty strings if that's ever needed.
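The difference is easy to demonstrate with a small helper mirroring the renderer logic (the helper and its defaults are illustrative):

```javascript
// '' is treated as an explicit (if unusual) value; only null/undefined
// mean "not provided" and trigger the localStorage fallback.
function resolveMode(mode, stored) {
    return mode ?? (stored || 'interview');
}

console.log(resolveMode(undefined, 'coding')); // 'coding'    - fallback used
console.log(resolveMode('', 'coding'));        // ''          - explicit value kept
console.log('' || 'coding');                   // 'coding'    - || would discard it
```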


847-856: Direct access to vadProcessor.audioBuffer couples to internal state

Line 849 checks vadProcessor.audioBuffer to decide whether to commit. This tightly couples the toggle logic to VADProcessor's internal implementation. If VADProcessor refactors its buffer management, this breaks.

Consider adding a method to VADProcessor (e.g., hasRecordedAudio()) to encapsulate the check, or rely on commit() to be a no-op when there's no audio.
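A sketch of the encapsulated check, using a deliberately minimal stand-in for VADProcessor (the real class buffers PCM chunks; only the shape of the API is the point here):

```javascript
class VADProcessor {
    constructor() {
        this.audioBuffer = []; // internal detail, not part of the public API
    }
    push(chunk) {
        this.audioBuffer.push(chunk);
    }
    // Public query so callers never reach into audioBuffer directly.
    hasRecordedAudio() {
        return this.audioBuffer.length > 0;
    }
    // Safe no-op when there is nothing to commit.
    commit() {
        if (!this.hasRecordedAudio()) return null;
        const audio = this.audioBuffer;
        this.audioBuffer = [];
        return audio;
    }
}

const vad = new VADProcessor();
console.log(vad.hasRecordedAudio()); // false
vad.push('chunk-1');
console.log(vad.hasRecordedAudio()); // true
vad.commit();
console.log(vad.hasRecordedAudio()); // false again after commit
```

The toggle logic then reads `if (vadProcessor.hasRecordedAudio())` and survives any internal refactor of the buffer.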

src/utils/ollama.js (1)

11-21: Consider adding timeouts to fetch calls

The fetch calls at lines 13, 30, and 81 lack timeout configuration. If Ollama is unresponsive, these can hang indefinitely, blocking the renderer or degrading UX.

Consider wrapping fetch with AbortSignal.timeout() (Node 18+) or a custom timeout promise:

const controller = new AbortController();
const timeoutId = setTimeout(() => controller.abort(), 5000); // 5 second timeout

try {
    const response = await fetch(url, { signal: controller.signal });
    // ...
} finally {
    clearTimeout(timeoutId);
}

This ensures calls fail fast if Ollama is down or slow.

Also applies to: 28-46, 77-110
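On Node 18+, AbortSignal.timeout() makes this even shorter by removing the manual timer bookkeeping. The URL and 5-second budget below are illustrative, not taken from the project config:

```javascript
// Returns true if Ollama answers within the timeout, false on timeout
// or any network error. AbortSignal.timeout() rejects the fetch with a
// TimeoutError when the budget is exceeded.
async function checkOllama(url = 'http://localhost:11434/api/tags') {
    try {
        const response = await fetch(url, { signal: AbortSignal.timeout(5000) });
        return response.ok;
    } catch (err) {
        return false;
    }
}
```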

src/preload.js (1)

141-161: Consider centralizing the channel whitelist

Lines 141-161 hardcode the valid channels in the preload script. If new channels are added in other modules (e.g., gemini.js, pseudoLiveOrchestrator.js), they must be manually added here, risking missed updates.

Consider:

  • Exporting the channel list from a shared module (e.g., src/constants/ipcChannels.js) and importing it here and in other modules, or
  • Documenting the list's location prominently so maintainers remember to update it.

This reduces the chance of valid channels being blocked due to stale whitelists.
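A sketch of the shared module (src/constants/ipcChannels.js is the suggested path, not an existing file; the channel names besides save-conversation-turn are placeholders):

```javascript
// Single source of truth for renderer-facing IPC channels.
const RENDERER_CHANNELS = Object.freeze([
    'update-response',
    'update-status',
    'save-conversation-turn',
]);

function isValidChannel(channel) {
    return RENDERER_CHANNELS.includes(channel);
}

module.exports = { RENDERER_CHANNELS, isValidChannel };

// In preload.js the whitelist check then becomes:
//   const { isValidChannel } = require('./constants/ipcChannels');
//   if (!isValidChannel(channel)) return;
```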

src/utils/gemini.js (1)

251-274: Timeout promise doesn't abort underlying request

Lines 259-263 use Promise.race to implement a 3-second timeout. If the timeout wins, the underlying generateContent call continues running, potentially wasting API quota and resources.

Consider using AbortController to cancel the request when the timeout fires:

const controller = new AbortController();
const timeoutId = setTimeout(() => controller.abort(), 3000);

try {
    const result = await testModel.generateContent('test', { signal: controller.signal });
    // ...
} finally {
    clearTimeout(timeoutId);
}

Note: Check if the Gemini SDK supports AbortSignal. If not, the current approach is acceptable, but document that timed-out requests may still consume quota.

src/components/views/CustomizeView.js (3)

1129-1149: Simplify empty conditional block.

Lines 1139-1141 contain an empty block with just a comment. This can be simplified by using a negative condition check.

🔎 Proposed simplification
     async handleModeChange(e) {
         this.selectedMode = e.target.value;
         localStorage.setItem('selectedMode', this.selectedMode);

         // In interview mode, always use live API
         // In coding mode, user can choose between flash and pro
         if (this.selectedMode === 'interview') {
             this.selectedModel = 'gemini-2.5-flash';
-        } else {
-            // Keep current model selection for coding mode
-            if (this.selectedModel === 'gemini-2.5-flash' || this.selectedModel === 'gemini-3-pro-preview') {
-                // Keep the selection
-            } else {
-                // Default to pro for coding mode
-                this.selectedModel = 'gemini-3-pro-preview';
-            }
+        } else if (this.selectedModel !== 'gemini-2.5-flash' && this.selectedModel !== 'gemini-3-pro-preview') {
+            // Default to pro for coding mode if current model is not a valid option
+            this.selectedModel = 'gemini-3-pro-preview';
         }
         localStorage.setItem('selectedModel', this.selectedModel);

         this.requestUpdate();
     }

1589-1596: Inconsistent dropdown usage.

The Chat Provider section uses a native <select> element while all other dropdowns in this file use the <custom-dropdown> component. Consider using <custom-dropdown> for consistency.

🔎 Proposed fix
-                            <select
-                                class="form-control"
-                                .value=${this.chatProvider}
-                                @change=${this.handleChatProviderSelect}
-                            >
-                                <option value="gemini">Gemini (Default)</option>
-                                <option value="ollama">Ollama (Local)</option>
-                            </select>
+                            <custom-dropdown
+                                .value=${this.chatProvider}
+                                .options=${[
+                                    { value: 'gemini', label: 'Gemini (Default)' },
+                                    { value: 'ollama', label: 'Ollama (Local)' }
+                                ]}
+                                @change=${e => this.handleChatProviderSelect({ target: { value: e.detail.value } })}
+                            ></custom-dropdown>

1333-1338: Alert is disruptive; consider a non-blocking notification.

Using alert() blocks the UI and is jarring for users. A toast notification or inline message would provide a better user experience.

src/components/views/AssistantView.js (1)

685-745: Consider extracting localized greetings to a separate module.

The getLocalizedGreeting method contains a large object with 30+ translations, making this component harder to maintain. Consider extracting to a separate localization.js module.
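The extraction could look like the sketch below (the greeting strings and module path are illustrative; only a few of the 30+ languages are shown):

```javascript
// Hypothetical src/utils/localization.js
const GREETINGS = Object.freeze({
    'en-US': 'Hey, how can I help you today?',
    'es-ES': 'Hola, ¿cómo puedo ayudarte hoy?',
    'fr-FR': 'Bonjour, comment puis-je vous aider ?',
});

function getLocalizedGreeting(language) {
    // Fall back to English for unknown or missing language codes.
    return GREETINGS[language] || GREETINGS['en-US'];
}

module.exports = { getLocalizedGreeting };
```

AssistantView then imports `getLocalizedGreeting` instead of carrying the table inline, and translators can edit one file.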

📜 Review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 5b324a5 and b03d2a7.

📒 Files selected for processing (14)
  • DELIVERABLES.md
  • IMPLEMENTATION_SUMMARY.md
  • OLLAMA_INTEGRATION_COMPLETE.md
  • OLLAMA_INTEGRATION_COMPLETE_FINAL.md
  • QUICKSTART.md
  • src/components/app/CheatingDaddyApp.js
  • src/components/views/AssistantView.js
  • src/components/views/CustomizeView.js
  • src/components/views/MainView.js
  • src/preload.js
  • src/utils/gemini.js
  • src/utils/ollama.js
  • src/utils/renderer.js
  • verify-ollama-integration.js
🚧 Files skipped from review as they are similar to previous changes (2)
  • IMPLEMENTATION_SUMMARY.md
  • src/components/views/MainView.js
🧰 Additional context used
🧬 Code graph analysis (7)
verify-ollama-integration.js (1)
src/utils/gemini.js (1)
  • path (1060-1060)
src/utils/ollama.js (1)
src/utils/gemini.js (1)
  • response (264-264)
src/utils/gemini.js (3)
PSEUDO_LIVE_INTEGRATION.js (9)
  • result (216-216)
  • result (235-235)
  • result (257-257)
  • result (279-279)
  • result (308-308)
  • language (31-31)
  • status (284-284)
  • status (461-461)
  • status (495-495)
INTEGRATION_GUIDE.js (1)
  • pseudoLiveOrchestrator (28-28)
src/utils/ollama.js (5)
  • response (13-15)
  • response (30-32)
  • response (81-95)
  • data (38-38)
  • data (101-101)
src/utils/renderer.js (2)
src/utils/gemini.js (9)
  • require (1-1)
  • require (2-2)
  • require (3-3)
  • require (4-4)
  • require (5-5)
  • require (6-6)
  • require (7-7)
  • require (8-8)
  • require (1059-1059)
src/__mocks__/electron.js (1)
  • ipcRenderer (5-5)
src/components/views/CustomizeView.js (4)
src/utils/gemini.js (3)
  • value (157-172)
  • result (263-263)
  • result (704-704)
src/utils/window.js (2)
  • isMac (215-215)
  • isMac (546-546)
src/utils/windowResize.js (2)
  • result (5-5)
  • window (4-4)
src/__mocks__/electron.js (1)
  • ipcRenderer (5-5)
src/components/app/CheatingDaddyApp.js (2)
src/utils/renderer.js (2)
  • selectedMode (173-173)
  • selectedMode (633-633)
src/utils/window.js (2)
  • isMac (215-215)
  • isMac (546-546)
src/preload.js (3)
src/utils/renderer.js (1)
  • require (2-2)
src/utils/windowResize.js (1)
  • window (4-4)
src/__mocks__/electron.js (1)
  • ipcRenderer (5-5)
🪛 LanguageTool
QUICKSTART.md

[style] ~228-~228: Using many exclamation marks might seem excessive (in this case: 5 exclamation marks for a text that’s 2474 characters long)
Context: ... 🎉 Enjoy your new Ollama integration! 🎉

(EN_EXCESSIVE_EXCLAMATION)

OLLAMA_INTEGRATION_COMPLETE.md

[style] ~380-~380: Some style guides suggest that commas should set off the year in a month-day-year date.
Context: ... Implementation Date: December 27, 2024 Status: ✅ COMPLETE & PRODUCTION-R...

(MISSING_COMMA_AFTER_YEAR)

🪛 markdownlint-cli2 (0.18.1)
OLLAMA_INTEGRATION_COMPLETE_FINAL.md

245-245: Bare URL used

(MD034, no-bare-urls)


316-316: Bare URL used

(MD034, no-bare-urls)

DELIVERABLES.md

145-145: Fenced code blocks should have a language specified

(MD040, fenced-code-language)


185-185: Bare URL used

(MD034, no-bare-urls)


338-338: Fenced code blocks should have a language specified

(MD040, fenced-code-language)

🔇 Additional comments (8)
src/components/app/CheatingDaddyApp.js (2)

163-166: LGTM!

Global keyboard listener is properly bound in connectedCallback and correctly removed in disconnectedCallback, preventing memory leaks.

Also applies to: 177-181


287-320: LGTM!

The mode and model selection logic correctly derives the mode from the profile, persists to localStorage, and initializes Gemini with appropriate parameters. The hardcoded model name for interview mode (gemini-2.0-flash-exp) is intentional based on the PR description.

OLLAMA_INTEGRATION_COMPLETE.md (1)

1-382: Documentation is comprehensive and well-organized.

The document thoroughly covers the Ollama integration including implementation scope, user workflow, technical details, and troubleshooting. This will be helpful for maintainers.

One minor style note: Consider adding a comma after the year in the date on line 380 per some style guides ("December 27, 2024,").

src/components/views/CustomizeView.js (2)

654-670: LGTM!

The per-profile prompt storage using customPrompt_${profile} keys is a good approach that maintains backward compatibility with the legacy customPrompt key while enabling per-profile customization.


1018-1050: LGTM!

VAD settings handlers correctly persist to localStorage and notify the main process via IPC. Error handling is in place for the IPC calls.

src/components/views/AssistantView.js (3)

796-897: LGTM!

The LaTeX rendering implementation correctly:

  • Checks for KaTeX availability before processing
  • Skips text nodes inside code blocks to avoid breaking code
  • Handles both display ($$...$$) and inline ($...$) math modes
  • Uses proper error handling with fallbacks

1108-1135: LGTM!

The extractCodeBlocks method has sensible logic for extracting code from markdown responses. The length threshold of 10 characters for inline code blocks is a reasonable heuristic to avoid copying single-word code spans.


143-326: LGTM!

Comprehensive VS Code-style syntax highlighting CSS with proper specificity using !important to override highlight.js defaults. The language-specific enhancements for JavaScript, Python, HTML, CSS, SQL, and JSON are well-organized.

Comment on lines +334 to +364
    async handleStartOllamaChat() {
        try {
            // Set chat provider to Ollama
            if (window.api && window.api.setChatProvider) {
                const result = await window.api.setChatProvider('ollama');
                if (!result.success) {
                    this.setStatus(`Error: ${result.error || 'Failed to connect to Ollama'}`);
                    return;
                }
            }

            // Set in localStorage
            localStorage.setItem('chatProvider', 'ollama');

            // Initialize responses and state for Ollama chat session
            this.responses = [];
            this.currentResponseIndex = -1;
            this.startTime = Date.now();

            // Set status to indicate Ollama mode
            this.setStatus('Ready - Ollama Chat');

            // Open AssistantView directly (no Gemini initialization needed)
            this.currentView = 'assistant';

            console.log('✅ Started Ollama chat session');
        } catch (error) {
            console.error('Error starting Ollama chat:', error);
            this.setStatus(`Error: ${error.message || 'Failed to start Ollama chat'}`);
        }
    }

⚠️ Potential issue | 🟡 Minor

Missing error handling for window.api availability.

The entire Ollama flow depends on window.api being defined. The guard at line 338 (window.api && window.api.setChatProvider) prevents a throw, but if the preload script fails to load and window.api is undefined, provider setup is silently skipped and the flow continues as if it succeeded, with no feedback to the user.

🔎 Proposed fix
     async handleStartOllamaChat() {
         try {
+            if (!window.api) {
+                this.setStatus('Error: API not available');
+                return;
+            }
+            
             // Set chat provider to Ollama
-            if (window.api && window.api.setChatProvider) {
+            if (window.api.setChatProvider) {
                 const result = await window.api.setChatProvider('ollama');

Comment on lines +366 to +388
    handleClearAndRestart() {
        // Clear the current session and responses
        this.responses = [];
        this.currentResponseIndex = -1;
        this.startTime = null;

        // Stop any ongoing capture if in assistant view
        if (this.currentView === 'assistant' && window.cheddar) {
            window.cheddar.stopCapture();
        }

        // Return to main view
        this.currentView = 'main';
        this.setStatus('Session cleared. Starting new session...');

        // Request update to refresh the UI
        this.requestUpdate();

        // Automatically start a new session after a brief delay
        setTimeout(() => {
            this.handleStart();
        }, 100);
    }

⚠️ Potential issue | 🟡 Minor

Fix indentation for handleClearAndRestart method.

The method declaration has extra leading spaces causing inconsistent indentation with other methods in the class.

🔎 Proposed fix
-        handleClearAndRestart() {
+    handleClearAndRestart() {
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
    handleClearAndRestart() {
        // Clear the current session and responses
        this.responses = [];
        this.currentResponseIndex = -1;
        this.startTime = null;

        // Stop any ongoing capture if in assistant view
        if (this.currentView === 'assistant' && window.cheddar) {
            window.cheddar.stopCapture();
        }

        // Return to main view
        this.currentView = 'main';
        this.setStatus('Session cleared. Starting new session...');

        // Request update to refresh the UI
        this.requestUpdate();

        // Automatically start a new session after a brief delay
        setTimeout(() => {
            this.handleStart();
        }, 100);
    }
🤖 Prompt for AI Agents
In src/components/app/CheatingDaddyApp.js around lines 366 to 388, the
handleClearAndRestart() method has extra leading spaces causing inconsistent
indentation with other class methods; remove the extra leading spaces so the
method declaration and its body align with the class's method indentation,
ensure brackets and inner lines are indented one level consistently, and run the
project's formatter/linter to enforce the same style across the file.

Comment on lines +637 to +654
    setupChatProviderListener() {
        // Listen for localStorage changes for chat provider (from other windows/tabs)
        window.addEventListener('storage', (e) => {
            if (e.key === 'chatProvider') {
                this.chatProvider = e.newValue || 'gemini';
                this.requestUpdate();
            }
        });

        // Also check periodically to catch direct localStorage changes in same window
        this._providerCheckInterval = setInterval(() => {
            const currentProvider = localStorage.getItem('chatProvider') || 'gemini';
            if (currentProvider !== this.chatProvider) {
                this.chatProvider = currentProvider;
                this.requestUpdate();
            }
        }, 1000);
    }

⚠️ Potential issue | 🟠 Major

Memory leak: interval and storage listener are never cleaned up.

setupChatProviderListener creates both a storage event listener and a setInterval that are never cleared in disconnectedCallback. This causes:

  1. Memory leak from accumulating storage listeners on repeated mounts
  2. Interval continues running after component unmount
🔎 Proposed fix
     setupChatProviderListener() {
         // Listen for localStorage changes for chat provider (from other windows/tabs)
-        window.addEventListener('storage', (e) => {
+        this._onProviderStorageChange = (e) => {
             if (e.key === 'chatProvider') {
                 this.chatProvider = e.newValue || 'gemini';
                 this.requestUpdate();
             }
-        });
+        };
+        window.addEventListener('storage', this._onProviderStorageChange);
         
         // Also check periodically to catch direct localStorage changes in same window
         this._providerCheckInterval = setInterval(() => {
             const currentProvider = localStorage.getItem('chatProvider') || 'gemini';
             if (currentProvider !== this.chatProvider) {
                 this.chatProvider = currentProvider;
                 this.requestUpdate();
             }
         }, 1000);
     }

And in disconnectedCallback:

     disconnectedCallback() {
         super.disconnectedCallback();
+        
+        // Clean up provider listener and interval
+        if (this._onProviderStorageChange) {
+            window.removeEventListener('storage', this._onProviderStorageChange);
+        }
+        if (this._providerCheckInterval) {
+            clearInterval(this._providerCheckInterval);
+        }

         // Clean up IPC listeners
         if (window.require) {
🤖 Prompt for AI Agents
In src/components/views/AssistantView.js around lines 637 to 654,
setupChatProviderListener registers a window 'storage' event handler and a
setInterval but does not remove them on unmount; add cleanup in
disconnectedCallback to remove the storage listener and clear the interval.
Store the listener function (e.g. this._onStorageChange) and the interval id
(this._providerCheckInterval) when creating them, then in disconnectedCallback
call window.removeEventListener('storage', this._onStorageChange) and
clearInterval(this._providerCheckInterval) (and null them afterwards) to prevent
memory leaks and stop the background polling.

Comment on lines +632 to 642
    // Check if we're in coding mode
    const selectedMode = localStorage.getItem('selectedMode') || 'interview';

    if (selectedMode === 'coding') {
        // For coding mode, ONLY send screenshot - system prompt will handle it
        await captureScreenshot(quality, true);
    } else {
        // For interview mode, just send screenshot
        await captureScreenshot(quality, true);
    }
}

🛠️ Refactor suggestion | 🟠 Major

Redundant if/else branches—both paths identical

Lines 635-641 branch on selectedMode, but both the coding and interview paths call captureScreenshot(quality, true) with no other differences. The comments suggest different handling was intended, but the implementation is the same.

Either:

  • Remove the conditional if no mode-specific logic is needed, or
  • Implement the differentiated behavior hinted at by the comments (e.g., different prompts, different processing).

As-is, the code is misleading and adds unnecessary complexity.

🤖 Prompt for AI Agents
In src/utils/renderer.js around lines 632 to 642, the if/else on selectedMode is
redundant because both branches call captureScreenshot(quality, true); either
remove the conditional and always call captureScreenshot(quality, true), or
implement the intended mode-specific behavior: for coding mode call
captureScreenshot(quality, true) and then apply the coding-specific handling
(e.g., attach a system prompt or different processing), while for interview mode
call captureScreenshot(quality, true) and apply the interview-specific handling;
update or remove the comments to reflect the chosen behavior.

Comment on lines 696 to +799
async function sendTextMessage(text) {
    if (!text || text.trim().length === 0) {
        console.warn('Cannot send empty text message');
        return { success: false, error: 'Empty message' };
    }

    try {
        // Check if we're using Ollama - if so, send text-only message directly
        const chatProvider = localStorage.getItem('chatProvider') || 'gemini';
        if (chatProvider === 'ollama') {
            console.log('Sending text-only message to Ollama...');
            // Use the IPC handler directly for text-only messages with Ollama
            const result = await ipcRenderer.invoke('send-text-message', text.trim());
            return result;
        }

        // Capture screenshot and get base64 data (for Gemini mode)
        console.log('Capturing screenshot with text message...');

        if (!mediaStream) {
            console.error('No media stream available');
            return { success: false, error: 'No media stream' };
        }

        // Lazy init of video element if needed
        if (!hiddenVideo) {
            hiddenVideo = document.createElement('video');
            hiddenVideo.srcObject = mediaStream;
            hiddenVideo.muted = true;
            hiddenVideo.playsInline = true;
            await hiddenVideo.play();

            await new Promise(resolve => {
                if (hiddenVideo.readyState >= 2) return resolve();
                hiddenVideo.onloadedmetadata = () => resolve();
            });

            offscreenCanvas = document.createElement('canvas');
            offscreenCanvas.width = hiddenVideo.videoWidth;
            offscreenCanvas.height = hiddenVideo.videoHeight;
            offscreenContext = offscreenCanvas.getContext('2d');
        }

        // Check if video is ready
        if (hiddenVideo.readyState < 2) {
            console.warn('Video not ready');
            return { success: false, error: 'Video not ready' };
        }

        offscreenContext.drawImage(hiddenVideo, 0, 0, offscreenCanvas.width, offscreenCanvas.height);

        // Get quality setting
        let qualityValue;
        switch (currentImageQuality) {
            case 'high':
                qualityValue = 0.9;
                break;
            case 'medium':
                qualityValue = 0.7;
                break;
            case 'low':
                qualityValue = 0.5;
                break;
            default:
                qualityValue = 0.7;
        }

        // Convert canvas to base64
        const blob = await new Promise(resolve => {
            offscreenCanvas.toBlob(resolve, 'image/jpeg', qualityValue);
        });

        if (!blob) {
            console.error('Failed to create blob');
            return { success: false, error: 'Failed to create blob' };
        }

        const reader = new FileReader();
        const base64data = await new Promise((resolve, reject) => {
            reader.onloadend = () => resolve(reader.result.split(',')[1]);
            reader.onerror = reject;
            reader.readAsDataURL(blob);
        });

        // Send both screenshot and text together in one request
        const result = await ipcRenderer.invoke('send-screenshot-with-text', {
            imageData: base64data,
            text: text.trim()
        });

        if (result.success) {
            // Track image tokens
            const imageTokens = tokenTracker.calculateImageTokens(offscreenCanvas.width, offscreenCanvas.height);
            tokenTracker.addTokens(imageTokens, 'image');
            console.log('Screenshot + text sent successfully in one request');
        } else {
            console.error('Failed to send screenshot with text:', result.error);
        }

        return result;
    } catch (error) {
        console.error('Error sending text message with screenshot:', error);
        return { success: false, error: error.message };
    }
}

// Conversation storage functions using IndexedDB
let conversationDB = null;

async function initConversationStorage() {
    return new Promise((resolve, reject) => {
        const request = indexedDB.open('ConversationHistory', 1);

        request.onerror = () => reject(request.error);
        request.onsuccess = () => {
            conversationDB = request.result;
            resolve(conversationDB);
        };

        request.onupgradeneeded = event => {
            const db = event.target.result;

            // Create sessions store
            if (!db.objectStoreNames.contains('sessions')) {
                const sessionStore = db.createObjectStore('sessions', { keyPath: 'sessionId' });
                sessionStore.createIndex('timestamp', 'timestamp', { unique: false });
            }
        };
    });
}

async function saveConversationSession(sessionId, conversationHistory) {
    if (!conversationDB) {
        await initConversationStorage();
    }

    const transaction = conversationDB.transaction(['sessions'], 'readwrite');
    const store = transaction.objectStore('sessions');

    const sessionData = {
        sessionId: sessionId,
        timestamp: parseInt(sessionId),
        conversationHistory: conversationHistory,
        lastUpdated: Date.now(),
    };

    return new Promise((resolve, reject) => {
        const request = store.put(sessionData);
        request.onerror = () => reject(request.error);
        request.onsuccess = () => resolve(request.result);
    });
}

async function getConversationSession(sessionId) {
    if (!conversationDB) {
        await initConversationStorage();
    }

    const transaction = conversationDB.transaction(['sessions'], 'readonly');
    const store = transaction.objectStore('sessions');

    return new Promise((resolve, reject) => {
        const request = store.get(sessionId);
        request.onerror = () => reject(request.error);
        request.onsuccess = () => resolve(request.result);
    });
}

async function getAllConversationSessions() {
    if (!conversationDB) {
        await initConversationStorage();
    }

    const transaction = conversationDB.transaction(['sessions'], 'readonly');
    const store = transaction.objectStore('sessions');
    const index = store.index('timestamp');

    return new Promise((resolve, reject) => {
        const request = index.getAll();
        request.onerror = () => reject(request.error);
        request.onsuccess = () => {
            // Sort by timestamp descending (newest first)
            const sessions = request.result.sort((a, b) => b.timestamp - a.timestamp);
            resolve(sessions);
        };
    });
}

// Listen for conversation data from main process
ipcRenderer.on('save-conversation-turn', async (event, data) => {
    try {
        await saveConversationSession(data.sessionId, data.fullHistory);
        console.log('Conversation session saved:', data.sessionId);
    } catch (error) {
        console.error('Error saving conversation session:', error);
    }
});

// Initialize conversation storage when renderer loads
initConversationStorage().catch(console.error);

⚠️ Potential issue | 🟠 Major

sendTextMessage requires mediaStream for Gemini, breaking text-only chat

Lines 715-718 return an error if mediaStream is unavailable, but this is called for all Gemini text messages (including when the user types in the assistant view without starting screen capture). This breaks text-only chat unless capture is active.

Additionally, the function name sendTextMessage doesn't convey that it captures and sends a screenshot alongside the text. Consider:

  • Renaming to sendTextMessageWithScreenshot for clarity, or
  • Making the screenshot optional and falling back to text-only if mediaStream is unavailable, or
  • Documenting the mediaStream requirement prominently.

For Ollama (lines 704-710), text-only works correctly.
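The "optional screenshot" fallback could be sketched as below, with the IPC and capture dependencies injected so the behavior is easy to see in isolation (the channel names mirror the PR's code; the dependency-injection shape is purely illustrative):

```javascript
// Degrades gracefully to a text-only message when no capture is running,
// instead of failing outright on a missing mediaStream.
async function sendTextMessage(text, { mediaStream, invoke, capture }) {
    if (!text || !text.trim()) {
        return { success: false, error: 'Empty message' };
    }
    if (!mediaStream) {
        // No screen capture active: fall back to text-only.
        return invoke('send-text-message', text.trim());
    }
    // Capture is active: attach a screenshot as before.
    const imageData = await capture();
    return invoke('send-screenshot-with-text', { imageData, text: text.trim() });
}
```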

Comment on lines +23 to +51
function checkFile(filePath, checks) {
    const fullPath = path.join(projectRoot, 'src', filePath);

    if (!fs.existsSync(fullPath)) {
        console.log(`❌ File not found: ${filePath}`);
        checks.failed++;
        return false;
    }

    const content = fs.readFileSync(fullPath, 'utf8');

    let allChecksPassed = true;

    checks.forEach(check => {
        const found = check.regex ? check.regex.test(content) : content.includes(check.text);

        if (found) {
            console.log(`✅ ${check.description}`);
            checks.passed++;
        } else {
            console.log(`❌ ${check.description}`);
            console.log(`   Expected: ${check.text || check.regex}`);
            checks.failed++;
            allChecksPassed = false;
        }
    });

    return allChecksPassed;
}

⚠️ Potential issue | 🔴 Critical

Parameter shadows outer checks object, breaking counters

Line 23 declares a parameter named checks, which shadows the outer checks object (line 17) that tracks passed, failed, and warnings counts. Inside checkFile, lines 28, 42, etc. reference checks.failed++ and checks.passed++, but these now refer to the array parameter, not the outer counter object. This breaks the verification logic.

Rename the parameter to avoid shadowing:

🔎 Proposed fix
-function checkFile(filePath, checks) {
+function checkFile(filePath, fileChecks) {
     const fullPath = path.join(projectRoot, 'src', filePath);
     
     if (!fs.existsSync(fullPath)) {
         console.log(`❌ File not found: ${filePath}`);
         checks.failed++;
         return false;
     }
     
     const content = fs.readFileSync(fullPath, 'utf8');
     
     let allChecksPassed = true;
     
-    checks.forEach(check => {
+    fileChecks.forEach(check => {
         const found = check.regex ? check.regex.test(content) : content.includes(check.text);
         
         if (found) {
             console.log(`✅ ${check.description}`);
             checks.passed++;
         } else {
             console.log(`❌ ${check.description}`);
             console.log(`   Expected: ${check.text || check.regex}`);
             checks.failed++;
             allChecksPassed = false;
         }
     });
     
     return allChecksPassed;
 }
🤖 Prompt for AI Agents
In verify-ollama-integration.js around lines 23 to 51 the function parameter
named "checks" shadows the outer counters object (defined around line 17),
causing checks.passed/failed increments to operate on the wrong value; rename
the function parameter (e.g., to "fileChecks" or "checksList"), update the
forEach and any references inside the function to use that new name, and leave
the outer "checks" counter object referenced when incrementing
checks.passed/checks.failed so the global counters are updated correctly.
