-
Notifications
You must be signed in to change notification settings - Fork 0
Open
Description
I'm seeing Whisper do goofy, incorrect stuff sometimes.
One is the insertion of earlier phrases later on in transcripts, sprinkled through the transcript.
Another is repeating a phrase dozens of times:
[00:04:05.000 --> 00:04:08.000] >> A question whether that is necessary for an objection
[00:04:08.000 --> 00:04:11.000] to the question.
[00:04:11.000 --> 00:04:14.000] >> I think that's a good question. I think that's a good
[00:04:14.000 --> 00:04:17.000] question. I think that's a good question.
[00:04:17.000 --> 00:04:20.000] >> I think that's a good question. I think that's a good
[00:04:20.000 --> 00:04:23.000] question. I think that's a good question. I think that's a
[00:04:23.000 --> 00:04:26.000] good question. I think that's a good question. I think that's
[00:04:26.000 --> 00:04:29.000] a good question. I think that's a good question. I think that's
[00:04:29.000 --> 00:04:32.000] a good question.
[00:04:32.000 --> 00:04:35.000] >> I think that's a good question. I think that's a good
[00:04:35.000 --> 00:04:38.000] question. I think that's a good question.
Identify a lightweight way to identify these artifacts in transcripts and flag them for review or re-processing.
Metadata
Metadata
Assignees
Labels
No labels