about the audioset-laughter annotations

Hi, thanks a lot for the contribution and the repository. 

I have four questions about the audioset annotations (I'll call that 999-element set the "audioset-laughter" set from here on):
1. Some annotations look odd, with start = end = 0 (examples are on lines 7, 29, 80, 88, 95, 102, and there are more). Is that a special annotation (e.g., does it mean the whole file contains laughter)? I don't understand what a zero-length laugh segment means.

2. Does "window_start" correspond to the time in the YouTube video at which the recorded audio snippet starts?

3. "audio_length" and "window_length" seem to be equal in every row. I'm guessing that is the length (in seconds) of the recorded audio snippet I described above. Is that correct?

4. I think [this script](https://github.com/jrgillick/laughter-detection/blob/master/scripts/download_audio_set_mp3s.py) downloads mp3 audio files for the YouTube videos listed in a csv. Some csv files can be downloaded using [this script](https://github.com/jrgillick/laughter-detection/blob/master/scripts/download_audioset_metadata.sh), but none of the csv files there seems to correspond exactly to the clips in the audioset-laughter annotations: 950 of the IDs in "unbalanced_train_segments.csv" and 38 of the IDs in "eval_segments.csv" match audioset-laughter IDs, but even together these do not cover the full list. Is there a csv that can be fed to the download script to download just the audioset-laughter audio files?
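
In case it helps to make question 4 concrete, here is a minimal sketch of how I imagine such a csv could be assembled from the existing AudioSet segment files. It assumes the segment csvs use `#`-prefixed header lines and have the YouTube ID as the first comma-separated field; the function name `filter_segments` and the way the audioset-laughter IDs are collected into a set are my own placeholders, and I have not checked whether the download script accepts the result unchanged.

```python
from typing import Iterable, List, Set, Tuple


def filter_segments(
    lines: Iterable[str], wanted_ids: Set[str]
) -> Tuple[List[str], Set[str]]:
    """Keep header lines and the rows whose first field is a wanted ID.

    Returns the kept lines (unchanged) and the wanted IDs that were not found.
    """
    kept: List[str] = []
    found: Set[str] = set()
    for line in lines:
        if line.startswith("#"):
            # Assumed AudioSet convention: header/comment lines start with '#'.
            kept.append(line)
            continue
        if not line.strip():
            continue  # skip blank lines
        # Assumed layout: the YouTube ID is the first comma-separated field.
        ytid = line.split(",", 1)[0].strip()
        if ytid in wanted_ids:
            kept.append(line)
            found.add(ytid)
    return kept, wanted_ids - found
```

Running this over "unbalanced_train_segments.csv" and "eval_segments.csv" with the audioset-laughter IDs as `wanted_ids` would give the matching rows, and the second return value would list the IDs that are still missing from each file.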

