-
Notifications
You must be signed in to change notification settings - Fork 0
Open
Description
Currently, the scraper removes all of the HTML tags, but some of them are actually useful. The tag is used to represent emphasis by the comedian, and some of the transcripts are formatted such that jokes are contained within
tags and when they are removed the output becomes a difficult to decern mess.
Ideally, we want to selectively remove tags and use them to better classify the jokes.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels