questions: get plain text from common crawl

Dear Mr. Sebastian Nagel @sebastian-nagel,
I am the team member of Fordham University S & T team. Would you help me to get plain text content from common crawl.  I have collected some useful URLs by using common crawl index API, it that possible to use these URLs together with WET file to crawl web text content? Is that necessary to use WARC file at the same time?  Thank you so much.

Regards,
Liyi Li.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

questions: get plain text from common crawl #1

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

questions: get plain text from common crawl #1

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions