Dear Mr. Sebastian Nagel @sebastian-nagel,
I am the team member of Fordham University S & T team. Would you help me to get plain text content from common crawl. I have collected some useful URLs by using common crawl index API, it that possible to use these URLs together with WET file to crawl web text content? Is that necessary to use WARC file at the same time? Thank you so much.
Regards,
Liyi Li.
Dear Mr. Sebastian Nagel @sebastian-nagel,
I am the team member of Fordham University S & T team. Would you help me to get plain text content from common crawl. I have collected some useful URLs by using common crawl index API, it that possible to use these URLs together with WET file to crawl web text content? Is that necessary to use WARC file at the same time? Thank you so much.
Regards,
Liyi Li.