Restrict DOI ingestion to specific file formats and structures #15
Open
Labels: bug (Something isn't working)
Description
The current DOI ingestion function reads the input file line by line, regardless of its format or structure, so any file type is accepted. Current code:
with open(args.list_of_dois, "r") as csv_file:
    for line in csv_file:
        list_of_dois.append(line.strip())
Problem: This code does not validate the file type, so it attempts to process any input file (e.g., .txt, .csv, .json, .yaml). This works for line-based formats, but a file with a different format or structure is still read as if every line were a DOI, which could lead to issues.
Also, if an invalid .csv file is passed, the pipeline has no failure feedback mechanism: it still reports a Success message.
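One possible direction, sketched here only as a starting point: check the file extension up front and reject any line that does not look like a DOI, so that bad input raises an error instead of ending in a Success message. The helper name `load_dois`, the allowed extensions, and the DOI pattern are all assumptions for illustration; the pattern is a loose shape check (`10.` prefix, registrant code, slash, suffix), not a full validator.

```python
import re
from pathlib import Path

# Assumption: only plain line-based lists are accepted; adjust as needed.
ALLOWED_SUFFIXES = {".txt", ".csv"}

# Loose shape check for a DOI: "10.", 4-9 digits, "/", non-space suffix.
DOI_PATTERN = re.compile(r"^10\.\d{4,9}/\S+$")


def load_dois(path):
    """Read one DOI per line, raising ValueError on unsupported or invalid input."""
    path = Path(path)
    if path.suffix.lower() not in ALLOWED_SUFFIXES:
        raise ValueError(
            f"Unsupported file type {path.suffix!r}; expected one of {sorted(ALLOWED_SUFFIXES)}"
        )

    dois = []
    with open(path, "r", encoding="utf-8") as handle:
        for line_number, line in enumerate(handle, start=1):
            value = line.strip()
            if not value:
                continue  # skip blank lines
            if not DOI_PATTERN.match(value):
                raise ValueError(f"{path}:{line_number}: not a valid DOI: {value!r}")
            dois.append(value)

    if not dois:
        raise ValueError(f"{path} contains no DOIs")
    return dois
```

The caller could then replace the loop above with `list_of_dois = load_dois(args.list_of_dois)` and, on `ValueError`, print the message and exit with a non-zero status so the pipeline no longer reports success for invalid input.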