finetune preprocessing adding padding to the dataset error

I downloaded the pretrained small model and was trying to fine tune for question/answer using "squad"

Here is where I am running into trouble

`      # add padding so the dataset is a multiple of batch_size
      while n_examples % batch_size != 0:
        writer.write(self._make_tf_example(task_id=len(self._config.task_names))
                     .SerializeToString())`

The above _make_tf_example() call is throwing the following error...

Traceback (most recent call last):
  File "/Users/joelsprunger/Documents/electra/finetune/preprocessing.py", line 111, in serialize_examples
    writer.write(self._make_tf_example(task_id=len(self._config.task_names))
  File "/Users/joelsprunger/Documents/electra/finetune/preprocessing.py", line 141, in _make_tf_example
    value=list(values)))
TypeError: array(0) has type numpy.ndarray, but expected one of: int

When I debug inside this call it looks like the _feature_spec.name == 'squad_eid' is returning array(0) rather than a list of zeros for the following line. 

`values = spec.get_default_values()`

Not sure if this is a bug, or I have done something wrong.



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

finetune preprocessing adding padding to the dataset error #137

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

finetune preprocessing adding padding to the dataset error #137

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions