Investigate Question-Answering models working on tables

## Context
- Traditional transformers-based models for extractive question-answering tasks operate on contexts that are units of texts in natural language, e.g. a sentence or a paragraph.
- However, in many cases the values of parameters of interest for our neuroscientific applications are contained into tables of articles rather than in the text.
- For instance, the Wikipedia article on Michaelis constant ([here](https://en.wikipedia.org/wiki/Michaelis%E2%80%93Menten_kinetics)) contains several values for this parameter of interest for us, but they are all in a table and no value is mentioned in the text. In fact this is not an isolated case: it's really hard to find Michaelis constant values in the text of any scientific article!
  <img width="467" alt="Screen Shot 2022-08-18 at 11 13 23" src="https://user-images.githubusercontent.com/17013890/185358262-410ef444-0d5e-4d7f-8974-53f734de417f.png">
- There seem to be some models for question-answering that can operate on tabular or text/tabular mixed contexts, like [TAPAS](https://huggingface.co/google/tapas-base-finetuned-wtq).


## Actions
- [ ] How should the tables be represented for TAPAS (or another model) to be able to take it in input (html? csv? ...) ?
  Is this format compatible with what we can get out our parsing pipeline for the various formats (arXiv, medRxiv, bioRxiv, PMC, PubMed, ...) when the article contains a table?
- [ ] Can TAPAS take mixed inputs, i.e. contexts containing both text _and_ tables?
- [ ] How does [`TableQuestionAnsweringPipeline`](https://huggingface.co/docs/transformers/main_classes/pipelines#transformers.TableQuestionAnsweringPipeline) differ from [`QuestionAnsweringPipeline`](https://huggingface.co/docs/transformers/main_classes/pipelines#transformers.QuestionAnsweringPipeline) in `🤗 transformers`?
- [ ] Are there any other models a part from TAPAS that support question-answering on tabular contexts?
- [ ] Test TAPAS (or another model) on a sample related to neruoscience to see if it could potentially work on our use case.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Investigate Question-Answering models working on tables #614

Context

Actions

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Investigate Question-Answering models working on tables #614

Description

Context

Actions

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions