Skip to content

SessionRegFileSamplesCsv

Brian Fox edited this page Sep 11, 2022 · 4 revisions

Home | Expression Sessions | Files

Sample annotations [expr.samples.csv or GEO.samples.csv]

This file contains information about each sample. Here are some important facts about this file:

  • CSV format (quotes only needed if the value has a comma in it)
  • the first row is a header where the column names are the annotation keys
  • the first field in each row is the sample id, and so it needs to match the column names of the expression matrix file (same ids, same order, same count)
  • the sample id should be follow standard R variable naming convention (i.e. must start with a letter and can only be a combination of letters, digits, ., and _)
  • each row (after the header) has the annotations for each sample, with annotation values matching the keys at of the header row
  • in the header row (first line), the column name of the first column (sample ids) must be an empty string or _id. In other words, the first word of the first line of the file needs to be "_id" or "" followed by a comma.
  • column names cannot have a comma in them or a pipe |
  • cannot repeat any of the column names
  • every line in the file needs to have the same number of fields
  • special characters should be avoided (i.e. >, <, =, ...), especially in columns where scores will be computed

Example of a part of a file:

Clone this wiki locally