SessionRegFileSamplesCsv

Jump to bottom

Brian Fox edited this page Sep 11, 2022 · 4 revisions

Home | Expression Sessions | Files

Sample annotations [expr.samples.csv or GEO.samples.csv]

This file contains information about each sample. Here are some important facts about this file:

CSV format (quotes only needed if the value has a comma in it)
the first row is a header where the column names are the annotation keys
the first field in each row is the sample id, and so it needs to match the column names of the expression matrix file (same ids, same order, same count)
the sample id should be follow standard R variable naming convention (i.e. must start with a letter and can only be a combination of letters, digits, ., and _)
each row (after the header) has the annotations for each sample, with annotation values matching the keys at of the header row
in the header row (first line), the column name of the first column (sample ids) must be an empty string or _id. In other words, the first word of the first line of the file needs to be "_id" or "" followed by a comma.
column names cannot have a comma in them or a pipe |
cannot repeat any of the column names
every line in the file needs to have the same number of fields
special characters should be avoided (i.e. >, <, =, ...), especially in columns where scores will be computed

Example of a part of a file:

(c) 2015-2025, Needle Genomics LLC