(a) The information is organized into fields, each with an identifier, shown as the first text on each line
(b) In some entries, these identifiers may be abbreviated to two letters, e.g., RF for reference
(c) Some identifiers may have additional subfields
(d) The CDS subfield in the field FEATURES does not offer the amino acid sequence
I got this question in a job interview.
This question comes from the section Sequence Formats & Computer Storage of Sequences, in the chapter Collecting & Storing Sequences in Laboratory, of Bioinformatics.
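To make the field layout in the options more concrete, here is a minimal Python sketch of reading a flat-file entry whose identifiers are the first text on each line, with indented lines treated as subfields. The entry itself is a toy record invented for illustration (loosely GenBank-like, not a real database entry), and `parse_fields` is a hypothetical helper, not part of any library. Real records have stricter column rules, so treat this as a sketch under those assumptions. The toy CDS line carries a `/translation` qualifier, which is where GenBank-style records commonly store the amino acid sequence.

```python
# Toy entry, invented for illustration only; not a real database record.
SAMPLE_ENTRY = """\
LOCUS       TOY0001
DEFINITION  Hypothetical example gene.
REFERENCE   1
  AUTHORS   Doe,J.
  TITLE     An invented title
FEATURES    Location/Qualifiers
     CDS    1..9
            /translation="MK"
"""


def parse_fields(text):
    """Group lines by top-level identifier; indented lines are kept as subfield lines."""
    fields = {}
    current = None
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        if not line[0].isspace():
            # A top-level field: the identifier is the first text on the line.
            identifier, _, value = line.partition(" ")
            current = identifier
            fields[current] = {"value": value.strip(), "sub": []}
        elif current is not None:
            # An indented line belongs to the most recent top-level field.
            fields[current]["sub"].append(line.strip())
    return fields


fields = parse_fields(SAMPLE_ENTRY)
print(list(fields))                 # top-level identifiers, in order
print(fields["REFERENCE"]["sub"])   # subfields under REFERENCE
print(fields["FEATURES"]["sub"])    # CDS line and its qualifier
```

A dictionary keyed by identifier is enough here because each identifier appears once in the toy entry. A real entry can repeat fields such as REFERENCE, so a list of records would be a safer structure there.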