AOAC SPADA VNGS - Final

The domain for VNGS shall be defined and include the term Biothreat Agent Next Generation Sequences.

(m) Stable URIs and versioning

Stable URIs for the terms, concepts, and versioning of VNGS shall be maintained by the sequence provider.

(n) Raw Sequence Data

All raw sequence data shall be available with each VNGS. The possible sequence formats are FASTQ (26, 27, 28), FAST5 (29), and pod5 (30). In the case of FAST5, these files may be converted to FASTQ if a human-readable format is required.
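As an illustration of why FASTQ serves as the human-readable option, the following minimal sketch reads FASTQ records from a text stream. It assumes the common four-lines-per-record layout (no line-wrapped sequences); the names `FastqRecord` and `read_fastq` are hypothetical and are not defined by this standard.

```python
from typing import Iterator, NamedTuple, TextIO


class FastqRecord(NamedTuple):
    name: str      # header text without the leading "@"
    sequence: str  # base calls
    quality: str   # per-base quality string, same length as sequence


def read_fastq(handle: TextIO) -> Iterator[FastqRecord]:
    """Yield records from a FASTQ stream (assumes four lines per record)."""
    while True:
        header = handle.readline()
        if not header:
            return  # end of stream
        if not header.strip():
            continue  # tolerate blank lines between records
        sequence = handle.readline().rstrip("\n")
        separator = handle.readline()
        quality = handle.readline().rstrip("\n")
        if not header.startswith("@") or not separator.startswith("+"):
            raise ValueError("malformed FASTQ record")
        if len(sequence) != len(quality):
            raise ValueError("sequence and quality lengths differ")
        yield FastqRecord(header[1:].strip(), sequence, quality)
```

A caller might pass an open text file, for example `for record in read_fastq(open("reads.fastq")): ...`, where `reads.fastq` is a placeholder file name.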

(o) Aligned Sequence Data

Aligned sequences shall be included as Binary Alignment/Map (BAM) formatted files (31, 32).
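A submission pipeline could run a quick sanity check that a file is plausibly BAM before accepting it. The sketch below relies on two properties of the format: BAM files are BGZF-compressed, which is gzip-compatible, and the decompressed stream begins with the four magic bytes `BAM\1`. The function name `looks_like_bam` is hypothetical, and passing this check does not show that the file is a complete or valid alignment.

```python
import gzip

# The decompressed BAM stream starts with these four bytes.
BAM_MAGIC = b"BAM\x01"


def looks_like_bam(path: str) -> bool:
    """Return True if the file decompresses as gzip/BGZF and starts with the BAM magic."""
    try:
        with gzip.open(path, "rb") as handle:
            return handle.read(4) == BAM_MAGIC
    except (OSError, EOFError):
        # Not gzip-compressed, unreadable, or truncated.
        return False
```

For full validation, a dedicated tool or library would be needed; this check only rejects files that are clearly not BAM.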

(p) Annotation Formats

Annotation formats shall include Browser Extensible Data (BED) Format (33), Wiggle Track Format (WIG) (34), General Feature Format (GFF3) (35), Variant Call Format (VCF) (36), Gene Transfer Format (GTF) (37), Genome Variation Format (GVF) (38) and/or Synthetic Biology Open Language (SBOL) (39).
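One way a repository might screen submitted annotation files is a lookup from file extension to the formats listed above. This is only a sketch: the extension mapping reflects common naming conventions and is an assumption, not something this standard specifies, and the names `ANNOTATION_FORMATS` and `annotation_format` are hypothetical. SBOL is left out of the mapping because it is commonly serialized as RDF/XML, so an extension alone is not assumed to identify it.

```python
from pathlib import PurePath
from typing import Optional

# Assumed conventional extensions for the annotation formats named in the text.
ANNOTATION_FORMATS = {
    ".bed": "BED",
    ".wig": "WIG",
    ".gff3": "GFF3",
    ".vcf": "VCF",
    ".gtf": "GTF",
    ".gvf": "GVF",
}


def annotation_format(filename: str) -> Optional[str]:
    """Guess the annotation format from the file name, ignoring a trailing .gz."""
    suffixes = [s.lower() for s in PurePath(filename).suffixes]
    if suffixes and suffixes[-1] == ".gz":
        suffixes.pop()  # look through gzip compression, e.g. calls.vcf.gz
    if not suffixes:
        return None
    return ANNOTATION_FORMATS.get(suffixes[-1])
```

An extension check is only a first filter; confirming that the content conforms to the named format requires a format-specific validator.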

(q) Sequence Instrument Quality Metrics

(1) Base quality score. Statistical algorithms used for base calling shall be known, verified, and converted to a Q score (26, 27). The average base quality score shall be Q>20. The single base quality score for the targeted region shall be Q>30.
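The two thresholds can be checked mechanically once quality values are decoded. The sketch below uses the standard Phred relation Q = -10 log10(P), where P is the estimated base-calling error probability, and assumes FASTQ quality strings with an ASCII offset of 33. It also assumes that "average" means the arithmetic mean of the Q scores and that the targeted region is given as a slice of read positions; the standard does not specify either detail, and all function names are hypothetical.

```python
import math
from typing import List, Sequence


def phred_from_error(p_error: float) -> float:
    """Convert an error probability to a Phred quality score: Q = -10 * log10(P)."""
    return -10.0 * math.log10(p_error)


def decode_quality(quality: str, offset: int = 33) -> List[int]:
    """Decode a FASTQ quality string into Q scores (offset 33 is an assumption)."""
    return [ord(ch) - offset for ch in quality]


def meets_thresholds(
    scores: Sequence[int],
    target: slice,
    min_mean: float = 20.0,
    min_target: float = 30.0,
) -> bool:
    """Check mean Q > min_mean overall and every Q > min_target inside the target slice."""
    if not scores:
        return False
    mean_q = sum(scores) / len(scores)  # arithmetic mean of Q scores (assumption)
    return mean_q > min_mean and all(q > min_target for q in scores[target])
```

For example, an error probability of 0.001 corresponds to Q30 and 0.01 to Q20, so the targeted-region threshold demands a tenfold lower per-base error estimate than the average threshold.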

AOAC Draft Standard – Version 09282022; Public Comment Revisions
