Elucidation of missing exons and transcriptional start site for zebrafish col2a1b: A. Schematic of comparisons of the Ensembl (top) and NCBI (bottom) genomic sequence of the col2a1b gene compared to our updated sequence annotation (middle). The green, blue, and pink squares represent the exons for Ensembl's annotation, current annotation, and NCBI's annotation respectively. Diagram shows that for both the Ensembl and the NCBI annotations, they are missing exons 1, 2, 3, 4, and 8 in both. Ensembl is also missing exons 5, 6, and 9. NCBI is missing the last exon, updated exon 54, that Ensembl does have labeled as the 46th exon. Ensembl has 46 exons for col2a1b in total spanning from 55,224 to 32,007 on the reverse strand of chromosome 11 of the current reference genome: GRCz11 (GCA_000002035.4). NCBI has a total of 50 exons for their annotation spanning from 58,763 to 32,007 on the reverse strand of chromosome 11 of the current reference genome: GRCz11 (GCA_000002035.4) Current annotation of col2a1b has 54 exons and spans from 95,275 to 60,649 on Danio rerio strain Tubingen chromosome 11 genome assembly from NHGRI (GenBank: CP137024.3). Orange and yellow square and grey and pink square represent identified regulatory elements and transcriptional promoter elements of col2a1b. Red lines in schematic for NCBI and Ensembl represent the start of poor sequencing quality upstream of the identified first exon. B. Schematic of the basal promoter elements for col2a1b. BRE (TFIIB Recognition Element), INR (Initiator Element), and DPE (Downstream Promoter Element) were identified.

Acknowledgments
This image is the copyrighted work of the attributed author or publisher, and ZFIN has permission only to display this image to its users. Additional permissions should be obtained from the applicable author or publisher of the image. Full text @ MicroPubl Biol