To analyse the myogenic transcriptome and identify novel genes involved in muscle development in an in vivo context, we have constructed a muscle specific cDNA library from GFP-expressing myoblasts purified by fluorescent activated cell sorting of transgenic zebrafish embryos. We have generated 153,428 EST sequences from this library that have been clustered into consensi, mapped to the genome assembly Zv6 and analysed for protein homology. Expression analysis of a randomly picked sample of clones using whole mount in situ hybridisation, identified 30 genes that are expressed specifically within the myotome, one third of which represent novel sequences. These genes have been assigned to syn-expression groups. The sequencing of the myoblast enriched cDNA library has significantly increased the number of zebrafish ESTs, facilitating the prediction of new spliced transcripts in the genome assembly and providing a transcriptome of an in vivo myoblast cell.