Cryptococcus neoformans Serotype A Strain H99 Sequence

This sequence data combines whole genome plasmid and BAC end sequences generated by three centers. Approximately 80,000 plasmid end reads and sequence walks come from the Duke IGSP Center for Applied Genomics & Technology. Aproximately 5000 H99 BAC end sequences were generated by Jim Kronstad and the Vancouver Genome Sequence Centre (http://www.bcgsc.bc.ca/). An additional 300,000 plasmid and BAC end sequences have been generated as a part of the Whitehead Institute Fungal Genome Initative (http://www-genome.wi.mit.edu/annotation/fungi/fgi/). These sequence reads have been screened and low quality sequences, vector without insert, and assembled. We estimate that this sequence represents over approximately 10X coverage of the genome. The total size of the genome is estimated at 19.5 Mb; we currently have approximately 50 sequence gaps in the genome, along with some same contigs comprised of repetitive sequence. Work is ongoing to resequence the remaining low quality sequence regions and to span the remaining gaps in the genome.

Our group is interested in the genomic structure of Cryptococcus, in the diversity in the population of Cryptococcus, and in comparison of Cryptococcus Serotype A and Serotype D (currently be sequenced at Stanford University by Richard Hyman and Ron Davis, http://www-sequence.stanford.edu and at TIGR by Claire Fraser and Brendan Loftus http://www.tigr.org) and other serotypes. In the course of this work we are generating a significant amount of C. neoformans serotype A sequence and are delighted to make it available to the research community in a searchable format. Please note that this data is preliminary and that there are both sequence errors and assembly errors within the data.

For researchers interested in specific genes, please feel free to free to use this data; we only ask that you cite us in any publication including this data (C. neoformans H99 sequencing project, Duke IGSP Center for Applied Genomics & Technology, http://cgt.duke.edu/, and the Genome Sequence Centre, BC Cancer Research Centre, http://www.bcgsc.bc.ca/). Please consult the "Data Release Policy" at the BC Cancer Agency's website.

If you are interested in using this data for whole genome purposes such as whole genome expression analysis, whole genome comparisons, evolutionary studies, or structure of the Cryptococcus genome then we ask that you please contact us in regards for using this data for these types of projects. We have put a significant amount of effort into generating this data and appreciate your consideration.

( PROCEED )