Skip to content
Research FLEXGene Informatics DNA Sequencing Publications Links Contact Us
Yeast FLEXGene

Team Leader - Gerald Marsischky

Introduction

For the past several decades, the yeast Saccharomyces cerevisiae has been the archetype genetic system for eukaryotic model organisms. The ease and power with which its genes can be manipulated have lead to many insights and fundamental discoveries that have impacted all of biology. Given its preeminence as a genetic model system, the relative availability of molecular biology tools for S. cerevisiae is surprisingly less well developed. Nevertheless, it is uniquely positioned for the development and utilization of genome-scale tools: its genome has been fully sequenced and is well annotated, it has a limited repertoire of genes, there are excellent tools for introducing and regulating the expression of cloned genes, and the organism is easy to grow and process in high-throughput experiments.

In order to create a genome-scale toolset for S. cerevisiae, the Yeast Group at HIP together with our collaborators (Richard Kolodner’s group at the Ludwig Institute for Cancer Research at the University of California, San Diego, and Andrew Simpson’s group in the Laboratory of Cancer Genetics at the Ludwig Institute for Cancer Research in Sao Paolo, Brazil) are creating the Yeast FLEXGene Collection, a comprehensive, sequence-verified collection of S. cerevisiae genes in a recombinational cloning format. Once the Collection has been created, it will be made available to the public and private research communities at the lowest possible cost.

At the time of the inception of the Yeast FLEXGene project, the Gateway cloning system from Life Technologies (now owned by Invitrogen) was the best-established recombinational cloning system, and as the only practical recombinational cloning system, was the obvious choice for the project. The strength of the recombinational cloning approach is that a single validated gene collection can be used to create many different gene expression libraries for any number of experiments. The Yeast FLEXGene Collection thus has the potential to become an invaluable tool for basic research. It should also be noted that the yeast FLEXGene project also serves as a prototype for the ongoing human FLEXGene project.

Progress Report

Cloning Effort

Gene specific PCR primers for the 6277 S. cerevisiae ORFs (Research Genetics) were used for high fidelity PCR to amplify all of the ORFs from genomic DNA utilizing a two-step PCR process resulting in the addition of Gateway recombinational cloning sequences to each PCR product.

Figure 1 Figure1. Two-Stage PCR for Gateway Cloning
 Enlarge

These PCR products were then gel purified, recombined into the Gateway donor vector, pDONR201, and transformed into bacteria in order to create Gateway Entry plasmids with an apparent overall cloning success rate of approximately 90%.

Figure 2Figure 2. Strategy for Yeast Clone Assembly
 Enlarge

The use of a PCR step in the production of the clones suggested the potential for the introduction of errors into the clones. To improve the chances of acquiring an error-free clone, 4 separate isolates for each gene were selected and processed into plasmid DNA, resulting in a total of 23,252 clones for 5813 genes. An unanticipated obstacle arising during the development of this project was the discovery that the recombination sites in the Entry vector prevent facile DNA sequencing from this vector. For this reason all 23,252 of the genes were subsequently transferred into a plasmid that was easier to sequence. As the vector chosen was a Gateway-modified galactose inducible yeast expression vector [Figure 3], this step also created yeast expression clones ready for immediate use in experiments.

Figure 3Figure 3. YeastExpression Plasmid pBY011
Enlarge

Sequencing Effort

The Yeast FLEXGene group at Ludwig Brazil has established an optimized high-throughput DNA sequencing method for the yeast expression clones produced by HIP. In the first round of sequence analysis, clones have been sequenced by end reads only. In the next round of sequencing, internal reads to determine full-length sequences for clones will be done with the 2 best clones identified from analysis of the end reads.

Ludwig Brazil has created a set of programs that analyze DNA sequence reads and establish whether the sequenced clones match the expected sequence for each gene. The results of this analysis are stored in a database created by Ludwig Brazil for this purpose. Ludwig Brazil has also created a web interface that allows other members of the Working Group to access the results of the sequence analysis. This reporting allows, for instance, HIP to re-array clones into 2 groups: fully sequenced clones (ready for use in experiments), and clones that require more sequencing. It also identifies the genes for which there is no acceptable clone.


Sequencing Results

Table 1. Summary of Sequencing Results from First Round

  Current (68%) Projected (100%)
Full sequence, good 930 1374
Partial sequence, mixed 2046 3023
Failed clones 1272 1880

At this time, approximately 68% of the yeast clones (representing an expected 4248 genes) have been sequenced and analyzed. From this analysis, clones that were fully sequenced and had no changes from the expected sequence were identified for 930 yeast genes (22%). Clones representing an additional 2046 yeast genes (48%) requiring further sequencing were also identified. No acceptable clones were identified for 1272 genes (30%); the majority (93%) of these failed clones represent false, or empty clones mostly from genes larger than 2000 nucleotides.

The projected totals after completion of the end read sequencing phase are 1374 fully sequenced clones with no sequence changes and 3023 partially sequenced genes. It is expected that 1880 genes will fail to produce acceptable clones in this round. However, as some of the partially sequenced clones have sequence changes, it is expected that for a fraction of the genes represented by this group there will be no acceptable clones. It would be reasonable to assume that an additional 500 genes might be lost in this way, resulting in the need to re-clone as many as 2500 genes. These genes will be re-cloned with improved methods in the finishing phase of the project. At this point, we have rearrayed 930 fully sequenced ORFs, and 1888 partially sequenced ORFs, for distribution to our collaborators as well as for experiments at HIP.

Finishing the Yeast FLEXGene Collection

Future work on the Yeast FLEXGene project will be divided into three phases: 1) a finishing phase of sequencing the Round 1 clones, 2) a finishing round of cloning, and 3) sequencing the finishing round clones. A final finishing round of cloning will, in all likelihood, be needed for the handful of genes that fail cloning a second time.



""
" "