| <<O>> Difference Topic SupplementaryInformation1 (r1.11 - 04 Dec 2003 - JeffreyBarrick) |
| Changed: | |
| < < |
Bacillus subtilis Riboswitch Candidates
Supplementary Information. |
| > > |
New RNA Motifs Suggest an Expanded Scope for Riboswitches in Bacterial Genetic Control (Supplementary Methods and Website)
Jeffrey E. Barrick, Keith A. Corbino, Wade C. Winkler, Ali Nahvi, Maumita Mandal, Jennifer Collins, Mark Lee, Adam Roth, Narasimhan Sudarsan, Inbal Jona, J. Kenneth Wickiser, and Ronald R. Breaker |
| <<O>> Difference Topic SupplementaryInformation1 (r1.10 - 21 Nov 2003 - JeffreyBarrick) |
| Changed: | |
| < < | We used version 2.2.5 of the BLAST package to compare Bacillus subtilis intergenic regions to intergenic region databases for every other genome. The program BLASTN was used with a word size of 7 nucleotides, a gap open penalty of 2, a gap extension penalty of 2, and a nucleotide penalty of 2 (-W 7 -G 2 -E 2 -q -2). These parameters were found to maximize the ratio of known positive comparisons (hits between two riboswitches) to known negative comparisons (hits between riboswitch intergenic regions and other IGRs) with a set of known riboswitch-containing IGRs within the B. subtilis genome (J.E.B., data not shown). BLAST results were symmetrized by taking the higher E-value for each pair of unidirectional hits between two intergenic regions. Bidirectional hits with E-values <= 0.01 were individually aligned to the B. subtilis sequence using the program ssearch34 from version 3.4 of the FASTA package with a gap opening penalty of 15 (-f -15). |
| > > | We used version 2.2.5 of the BLAST package to compare Bacillus subtilis intergenic regions to intergenic region databases for every other genome. The program BLASTN was run with a word size of 7 nucleotides, a gap opening penalty of 2, a gap extension penalty of 2, and a nucleotide mismatch penalty of 2 (-W 7 -G 2 -E 2 -q -2). These parameters were found to maximize the ratio of known positive comparisons (hits between two riboswitches) to known negative comparisons (hits between riboswitch intergenic regions and other IGRs) for a set of known riboswitch-containing IGRs within the B. subtilis genome (J.E.B., data not shown). BLAST results were symmetrized by taking the higher E-value for each pair of unidirectional hits between two intergenic regions. Bidirectional hits with E-values <= 0.01 were individually aligned to the B. subtilis sequence using the program ssearch34 from version 3.4 of the FASTA package (2) with a gap opening penalty of 15 (-f -15). |
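The symmetrization step described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build the database: the function name `symmetrize`, the input dictionary layout, and the IGR identifiers are hypothetical, and the sketch assumes that a pair with a hit in only one direction is discarded rather than kept.

```python
from typing import Dict, FrozenSet, Tuple

Hit = Tuple[str, str]  # (query IGR, subject IGR); identifiers are hypothetical


def symmetrize(evalues: Dict[Hit, float],
               cutoff: float = 0.01) -> Dict[FrozenSet[str], float]:
    """Score each IGR pair by the higher (worse) of its two unidirectional E-values.

    Assumption: `evalues` holds the best E-value found in each direction.
    Pairs lacking a hit in one direction are dropped, as are pairs whose
    symmetrized E-value exceeds `cutoff`.
    """
    pairs: Dict[FrozenSet[str], float] = {}
    for (query, subject), e_forward in evalues.items():
        if query == subject:
            continue  # ignore self-hits
        e_reverse = evalues.get((subject, query))
        if e_reverse is None:
            continue  # hit in one direction only, so not bidirectional
        e_pair = max(e_forward, e_reverse)
        if e_pair <= cutoff:
            pairs[frozenset((query, subject))] = e_pair
    return pairs
```

Each pair that survives would then be passed on to the pairwise alignment step with ssearch34.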
| Changed: | |
| < < | We used the COG database (September 2003) to uniformly assign gene functions to the genomic data sets (2, 3). Specifically, each annotated protein gene was filtered with the COILS 2.2 program (4) and compared to proteins in the COG database using BLASTPGP with default parameters. From these similarity results proteins were assigned to COGs by the local version of the COGnitor program. Proteins that are the results of gene fusions are often assigned to multiple COGs. |
| > > | We used the COG database (September 2003) to uniformly assign gene functions to the genomic data sets (3, 4). Specifically, each annotated protein gene was filtered with the COILS 2.2 program (5) and compared to proteins in the COG database using BLASTPGP with default parameters. From these similarity results, proteins were assigned to COGs by the local version of the COGnitor program. Proteins that result from gene fusions are often assigned to multiple COGs. |
| Changed: | |
| < < | Intrinsic terminators were predicted using a the software program TransTerm? available from TIGR (5). The source code was modified to (1) ignore distinctions between head-to-tail and tail-to-tail intergenic regions when scoring terminators and (2) leave separate confidence values for overlapping terminators on opposite strands. The altered source for "smooth_confidence.perl" is available. Terminators with >98% confidence are high quality predictions. |
| > > | Intrinsic terminators were predicted using the software program TransTerm, available from TIGR (6). The source code was modified to (1) ignore distinctions between head-to-tail and tail-to-tail intergenic regions when scoring terminators and (2) leave separate confidence values for overlapping terminators on opposite strands. The altered source for "smooth_confidence.perl" is available. Terminators with >98% confidence are high-quality predictions. |
| Changed: | |
| < < | We manually aligned the BLAST hits from the most promising candidate intergenic regions to create initial models for each putative RNA structure. Matches of these RNA motifs consisting of blocks of consensus sequences and base pairing to our curated set of complete genomes were compiled using the program SequenceSniffer? (J.E.B., unpublished program). This program displays the relation of matches to nearby genes so that sequences regulating related genes are readily recognized. The published RNAMotif program allows more general searching for RNA patterns (6). Further BLAST matches to the conserved sequence families were found in other organisms with the NCBI Microbial BLAST page. We iteratively relaxed the RNA consensus model with each expanded alignment and repeated these searches until no new matches could be found. |
| > > | We manually aligned the BLAST hits from the most promising candidate intergenic regions to create initial models for each putative RNA structure. Each RNA motif model consists of blocks of consensus sequence and base pairing. Matches to these motifs in our curated set of complete genomes were compiled using the program SequenceSniffer (J.E.B., unpublished program). This program displays the relation of matches to nearby genes so that sequences regulating related genes are readily recognized. The published RNAMotif program allows more general searching for RNA patterns (7). Further BLAST matches to the conserved sequence families were found in other organisms with the NCBI Microbial BLAST page. We iteratively relaxed the RNA consensus model with each expanded alignment and repeated these searches until no new matches could be found. |
| Changed: | |
| < < | 2. Tatusov, R.L., Koonin, E.V., and Lipman, D.J. 1997. A genomic perspective on protein families. Science 278: 631-637. |
| > > | 2. Pearson, W.R. 2000. Flexible Sequence Similarity Searching with the FASTA3 Program Package. In Bioinformatics Methods and Protocols. (eds. S. Misener, and S.A. Krawetz), pp. 185-219. Humana Press, Totowa, NJ. |
| Changed: | |
| < < | 3. Tatusov, R.L., Natale, D.A., Garkavtsev, I.V., Tatusova, T.A., Shankavaram, U.T., Rao, B.S., Kiryutin, B., Galperin, M.Y., Fedorova, N.D., and Koonin, E.V. 2001. The COG database: New developments in phylogenetic classification of proteins from complete genomes. Nucleic Acids Res. 29: 22-28. |
| > > | 3. Tatusov, R.L., Koonin, E.V., and Lipman, D.J. 1997. A genomic perspective on protein families. Science 278: 631-637. |
| Changed: | |
| < < | 4. Lupas, A. 1996. Prediction and analysis of coiled-coil structures. Method Enzymol 266: 513-525. |
| > > | 4. Tatusov, R.L., Natale, D.A., Garkavtsev, I.V., Tatusova, T.A., Shankavaram, U.T., Rao, B.S., Kiryutin, B., Galperin, M.Y., Fedorova, N.D., and Koonin, E.V. 2001. The COG database: New developments in phylogenetic classification of proteins from complete genomes. Nucleic Acids Res. 29: 22-28. |
| Changed: | |
| < < | 5. Ermolaeva, M.D., Khalak, H.G., White, O., Smith, H.O., and Salzberg, S.L. 2000. Prediction of transcription terminators in bacterial genomes. J. Mol. Biol. 301: 27-33. |
| > > | 5. Lupas, A. 1996. Prediction and analysis of coiled-coil structures. Methods Enzymol. 266: 513-525. |
| Changed: | |
| < < | 6. Macke, T.J., Ecker, D.J., Gutell, R.R., Gautheret, D., Case, D.A., and Sampath, R. 2001. RNAMotif, an RNA secondary structure definition and search algorithm. Nucleic Acids Res. 29: 4724-4735. |
| > > | 6. Ermolaeva, M.D., Khalak, H.G., White, O., Smith, H.O., and Salzberg, S.L. 2000. Prediction of transcription terminators in bacterial genomes. J. Mol. Biol. 301: 27-33. 7. Macke, T.J., Ecker, D.J., Gutell, R.R., Gautheret, D., Case, D.A., and Sampath, R. 2001. RNAMotif, an RNA secondary structure definition and search algorithm. Nucleic Acids Res. 29: 4724-4735. |
| <<O>> Difference Topic SupplementaryInformation1 (r1.9 - 14 Nov 2003 - JeffreyBarrick) |
| Changed: | |
| < < | The BLISS database integrates comparative genomics information to enable riboswitch discovery in Bacillus subtilis. It begins with automatically generated alignments seeded by BLAST hits between intergenic regions (IGRs) from fully sequenced bacterial genomes and incorporates uniform predictions of gene functions and intrinsic terminators. A web interface to the database allows intergenic regions to be sorted based on genome position or statistics derived from its sequence alignment. An integrated system for collaborative annotation faciliates the exhaustive manual examination of these IGRs for riboswitch candidates. |
| > > | The BLISS database integrates comparative genomics information to enable riboswitch discovery in Bacillus subtilis. It begins with automatically generated alignments seeded by BLAST hits between intergenic regions (IGRs) from fully sequenced bacterial genomes and incorporates uniform predictions of gene functions and intrinsic terminators. A web interface to the database allows intergenic regions to be sorted by genome position or by statistics derived from their sequence alignments. An integrated system for collaborative annotation facilitates the exhaustive manual examination of these IGRs for riboswitch candidates. |
| Changed: | |
| < < | The BLISS database links intergenic regions to the TWiki collaboration tool. TWiki allows webpages to be edited by registered users and supports full version control to record a history of all page edits. A separate TWiki webpage for each intergenic region is automatically generated by BLISS when a user chooses to add annotation. Keywords within these pages are recognized to priminently display information on the sortable list of IGRs. Our lab has used these pages to record known riboswitches, transcription-factor binding sites, T boxes, noncoding RNAs and other sequence features in B. subtilis IGRs that cause clusters of BLAST hits. Every intergenic region with at least We have also annotated literature references, observations, and less-tangibl as well as less-tangible ratings of conservation for. intangible observations, references, and coordinate an effort to exhaustively examine each IGR for a riboswitch. |
| > > | The BLISS database links intergenic regions to the open source TWiki collaboration tool. TWiki allows webpages to be edited by any registered user and supports full version control to record a history of all page edits. BLISS generates a separate TWiki webpage for each intergenic region automatically when a user chooses to add annotation. Keywords within these pages are recognized by the web interface to prominently display information on the sortable list of IGRs. Our lab has used these pages to record known riboswitches, transcription-factor binding sites, T boxes, noncoding RNAs and other sequence features in B. subtilis IGRs that cause clusters of BLAST hits. Every remaining intergenic region alignment with at least 5 sequences has been examined for conservation indicative of a regulatory RNA motif. |
| Changed: | |
| < < |
RNA Phylogenies
The most promising candidates. SequenceSniffer?. Genomic BLAST. |
| > > |
Candidate Phylogenies |
| Changed: | |
| < < | SequenceSniffer? allows simple RNA motif searching for blocks of consensus sequence and base pairing. resulting matches with nearby genes. BLAST searches for members of conserved sequence families in other organisms were conducted using the NCBI Microbial BLAST page. |
| > > | We manually aligned the BLAST hits from the most promising candidate intergenic regions to create initial models for each putative RNA structure. Matches of these RNA motifs consisting of blocks of consensus sequences and base pairing to our curated set of complete genomes were compiled using the program SequenceSniffer? (J.E.B., unpublished program). This program displays the relation of matches to nearby genes so that sequences regulating related genes are readily recognized. The published RNAMotif program allows more general searching for RNA patterns (6). Further BLAST matches to the conserved sequence families were found in other organisms with the NCBI Microbial BLAST page. We iteratively relaxed the RNA consensus model with each expanded alignment and repeated these searches until no new matches could be found. |
| Changed: | |
| < < | 1. Salgado, H., Moreno-Hagelsieb, G., Smith, T.F., and Collado-Vides, J. 2000. Operons in Escherichia coli: Genomic analyses and predictions. Proc. Natl. Acad. Sci. U. S. A. 97: 6652-6657. |
| > > | 1. Salgado, H., Moreno-Hagelsieb, G., Smith, T.F., and Collado-Vides, J. 2000. Operons in Escherichia coli: Genomic analyses and predictions. Proc. Natl. Acad. Sci. U. S. A. 97: 6652-6657. |
| Changed: | |
| < < | |
| > > | 6. Macke, T.J., Ecker, D.J., Gutell, R.R., Gautheret, D., Case, D.A., and Sampath, R. 2001. RNAMotif, an RNA secondary structure definition and search algorithm. Nucleic Acids Res. 29: 4724-4735. |
| Added: | |
| > > | |
| <<O>> Difference Topic SupplementaryInformation1 (r1.8 - 14 Nov 2003 - JeffreyBarrick) |
| Changed: | |
| < < | Links |
| > > |
Links |
| Changed: | |
| < < | |
| > > | |
| Added: | |
| > > |
BLISS Overview
The BLISS database integrates comparative genomics information to enable riboswitch discovery in Bacillus subtilis. It begins with automatically generated alignments seeded by BLAST hits between intergenic regions (IGRs) from fully sequenced bacterial genomes and incorporates uniform predictions of gene functions and intrinsic terminators. A web interface to the database allows intergenic regions to be sorted based on genome position or statistics derived from its sequence alignment. An integrated system for collaborative annotation faciliates the exhaustive manual examination of these IGRs for riboswitch candidates. |
| Changed: | |
| < < | A complete list of the genomes analyzed is available. Genome sequences were downloaded in Genbank format from the NCBI bacterial reference sequence list. BLAST searches for members of conserved sequence families in other organisms were conducted using the NCBI Microbial BLAST page. Organisms were classified into broad taxonomic groups based on the information in Genbank records and the Complete Microbial Resource at TIGR. Our three-letter organism abbreviations are derived from the COG database when possible. |
| > > | A complete list of the genomes analyzed is available. Genome sequences were downloaded in GenBank format from the NCBI bacterial reference sequence list. Genes on the same strand separated by fewer than 30 nt are usually part of the same transcriptional unit (1), and such short gaps are not large enough to harbor structured RNA sequences. Therefore, we only considered IGRs with a length of at least 30 nt. Organisms were classified into broad taxonomic groups based on the information in GenBank records and the Complete Microbial Resource at TIGR. Our three-letter organism abbreviations are derived from the COG database when possible. |
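The 30 nt length cutoff can be illustrated with a short Python sketch. It is a simplified illustration under stated assumptions, not the pipeline used for BLISS: the function name `intergenic_regions` and the coordinate tuples are hypothetical, gene strand is ignored, and the regions before the first gene and across the origin of a circular chromosome are not considered.

```python
from typing import List, Tuple

Interval = Tuple[int, int]  # 1-based, inclusive (start, end) coordinates


def intergenic_regions(genes: List[Interval],
                       min_length: int = 30) -> List[Interval]:
    """Return the gaps between consecutive genes that are at least `min_length` nt long.

    Assumptions: strand is ignored, overlapping genes are tolerated, and the
    sequence upstream of the first gene is skipped.
    """
    igrs: List[Interval] = []
    previous_end = 0  # 0 means no gene has been seen yet
    for start, end in sorted(genes):
        gap_start, gap_end = previous_end + 1, start - 1
        gap_length = gap_end - gap_start + 1
        if previous_end > 0 and gap_length >= min_length:
            igrs.append((gap_start, gap_end))
        # Track the furthest gene end so overlapping genes do not create false gaps.
        previous_end = max(previous_end, end)
    return igrs
```

A gap of exactly 30 nt is kept, which matches the "at least 30 nt" wording above.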
| Changed: | |
| < < |
BLAST Comparisons of Intergenic Regions |
| > > |
IGR Sequence Comparisons |
| Changed: | |
| < < | We used version 2.2.5 of the BLAST package to compare Bacillus subtilis intergenic regions to intergenic region databases for every other genome. The program BLASTN was used with a word size of 7 nucleotides, a gap open penalty of 2, a gap extension penalty of 2, and a nucleotide penalty of 2 (-W 7 -G 2 -E 2 -q -2). These parameters were found to maximize the ratio of known positive comparisons (hits between two riboswitches) to known negative comparisons (hits between riboswitch intergenic regions and other IGRs) with a set of known riboswitch containing IGRs within the B. subtilis genome (J.E.B., data not shown). BLAST results were symmetrized by taking the higher E-value for each pair of unidirectional hits between two intergenic regions. Bidirectional hits with E-values <= 0.01 were individually aligned to the B. subtilis sequence using the program ssearch34 from version 3.4 of the FASTA package with a gap opening penalty of 15 (-f -15). |
| > > | We used version 2.2.5 of the BLAST package to compare Bacillus subtilis intergenic regions to intergenic region databases for every other genome. The program BLASTN was used with a word size of 7 nucleotides, a gap open penalty of 2, a gap extension penalty of 2, and a nucleotide penalty of 2 (-W 7 -G 2 -E 2 -q -2). These parameters were found to maximize the ratio of known positive comparisons (hits between two riboswitches) to known negative comparisons (hits between riboswitch intergenic regions and other IGRs) with a set of known riboswitch-containing IGRs within the B. subtilis genome (J.E.B., data not shown). BLAST results were symmetrized by taking the higher E-value for each pair of unidirectional hits between two intergenic regions. Bidirectional hits with E-values <= 0.01 were individually aligned to the B. subtilis sequence using the program ssearch34 from version 3.4 of the FASTA package with a gap opening penalty of 15 (-f -15). |
| Changed: | |
| < < |
COGnitor Assignment of Genes |
| > > |
Gene Function Predictions |
| Changed: | |
| < < | We used the COG database (September 2003) to uniformly assign gene functions to the genomic data sets (1, 2). Specifically, each annotated protein gene was filtered with the COILS 2.2 program (3) and compared to proteins in the COG database using BLASTPGP with default parameters. From these similarity results proteins were assigned to COGs by the local version of the COGnitor program. Proteins that are the results of gene fusions are often assigned to multiple COGs. |
| > > | We used the COG database (September 2003) to uniformly assign gene functions to the genomic data sets (2, 3). Specifically, each annotated protein gene was filtered with the COILS 2.2 program (4) and compared to proteins in the COG database using BLASTPGP with default parameters. From these similarity results proteins were assigned to COGs by the local version of the COGnitor program. Proteins that are the results of gene fusions are often assigned to multiple COGs. |
| Changed: | |
| < < |
Terminator Prediction |
| > > |
Terminator Predictions
Intrinsic terminators were predicted using a the software program TransTerm? available from TIGR (5). The source code was modified to (1) ignore distinctions between head-to-tail and tail-to-tail intergenic regions when scoring terminators and (2) leave separate confidence values for overlapping terminators on opposite strands. The altered source for "smooth_confidence.perl" is available. Terminators with >98% confidence are high quality predictions.
IGR Annotation
The BLISS database links intergenic regions to the TWiki collaboration tool. TWiki allows webpages to be edited by registered users and supports full version control to record a history of all page edits. A separate TWiki webpage for each intergenic region is automatically generated by BLISS when a user chooses to add annotation. Keywords within these pages are recognized to priminently display information on the sortable list of IGRs. Our lab has used these pages to record known riboswitches, transcription-factor binding sites, T boxes, noncoding RNAs and other sequence features in B. subtilis IGRs that cause clusters of BLAST hits. Every intergenic region with at least We have also annotated literature references, observations, and less-tangibl |
| Changed: | |
| < < | Intrinsic terminators were predicted using a the software program TransTerm? available from TIGR (4). The source code was modified to (1) ignore distinctions between head-to-tail and tail-to-tail intergenic regions when scoring terminators and (2) leave separate confidence values for overlapping terminators on opposite strands. The altered source for "smooth_confidence.perl" is available. Terminators with >98% confidence are high quality predictions. |
| > > | as well as less-tangible ratings of conservation for. |
| Changed: | |
| < < |
BLISS Annotation |
| > > | intangible observations, references, and coordinate an effort to exhaustively examine each IGR for a riboswitch. The archived pages presented here have editing disabled so that a snapshot of the annotation process has been preserved. See the current BLISS pages for the ongoing annotation effort. |
| Changed: | |
| < < |
BLISS Explanation |
| > > | The most promising candidates. SequenceSniffer?. Genomic BLAST. SequenceSniffer? allows simple RNA motif searching for blocks of consensus sequence and base pairing. resulting matches with nearby genes. BLAST searches for members of conserved sequence families in other organisms were conducted using the NCBI Microbial BLAST page. |
| Changed: | |
| < < | 1. Tatusov, R.L., Koonin, E.V., and Lipman, D.J. 1997. A genomic perspective on protein families. Science 278: 631-637. |
| > > | 1. Salgado, H., Moreno-Hagelsieb, G., Smith, T.F., and Collado-Vides, J. 2000. Operons in Escherichia coli: Genomic analyses and predictions. Proc. Natl. Acad. Sci. U. S. A. 97: 6652-6657. |
| Changed: | |
| < < | 2. Tatusov, R.L., Natale, D.A., Garkavtsev, I.V., Tatusova, T.A., Shankavaram, U.T., Rao, B.S., Kiryutin, B., Galperin, M.Y., Fedorova, N.D., and Koonin, E.V. 2001. The COG database: New developments in phylogenetic classification of proteins from complete genomes. Nucleic Acids Res. 29: 22-28. |
| > > | 2. Tatusov, R.L., Koonin, E.V., and Lipman, D.J. 1997. A genomic perspective on protein families. Science 278: 631-637. |
| Changed: | |
| < < | 3. Lupas, A. 1996. Prediction and analysis of coiled-coil structures. Method Enzymol 266: 513-525. |
| > > | 3. Tatusov, R.L., Natale, D.A., Garkavtsev, I.V., Tatusova, T.A., Shankavaram, U.T., Rao, B.S., Kiryutin, B., Galperin, M.Y., Fedorova, N.D., and Koonin, E.V. 2001. The COG database: New developments in phylogenetic classification of proteins from complete genomes. Nucleic Acids Res. 29: 22-28. |
| Changed: | |
| < < | 4. Ermolaeva, M.D., Khalak, H.G., White, O., Smith, H.O., and Salzberg, S.L. 2000. Prediction of transcription terminators in bacterial genomes. J. Mol. Biol. 301: 27-33. |
| > > | 4. Lupas, A. 1996. Prediction and analysis of coiled-coil structures. Method Enzymol 266: 513-525. 5. Ermolaeva, M.D., Khalak, H.G., White, O., Smith, H.O., and Salzberg, S.L. 2000. Prediction of transcription terminators in bacterial genomes. J. Mol. Biol. 301: 27-33. |
| <<O>> Difference Topic SupplementaryInformation1 (r1.7 - 03 Nov 2003 - JeffreyBarrick) |
| Changed: | |
| < < | We used version 2.2.5 of the BLAST package to compare Bacillus subtilis intergenic regions to intergenic region databases for every other genome. The program BLASTN was used with a word size of 7 nucleotides, a gap open penalty of 2, a gap extension penalty of 2, and a nucleotide penalty of 2 (-W 7 -G 2 -E 2 -q -2). These parameters were found to maximize the ratio of known positive comparisons (hits between two riboswitches) to known negative comparisons (hits between riboswitch intergenic regions and other IGRs) with a set of known riboswitch containing IGRs within the B. subtilis genome (J.E.B., unpublished data). BLAST results were symmetrized by taking the higher E-value for each pair of unidirectional hits between two intergenic regions. Bidirectional hits with E-values <= 0.01 were individually aligned to the B. subtilis sequence using the program ssearch34 from version 3.4 of the FASTA package with a gap opening penalty of 15 (-f -15). |
| > > | We used version 2.2.5 of the BLAST package to compare Bacillus subtilis intergenic regions to intergenic region databases for every other genome. The program BLASTN was used with a word size of 7 nucleotides, a gap open penalty of 2, a gap extension penalty of 2, and a nucleotide penalty of 2 (-W 7 -G 2 -E 2 -q -2). These parameters were found to maximize the ratio of known positive comparisons (hits between two riboswitches) to known negative comparisons (hits between riboswitch intergenic regions and other IGRs) with a set of known riboswitch containing IGRs within the B. subtilis genome (J.E.B., data not shown). BLAST results were symmetrized by taking the higher E-value for each pair of unidirectional hits between two intergenic regions. Bidirectional hits with E-values <= 0.01 were individually aligned to the B. subtilis sequence using the program ssearch34 from version 3.4 of the FASTA package with a gap opening penalty of 15 (-f -15). |
| <<O>> Difference Topic SupplementaryInformation1 (r1.6 - 09 Oct 2003 - JeffreyBarrick) |
| Changed: | |
| < < |
|
| > > | |
| Changed: | |
| < < | |
| > > | We used version 2.2.5 of the BLAST package to compare Bacillus subtilis intergenic regions to intergenic region databases for every other genome. The program BLASTN was used with a word size of 7 nucleotides, a gap open penalty of 2, a gap extension penalty of 2, and a nucleotide penalty of 2 (-W 7 -G 2 -E 2 -q -2). These parameters were found to maximize the ratio of known positive comparisons (hits between two riboswitches) to known negative comparisons (hits between riboswitch intergenic regions and other IGRs) with a set of known riboswitch containing IGRs within the B. subtilis genome (J.E.B., unpublished data). BLAST results were symmetrized by taking the higher E-value for each pair of unidirectional hits between two intergenic regions. Bidirectional hits with E-values <= 0.01 were individually aligned to the B. subtilis sequence using the program ssearch34 from version 3.4 of the FASTA package with a gap opening penalty of 15 (-f -15). |
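For readers who want to reproduce a comparable search, the quoted options can be assembled into a command line as sketched below. This is an assumption-laden illustration: the text gives only the option string, so the use of the legacy `blastall` front end (the usual way to run BLASTN in the 2.2.x releases) and the input, database, and output names are hypothetical. The function only builds the argument list and does not run BLAST.

```python
from typing import List


def blastn_command(query_fasta: str, database: str, output: str) -> List[str]:
    """Build an argument list for a legacy BLASTN run with the parameters quoted above.

    Assumption: the search is launched through `blastall -p blastn`; the
    three file and database names are placeholders supplied by the caller.
    """
    return [
        "blastall", "-p", "blastn",
        "-i", query_fasta,  # query sequences (e.g. B. subtilis IGRs)
        "-d", database,     # formatted IGR database for another genome
        "-o", output,       # report file
        "-W", "7",          # word size
        "-G", "2",          # gap opening penalty
        "-E", "2",          # gap extension penalty
        "-q", "-2",         # nucleotide mismatch penalty
    ]
```

The resulting list could be handed to `subprocess.run` on a system where the legacy BLAST package is installed.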
| Changed: | |
| < < | We used the COG database (September 2003) to uniformly assign gene functions to the genomic data sets (1, 2). Specifically, each annotated protein gene was filtered with the COILS2 program (3) and compared to proteins in the COG database using BLASTPGP with default parameters. Proteins were assigned to COGs by the local version of the COGnitor program from these similarity results. Proteins that are the results of gene fusions are often assigned to multiple COGs. |
| > > | We used the COG database (September 2003) to uniformly assign gene functions to the genomic data sets (1, 2). Specifically, each annotated protein gene was filtered with the COILS 2.2 program (3) and compared to proteins in the COG database using BLASTPGP with default parameters. From these similarity results proteins were assigned to COGs by the local version of the COGnitor program. Proteins that are the results of gene fusions are often assigned to multiple COGs. |
| Changed: | |
| < < | Descriptive gene names for each COG were assigned from the "whog" file with the following priority: (1) If the COG contains an E. coli gene then use this name, (2) If the COG contains a B. subtilis gene then use this name, (3) Otherwise do not assign a name, designated with a dash. |
| > > | Gene descriptions and names for each COG were derived from the "whog" file of the database distribution. Gene names were assigned from the identified genes in a COG with the following priority: (1) if the COG contains an E. coli gene, use its name; (2) otherwise, if the COG contains a B. subtilis gene, use its name; (3) otherwise, do not assign a name (designated with a dash). |
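The naming priority can be written as a small lookup. The sketch below is illustrative only: the function name `cog_gene_name` and the input dictionary are hypothetical, parsing of the "whog" file is not shown, and the organism codes "Eco" and "Bsu" are assumed to be the abbreviations used for E. coli and B. subtilis.

```python
from typing import Dict

# Assumed organism abbreviations, listed in priority order.
PRIORITY = ("Eco", "Bsu")


def cog_gene_name(members: Dict[str, str]) -> str:
    """Pick a descriptive gene name for one COG.

    `members` maps an organism abbreviation to the name of that organism's
    gene in the COG (a hypothetical structure for this sketch).
    """
    for organism in PRIORITY:
        name = members.get(organism)
        if name:
            return name
    return "-"  # no E. coli or B. subtilis member, so no name is assigned
```

Keeping the priority order in a single tuple makes it easy to see, and to change, which organism's nomenclature wins.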
| Added: | |
| > > |
BLISS Annotation
RNA Phylogenies |
| Added: | |
| > > | %META:FILEATTACHMENT{name="smooth_confidence.perl" attr="h" comment="" date="1065736944" path="smooth_confidence.perl" size="26098" user="JeffreyBarrick" version="1.1"}% |
| <<O>> Difference Topic SupplementaryInformation1 (r1.5 - 09 Oct 2003 - JeffreyBarrick) |
| Added: | |
| > > |
|
| Changed: | |
| < < | Supplementary Information for [PaperX]. |
| > > | Supplementary Information. |
| Deleted: | |
| < < | Methods |
| Changed: | |
| < < |
COGNITOR Assignment of Genes |
| > > |
COGnitor Assignment of Genes
We used the COG database (September 2003) to uniformly assign gene functions to the genomic data sets (1, 2). Specifically, each annotated protein gene was filtered with the COILS2 program (3) and compared to proteins in the COG database using BLASTPGP with default parameters. Proteins were assigned to COGs by the local version of the COGnitor program from these similarity results. Proteins that are the results of gene fusions are often assigned to multiple COGs. Descriptive gene names for each COG were assigned from the "whog" file with the following priority: (1) If the COG contains an E. coli gene then use this name, (2) If the COG contains a B. subtilis gene then use this name, (3) Otherwise do not assign a name, designated with a dash. |
| Added: | |
| > > | Intrinsic terminators were predicted using a the software program TransTerm? available from TIGR (4). The source code was modified to (1) ignore distinctions between head-to-tail and tail-to-tail intergenic regions when scoring terminators and (2) leave separate confidence values for overlapping terminators on opposite strands. The altered source for "smooth_confidence.perl" is available. Terminators with >98% confidence are high quality predictions. |
| Added: | |
| > > |
References
1. Tatusov, R.L., Koonin, E.V., and Lipman, D.J. 1997. A genomic perspective on protein families. Science 278: 631-637.
2. Tatusov, R.L., Natale, D.A., Garkavtsev, I.V., Tatusova, T.A., Shankavaram, U.T., Rao, B.S., Kiryutin, B., Galperin, M.Y., Fedorova, N.D., and Koonin, E.V. 2001. The COG database: New developments in phylogenetic classification of proteins from complete genomes. Nucleic Acids Res. 29: 22-28.
3. Lupas, A. 1996. Prediction and analysis of coiled-coil structures. Method Enzymol 266: 513-525.
4. Ermolaeva, M.D., Khalak, H.G., White, O., Smith, H.O., and Salzberg, S.L. 2000. Prediction of transcription terminators in bacterial genomes. J. Mol. Biol. 301: 27-33. |
| <<O>> Difference Topic SupplementaryInformation1 (r1.4 - 02 Oct 2003 - JeffreyBarrick) |
| Changed: | |
| < < | A complete list of the genomes analyzed is available. Genome sequences were downloaded in Genbank format from the NCBI bacterial reference sequence list. BLAST searches for members of conserved sequence families in other organisms were conducted using the NCBI Microbial BLAST page. Organisms were classified into broad taxonomic groups based on the information in Genbank records and the Complete Microbial Resource at TIGR. Our three-letter organism abbreviations are derived from the COG database when possible. |
| > > | A complete list of the genomes analyzed is available. Genome sequences were downloaded in Genbank format from the NCBI bacterial reference sequence list. BLAST searches for members of conserved sequence families in other organisms were conducted using the NCBI Microbial BLAST page. Organisms were classified into broad taxonomic groups based on the information in Genbank records and the Complete Microbial Resource at TIGR. Our three-letter organism abbreviations are derived from the COG database when possible. |
| <<O>> Difference Topic SupplementaryInformation1 (r1.3 - 28 Sep 2003 - JeffreyBarrick) |
| Changed: | |
| < < |
Bacillus subtilis Riboswitch Candidates |
| > > |
Bacillus subtilis Riboswitch Candidates |
| Added: | |
| > > |
Contents |
| Changed: | |
| < < |
|
| > > |
|
| Added: | |
| > > |
|
| Changed: | |
| < < | |
| > > |
Genome Sequences
A complete list of the genomes analyzed is available. Genome sequences were downloaded in Genbank format from the NCBI bacterial reference sequence list. BLAST searches for members of conserved sequence families in other organisms were conducted using the NCBI Microbial BLAST page. Organisms were classified into broad taxonomic groups based on the information in Genbank records and the Complete Microbial Resource at TIGR. Our three-letter organism abbreviations are derived from the COG database when possible.
BLAST Comparisons of Intergenic Regions
COGNITOR Assignment of Genes
Terminator Prediction |
| Added: | |
| > > |
BLISS Explanation |
| <<O>> Difference Topic SupplementaryInformation1 (r1.2 - 23 Sep 2003 - JeffreyBarrick) |
| Changed: | |
| < < |
|
| > > | |
| <<O>> Difference Topic SupplementaryInformation1 (r1.1 - 22 Sep 2003 - JeffreyBarrick) |
| Added: | |
| > > |
%META:TOPICINFO{author="JeffreyBarrick" date="1064266117" format="1.0" version="1.1"}%
%META:TOPICPARENT{name="WebHome"}%
Bacillus subtilis Riboswitch Candidates
Supplementary Information for [PaperX].
Links
Revision r1.1 - 22 Sep 2003 - 21:28 GMT - JeffreyBarrick Revision r1.11 - 04 Dec 2003 - 16:16 GMT - JeffreyBarrick |