Child pages
  • Gene-by-gene approach for characterization of genomes

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Migrated to Confluence 5.3

Suppose you have genomes and you want to characterize them. One of the ways to do that is to build a table of what genes are in each genome and what are not there.

  1. Create a local BLAST db of your genome sequence/contigs. One db per one genome.
  2. Create a file with sequences of genes you what to explore. This file will be the input file for the workflow.
  3. Setup location and name of BLAST db you created for the first genome.
  4. Setup output files: report location and output file with annotated (with BLAST) sequence. You might want to delete the "Write Sequence" element if you do not need output sequences.
  5. Run the workflow.
  6. Run the workflow on the same input and output files changing BLAST db for each genome that you have.

As the result you will get the report file. With "Yes" and "No" field. "Yes" answer means that the gene is in the genome. "No" answer MIGHT mean that there is no gene in the genome. It is a good idea to analyze all the "No" sequences using annotated files. Just open a file and find a sequence with a name of a gene that has "No" result.

HTML
<center>
  <br>
  <img src="/wiki/download/attachments/3244629/Gene-by-gene approach for characterization of genomes.png"/>
  <br> 
</center>

Also, if required, you can change parameters. Use the workflow wizard to guide you through the parameters setup process. The first wizard page will appear when you click on the Show wizard button on the Workflow Designer toolbar:

 

HTML
<center>
  <br>
  <img src="/wiki/download/attachments/3244589/Show_wizard.png"/>
  <br> 
</center>