Child pages
  • Search for TFBS

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Migrated to Confluence 5.3

This sample shows how to search for transcription factor binding sites (TFBS) using two different approaches - weight matrices and SITECON models - and write the found TFBS annotations into one output file.

The workflow steps are these:

  1. The workflow reads the input sequences.
  2. Each sequence goes to the TFBS searching elements.
  3. Read Weight Matrix reads the input weight matrices.  Read SITECON Model reads the input SITECON models. The data are also transferred to the TFBS searching elements.
  4. Each TFBS searching element produces the corresponding annotations.
  5. After that the two annotation data flows are multiplexed into one data flow.
  6. The multiplexed data and are written to the output file ("merged.gb", by default).

Tip
titleHow to Use This Sample

If you haven't used the workflow samples in UGENE before, look at the "How to Use Sample Workflows" section of the documentation.

Workflow Sample Location

The workflow sample "Search for TFBS" can be found in the "Data Merging" section of the Workflow Designer samples.

Workflow Image

The workflow looks as follows:

HTML
<center>
  <br>
  <img src="/wiki/download/attachments/16122698/Search for TFBS.png"/>
  <br> 
</center>
Workflow Wizard

The wizard has 3 pages.

  1. Input sequence(s): On this page you must input sequence(s). 

    HTML
    <center>
      <br>
      <img src="/wiki/download/attachments/16122698/Search for TFBS_1.png"/>
      <br> 
    </center>
  2. Search for TFBS parameters: On this page you can modify search for TFBS parameters. 

    HTML
    <center>
      <br>
      <img src="/wiki/download/attachments/16122698/Search for TFBS_2.png"/>
      <br> 
    </center>

    The following parameters are available:

    Weight MatrixSemicolon-separated list of paths to the input files.
    Result annotationAnnotation name for marking found regions.
    Search inWhich strands should be searched: direct, complement or both.

    Min score

    Minimum score to detect transcription factor binding site

     

    SITECON model

    Semicolon-separated list of paths to the input files.

     

    Result annotation

    Annotation name for marking found regions.

     

    Search in

    Which strands should be searched: direct, complement or both.

     

    Min score

    Minimum score to detect transcription factor binding site

     

    Min err1Alternative setting for filtering results, minimal value of Error type I.
    Note that all thresholds (by score, by err1 and by err2) are applied when filtering results.
    Max err2Alternative setting for filtering results, max value of Error type II.
    Note that all thresholds (by score, by err1 and by err2) are applied when filtering results.
  3. Output data: On this page you can modify output parameters. 

    HTML
    <center>
      <br>
      <img src="/wiki/download/attachments/16122698/Search for TFBS_3.png"/>
      <br> 
    </center>

    The following parameters are available:

    Result file

     Location of output data file. If this attribute is set, slot "Location" in port will not be used.

    Accumulate results

    Accumulate all incoming data in one file or create separate files for each input.In the latter case, an incremental numerical suffix is added to the file name.