...

  1. The workflow reads sequences from the input sequence files (e.g. GenBank). The input data may also contain annotations associated with the sequences.
  2. The workflow reads text strings (patterns) from the input text files.
  3. The data are multiplexed using the Multiplexer element. The "1 to many" multiplexing rule is used, so each input sequence is concatenated with each pattern. The concatenated results are sent to the Find Substrings element.
  4. The Find Substrings element searches for the specified patterns in each sequence.
  5. The next element, Grouper, merges the annotations read for the sequence by the Read Sequence element with the annotations found for the sequence by the Find Substrings element. The sequence ID is used to group the matching sets of annotations.
  6. Finally, the data are written to the output file ("substrings.gb" by default).
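
The data flow in steps 3 to 5 can be illustrated with a minimal Python sketch. It is not the workflow's actual implementation: the function names (multiplex_one_to_many, find_substrings, group_by_sequence_id), the toy input data, the 0-based half-open coordinates, and the decision to report overlapping matches are all assumptions made for illustration. Reading the input files and writing "substrings.gb" are left out.

```python
from collections import defaultdict
from itertools import product


def multiplex_one_to_many(sequences, patterns):
    """Pair every sequence with every pattern (the "1 to many" rule)."""
    return list(product(sequences, patterns))


def find_substrings(sequence, pattern):
    """Return (start, end) for each occurrence of pattern, overlaps included.

    Coordinates are 0-based and half-open; this convention is an assumption.
    """
    hits = []
    start = sequence.find(pattern)
    while start != -1:
        hits.append((start, start + len(pattern)))
        start = sequence.find(pattern, start + 1)
    return hits


def group_by_sequence_id(existing, found):
    """Merge annotations per sequence ID: original ones first, then new hits."""
    merged = defaultdict(list)
    for seq_id, annotations in existing.items():
        merged[seq_id].extend(annotations)
    for seq_id, annotation in found:
        merged[seq_id].append(annotation)
    return dict(merged)


# Hypothetical toy input standing in for the sequence and pattern files.
sequences = [("seq1", "ACGTACGT"), ("seq2", "TTACGA")]
patterns = ["ACG", "TT"]
existing = {"seq1": [("gene", 0, 8)], "seq2": []}

# Steps 3 and 4: multiplex, then search each (sequence, pattern) pair.
found = []
for (seq_id, seq), pattern in multiplex_one_to_many(sequences, patterns):
    for start, end in find_substrings(seq, pattern):
        found.append((seq_id, (pattern, start, end)))

# Step 5: group the original and the newly found annotations by sequence ID.
result = group_by_sequence_id(existing, found)
```

With two sequences and two patterns, the multiplexing step yields four pairs, and the grouping step leaves one merged annotation list per sequence ID.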

...