Child pages
  • Searching NCBI Genbank
Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

Version 1 Next »

UGENE allows searching data in NCBI GenBank remote database. To do this open the following dialog by File->Search NCBI Genbank main menu:



 

The Nucleotide, Genome Survey Sequence (GSS), and Expressed Sequence Tag (EST) database all contain nucleic acid sequences. The data in GSS and EST are from two large bulk sequence divisions of GenBank. GSS and EST data are typically uncharacterized, short genomic (GSS) or cDNA (EST) sequences.

Searching any of the three databases will provide links to results in the other. Unless you know that you are trying to find a specific set of EST or GSS sequences, searching the Nucleotide database with general text queries will produce the most relevant results. You can always follow links to results in EST and GSS from the Nucleotide database results.

Image qsfig1.jpg

How do I use a simple query, such as a word or a phrase?

You can use a protein name, gene name, or gene symbol directly. Searching with a submitter or author name in the following format will produce the best results.

Smith JR (last name followed by initials, no punctuation)

Database identifiers such as accession numbers or gi numbers will directly retrieve the full sequence record.

CAA79696
NP_778203
263191547
BC043443
NM_002020

To find a match to an exact phrase, enclose it in quotation marks.

"contactin associated protein"
"duchenne muscular dystrophy"

How can I make my search more specific with Boolean operators (AND, OR, NOT)?

Use the Boolean operator AND to find records that contain every one of your search terms, the intersection of search results.

contactin AND neurofascin          Protein          Nucleotide 

Use the Boolean operator OR to find records that include one of several search terms, the union of search results.

contactin OR neurofascin           Protein          Nucleotide 

Use the Boolean operator NOT to exclude records matching a search term

contactin NOT neurofascin          Protein         Nucleotide

How do I restrict my search to specific subsets of records such as those from a specific organism, molecule type, source database, genomic or cDNA library name or properties?

You can use the Limits page to limit your search to only certain kinds of records. You can also use the Filter your results to select categories of records after a search. Follow these links to jump to the limit of interest: organismmolecule typesource databaselibrary name or properties.

Limits

Use the Limits page linked to top of any of the Protein, Nucleotide, GSS, or EST webpages to select the appropriate limit from the various pull-down lists.

 

 

 

To fetch data select the File ‣ Access remote database... item in the main menu.

The dialog will appear:



Here you need to enter unique id of the biological object and choose a database. Unique identifiers are different for various databases. For example, for NCBI GenBank such unique id could be Accession Number or NCBI GI number.

Optionally, you can browse for a directory to save the fetched file to.

After you click the OK button, UGENE downloads the biological object (DNA sequence, protein sequence, 3d model, etc.) and adds it to the current project.

If something goes wrong check the Log View, it will help you to diagnose the problem.

  • No labels