in silico biology, inc.

Technology Information Site

Function Menu

IMC L01B Automatic Creation of Blast DBs by Loading Annotated Genome Sequence onto Reference Directory

This is a method to automatically generate the genome of a closely related species as a reference database for Blast search.

Menu: File-> Load Sequence Files to Reference…

  1. Load the genome (annotated) of the closely related species into the Reference Directory.
  2. Two types of Blast search databases are automatically generated when loading is completed.
  • Generates the base sequence of the entire genome as a nucleic acid DB.
  • Generated as an amino acid DB with each amino acid sequence of all CDSs as an entry.

The following is a batch compatibility search method using the above.

Menu: File-> Load Sequence File (s)

  1. Loads the unannotated genomic sequence into the Main Directory.
  2. If the loaded genomic base sequence is divided into multiple contigs, it is convenient to use the Join function to virtually create a single sequence. (See here for the Join function)

Menu: Analysis-> ORF Finder-> ORF Extraction…

  1. Perform ORF extraction on the unannotated genomic sequence that has been combined into one.
  2. Check Only longest ORFs are extracted….
  3. If the Search Range is highlighted in red, only part of the genome sequence is selected. If you want to analyze the total length, you need to change From To to specify the total length.
  4. When the ORF extraction is finished, the extracted ORF will be displayed on the main feature map.
  5. (There is also a method of mapping the gene identification result to the base sequence)

Menu: Analysis-> ORF Finder-> Translation

  1. Translation is performed on the result of ORF extraction.
  2. If you need to change the Codon Table, change its number
  3. When it finishes running, the ORF feature changes to a CDS feature.

Menu: Analysis-> Homology Search for Selected Feature Key

  1. Perform an homology search on the closely related genome loaded as the reference genome for each unannotated CDS on the amino acid-translated main feature map, automatically extract annotations from the hit subjects, and annotate. I will do it.

(Menu: Settings-> Feature Setting .. TAB: Auto Copy)

  1. (Use Feature Setting to fine-tune which information on the reference genome is extracted)
  2. When the execution is finished, the hit information is posted and annotated.

Menu: File-> Save as…

  1. Save the annotated file with a different name to save the results.
  2. (The result is automatically saved even if you do not save the file, but if you execute other analyzes repeatedly, it will be mixed with the result of the homology analysis.)