IMC P05A How to Use Amino Acid Sequence Mapping

  • Map the amino acid sequence on the reference genome sequence and register it as a new feature
  • Map the specified amino acid sequence (s) to the reference genome currently displayed as a feature map, and register the hit entry as a new feature (such as mRNA).
  • The mapping uses the tBlastN algorithm.

How to operate amino acid sequence mapping

  1. Read the base sequence of the reference genome and display it on the feature map.
  2. IMC732 H301 0001
  3. Click Genome Analysis-> Mapping-> A.A. Sequence Mapping from the menu.
  4. IMC732 H301 0002
  5. A.A.Sequence Mapping execution dialog is displayed.
  6. IMC732 H301 0003
  7. Click Ref… of A.A.Sequence File (s) and specify the amino acid sequence to be mapped.
  8. The amino acid sequence that can be specified here is FastA format, GenPept format, etc. Files compressed with gz can also be specified.
  9. A dialog box for selecting an amino acid sequence appears.
  10. Specify the amino acid sequence file.
  11. IMC732 H301 0004
  12. Click the Run button.
  13. A confirmation message will be displayed.
  14. IMC732 H301 0005
  15. To change a parameter, click the Parameter button.
  16. IMC732 H301 0006
  17. This parameter is based on NCBI Blast parameter settings.
  18. Usually use the default parameter settings.
  19. Click the "Yes" button to start mapping.
  20. The execution time is proportional to the number and length of amino acid sequences and the base length of the reference genome base sequence.
  21. During execution, a message that indicates the processing progress is displayed.
  22. IMC732 H301 0009
  23. When the mapping process is completed, the result list dialog is displayed.

  24. IMC732 H301 0010
  25. Clicking on Alignment in each row will bring up a pairwise alignment dialog.
  26. IMC732 H301 0011
  27. Register a new feature using the result list dialog.
  28. Mapping results are classified into three types.
    • When the input amino acid perfectly matches the reference genome sequence
    • When the input amino acid incompletely matches the reference genome sequence
    • When the input amino acid sequence does not match at all
  29. The former two types of results can be registered as new features on the reference genome. For example, a perfect match sequence can be registered as an mRNA feature, and an
  30. incomplete match sequence can be registered as a miscRNA feature. It is also possible to specify another feature key such as CDS.
  31. Click Insert Feature.
  32. A confirmation message will be displayed.
  33. IMC732 H301 0013
  34. Click “Yes”.
  35. The Feature Setting dialog is displayed.
  36. IMC732 H301 0015
  37. The same value can be set to any qualifier of the newly registered feature. Also, you can register the prefix specified for Qualifier / locus_id and a serial number that has an arbitrary number of digits and starts with an arbitrary numerical value.
  38. IMC732 H301 0015
  39. In the case of eukaryotes with introns, exon and intron regions are identified and registered as features.
  40. You can limit the maximum base length of an intron.
  41. Click Run.
  42. When Feature registration is completed, a completion message is displayed.
  43. IMC732 H301 0016
  44. Click OK.
  45. The features registered in the main feature map are displayed.
  46. IMC732 H301 0017