IMC D1B6 Load GBFF Format Files

How to load GBFF file into IMC.

The GBFF file is a compressed file containing a large number of GenBank format files and consists of two files.

One has a file extension "*. gbff.gz" in the definition file (eg bacteria.1002.genomic.gbff.gz).

The other has a file extension "*. fna.gz" in the sequence file (eg bacteria.1002.1.genomic.fna.gz).

Operation

  1. Save the above two files in an arbitrary place.
  2. From the menu, click "Tools -> Multi GenBank File Expander ...".
  3. The "Expand Setting" dialog is displayed.
  4. For "Multi GenBank Format File" text field,  click "Ref ..." to specify the saved "*. gbff.gz" file.
  5. For "fna File" text field, click "Ref ..." to specify the saved "*. fna.gz" file.
  6. For "Expand Directory" text field, specify the directory where you want to expand the GBFF file.
  7. Click "Run".
  8. A confirmation message "Start Expand?" is displayed.
  9. Click "Yes".
  10. Execution starts, and a progress message is displayed during execution.
  11. When the progress message is closed, expansion is completed.
  12. Following taxonomy below the directory specified for expansion, it is expanded hierarchically.

Loading to the IMC is as follows.

  1. Right-click on the main directory tree root.
  2. The menu will be displayed.
  3. Click "Change Root Directory ...".
  4. A confirmation message "Change Root Diretory?" is displayed.
  5. Click "Yes".
  6. The file selection dialog is displayed.
  7. Specify the directory where you previously specified expansion.
  8. The expanded directory is changed to the root directory.
  9. Click + mark to expand the tree manually.
  10. When the sequence node is displayed, click on one "*. gbk" file.
  11. A confirmation message "Change Current Directory?" is displayed.
  12. If the directory to which the sequence belongs contains a very large number of files, loading takes time.
  13. In such a case, it is recommended to manually split the directory.
  14.  
  15. Click "Yes".
  16. The directory containing the sequence becomes the current directory and all files belonging to it are loaded.
  17. While loading, a progress message will be displayed.
  18. The clicked gbk file becomes the current file and it is displayed in the main feature map.