Genome Analysis Software Info Site

Official Technology Information Site of in silico biology, inc.

Welcome, Guest
Username: Password: Secret Key Remember me
  • Page:
  • 1

TOPIC:

GT: What is the meaning of the two values of "Splitting Option" in the fragment mapping? 12 years 10 months ago #525

  • AgentUser-sp-1336637799
  • AgentUser-sp-1336637799's Avatar Topic Author
  • Online
  • Posts: 83
There are two option setting values of "Splitting Option" in the fragment mapping.
One is "Split Length", another is "Overlap Length".
Why and how do I set them.
Attachments:

Please Log in to join the conversation.

Last edit: by AgentUser-sp-1336637799.

Re: GT: What is the meaning of the two values of "Splitting Option" in the fragment mapping? 12 years 10 months ago #526

  • akr-sp-1212728882
  • akr-sp-1212728882's Avatar
  • Online
  • Posts: 395
Reference genome sequences have better to be divided into fragments of same size because such split reference sequences require less memory and fast loading time than its whole sequence. Each neighboring pair of fragments in the reference sequences have better to overlap each other because reads to be mapped on the edges of fragments might be ignored unless the overlap regions exist.

The split length should be determined by the memory size of the GT installed computer, and the average read count that are mapped on one split fragment. 100,000 reads per one split fragment is a recommended parameter as the split length in case of the 8GB memory.
The overlap length should be determined by the read length. In case of 50bp for each read, 50 is recommended parameter as the overlap length even though there is a possibility of multiple mapping on the overlapped regions of the both sides of neighboring fragment.

Please Log in to join the conversation.

  • Page:
  • 1
Time to create page: 0.044 seconds