Short Oligonucleotide Analysis Package: Difference between revisions
Revision as of 07:44, 9 January 2015
SOAP (Short Oligonucleotide Analysis Package) is a suite of bioinformatics software tools enabling the assembly, alignment, and analysis of next generation DNA sequencing data. It is particularly suited to short read data.
All programs in the SOAP package may be used free of charge and are distributed under the GPL.
Functionality
The SOAP package can be used to perform the following tasks:
Sequence Alignment
SOAPaligner is designed for fast alignment of short reads and compares favorably with similar alignment tools such as Bowtie and MAQ [1].
Genome Assembly
SOAPdenovo is a short-read de novo assembler. It is optimized for short reads such as those generated by Illumina sequencers and is capable of assembling large genomes such as the human genome [2]. SOAPdenovo was used to assemble the genome of the giant panda [3].
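As a rough illustration of what de novo assembly of short reads involves, the following toy sketch builds a de Bruijn graph from k-mers and walks an unambiguous path to recover a contig. This is an assumption-laden example of a technique commonly used by short-read assemblers, not SOAPdenovo's actual implementation; the function names, the reads, and the value of k are all hypothetical.

```python
from collections import defaultdict


def build_de_bruijn(reads, k):
    """Map each (k-1)-mer prefix to the set of (k-1)-mer suffixes that follow it."""
    graph = defaultdict(set)
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            graph[kmer[:-1]].add(kmer[1:])
    return graph


def walk_unambiguous(graph, start):
    """Extend a contig from `start` while each node has exactly one successor.

    Toy simplification: only out-degree is checked, and the walk stops
    if it would revisit a node (a cycle).
    """
    contig = start
    node = start
    visited = {start}
    while len(graph.get(node, ())) == 1:
        (nxt,) = graph[node]
        if nxt in visited:
            break
        contig += nxt[-1]
        visited.add(nxt)
        node = nxt
    return contig


# Hypothetical overlapping reads drawn from one short sequence.
reads = ["ATGGCG", "GGCGTG", "CGTGCA"]
graph = build_de_bruijn(reads, k=4)
contig = walk_unambiguous(graph, "ATG")
print(contig)
```

A real assembler must additionally handle sequencing errors, repeats, reverse complements, and paired-end information, none of which this sketch attempts.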
Indel Discovery
SOAPindel is a tool for detecting insertions and deletions (indels) in next-generation paired-end sequencing data.
SNP Discovery
SOAPsnp is a consensus sequence builder. This tool uses the output from SOAPaligner to generate a consensus sequence which enables SNPs to be called on a newly sequenced individual.
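The general idea of building a consensus from aligned reads and then calling SNPs against a reference can be sketched as follows. This is a minimal majority-vote illustration under stated assumptions, not SOAPsnp's actual statistical model or input format: the `(start, sequence)` read representation, the `min_depth` threshold, and the function name are hypothetical.

```python
from collections import Counter


def call_consensus(reference, aligned_reads, min_depth=2):
    """Build a majority-vote consensus and list positions that differ from the reference.

    aligned_reads: iterable of (start, sequence) pairs, where `start` is the
    0-based reference position of the read's first base (gapless alignment assumed).
    """
    # One base counter per reference position (the "pileup").
    pileup = [Counter() for _ in reference]
    for start, seq in aligned_reads:
        for offset, base in enumerate(seq):
            pos = start + offset
            if 0 <= pos < len(reference):
                pileup[pos][base] += 1

    consensus = []
    snps = []
    for pos, counts in enumerate(pileup):
        depth = sum(counts.values())
        if depth < min_depth:
            # Too little evidence: fall back to the reference base.
            consensus.append(reference[pos])
            continue
        base, _ = counts.most_common(1)[0]
        consensus.append(base)
        if base != reference[pos]:
            snps.append((pos, reference[pos], base))
    return "".join(consensus), snps


# Hypothetical reference and read placements.
reference = "ACGTACGTAC"
reads = [(0, "ACGTTC"), (2, "GTTCGT"), (4, "TCGTAC")]
consensus, snps = call_consensus(reference, reads)
print(consensus, snps)
```

Here all three reads cover position 4 and agree on T where the reference has A, so that position is reported as a candidate SNP. Production callers also weigh base qualities, mapping qualities, and diploid genotypes.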
Structural Variation Discovery
SOAPsv is a tool to find structural variations using whole genome assembly.
History
SOAP v1
The first release of SOAP consisted only of the sequence alignment tool SOAPaligner [4].
SOAP v2
SOAP v2 significantly improved the performance of the SOAPaligner tool: alignment time was reduced by a factor of 20 to 30, and memory usage by a factor of 3. Support was also added for compressed file formats.
The suite was expanded with four new tools: SOAPdenovo, SOAPindel, SOAPsnp, and SOAPsv.
SOAP v3
SOAP v3 made SOAPaligner the first short-read alignment tool to use graphics processing units (GPUs) [5]. As a result, SOAPaligner significantly outperforms the competing aligners Bowtie and BWA in terms of speed.
External links
- http://soap.genomics.org.cn
- http://soap.genomics.org.cn/soap1
- http://bioinformatics.genomics.org.cn
- http://seqanswers.com/forums/showthread.php?t=43
References
- ^ Li, R.; Yu, C.; Li, Y.; Lam, T.-W.; Yiu, S.-M.; Kristiansen, K.; Wang, J. (2009). "SOAP2: an improved ultrafast tool for short read alignment". Bioinformatics. 25 (15): 1966–1967. doi:10.1093/bioinformatics/btp336. ISSN 1367-4803.
- ^ Li, R.; Zhu, H.; Ruan, J.; Qian, W.; Fang, X.; Shi, Z.; Li, Y.; Li, S.; Shan, G.; Kristiansen, K.; Li, S.; Yang, H.; Wang, J.; Wang, J. (2009). "De novo assembly of human genomes with massively parallel short read sequencing". Genome Research. 20 (2): 265–272. doi:10.1101/gr.097261.109. ISSN 1088-9051.
- ^ Li, Ruiqiang; Fan, Wei; Tian, Geng; Zhu, Hongmei; He, Lin; Cai, Jing; Huang, Quanfei; Cai, Qingle; Li, Bo; Bai, Yinqi; Zhang, Zhihe; Zhang, Yaping; Wang, Wen; Li, Jun; Wei, Fuwen; Li, Heng; Jian, Min; Li, Jianwen; Zhang, Zhaolei; Nielsen, Rasmus; Li, Dawei; Gu, Wanjun; Yang, Zhentao; Xuan, Zhaoling; Ryder, Oliver A.; Leung, Frederick Chi-Ching; Zhou, Yan; Cao, Jianjun; Sun, Xiao; Fu, Yonggui; Fang, Xiaodong; Guo, Xiaosen; Wang, Bo; Hou, Rong; Shen, Fujun; Mu, Bo; Ni, Peixiang; Lin, Runmao; Qian, Wubin; Wang, Guodong; Yu, Chang; Nie, Wenhui; Wang, Jinhuan; Wu, Zhigang; Liang, Huiqing; Min, Jiumeng; Wu, Qi; Cheng, Shifeng; Ruan, Jue; Wang, Mingwei; Shi, Zhongbin; Wen, Ming; Liu, Binghang; Ren, Xiaoli; Zheng, Huisong; Dong, Dong; Cook, Kathleen; Shan, Gao; Zhang, Hao; Kosiol, Carolin; Xie, Xueying; Lu, Zuhong; Zheng, Hancheng; Li, Yingrui; Steiner, Cynthia C.; Lam, Tommy Tsan-Yuk; Lin, Siyuan; Zhang, Qinghui; Li, Guoqing; Tian, Jing; Gong, Timing; Liu, Hongde; Zhang, Dejin; Fang, Lin; Ye, Chen; Zhang, Juanbin; Hu, Wenbo; Xu, Anlong; Ren, Yuanyuan; Zhang, Guojie; Bruford, Michael W.; Li, Qibin; Ma, Lijia; Guo, Yiran; An, Na; Hu, Yujie; Zheng, Yang; Shi, Yongyong; Li, Zhiqiang; Liu, Qing; Chen, Yanling; Zhao, Jing; Qu, Ning; Zhao, Shancen; Tian, Feng; Wang, Xiaoling; Wang, Haiyin; Xu, Lizhi; Liu, Xiao; Vinar, Tomas; Wang, Yajun; Lam, Tak-Wah; Yiu, Siu-Ming; Liu, Shiping; Zhang, Hemin; Li, Desheng; Huang, Yan; Wang, Xia; Yang, Guohua; Jiang, Zhi; Wang, Junyi; Qin, Nan; Li, Li; Li, Jingxiang; Bolund, Lars; Kristiansen, Karsten; Wong, Gane Ka-Shu; Olson, Maynard; Zhang, Xiuqing; Li, Songgang; Yang, Huanming; Wang, Jian; Wang, Jun (2009). "The sequence and de novo assembly of the giant panda genome". Nature. 463 (7279): 311–317. doi:10.1038/nature08696. ISSN 0028-0836.
- ^ Li, R.; Li, Y.; Kristiansen, K.; Wang, J. (2008). "SOAP: short oligonucleotide alignment program". Bioinformatics. 24 (5): 713–714. doi:10.1093/bioinformatics/btn025. ISSN 1367-4803.
- ^ Liu, C.-M.; Wong, T.; Wu, E.; Luo, R.; Yiu, S.-M.; Li, Y.; Wang, B.; Yu, C.; Chu, X.; Zhao, K.; Li, R.; Lam, T.-W. (2012). "SOAP3: ultra-fast GPU-based parallel alignment tool for short reads". Bioinformatics. 28 (6): 878–879. doi:10.1093/bioinformatics/bts061. ISSN 1367-4803.