Page 259 - Big Data Analytics for Intelligent Healthcare Management

252     CHAPTER 10 COMPUTATIONAL BIOLOGY APPROACH ON GENETIC
                     DISORDER




              Table 10.2 Statistical Results of Assembly of Xylella fastidiosa Bacteria Using SOAPdenovo2
              Software [42]
                                                                      Maximum
                           Hash         Total                         Sequence
              SL No.       Length       Sequence       Total Base     Length          N50
              1            31           43,137         6,405,006      21,741          100
              2            33           42,071         6,348,017      21,741          100
              3            35           4903           2,679,512      25,983          5145
              4            37           4372           2,666,958      28,153          5685
              5            39           3770           2,648,446      28,176          6819
              6            41           3261           2,631,326      31,352          8185
              7            43           2919           2,620,635      37,054          10,093
              8            45           2517           2,605,134      42,933          12,236
              9            47           2123           2,589,443      44,936          13,327
              10           49           1779           2,574,451      72,921          15,169
              11           51           1569           2,567,126      71,165          17,259
              12           53           1469           2,565,632      71,167          18,351
              13           55           1391           2,564,930      71,169          19,978
              14           57           1304           2,563,311      71,171          22,817
              15           59           1204           2,560,261      75,854          24,957
              16           61           1137           2,558,702      78,458          28,870
              17           63           1060           2,550,790      79,618          31,384
              18           65           1009           2,549,665      80,014          32,528
              19           67           971            2,549,749      80,194          35,625
              20           69           933            2,549,926      87,107          39,461
              21           71           890            2,549,199      87,111          39,465
              22           73           869            2,550,037      112,854         40,295
              23           75           837            2,549,531      129,552         41,360
              24           77           829            2,551,292      129,556         41,364
              25           79           821            2,552,692      129,560         44,880
              26           81           804            2,553,587      129,564         45,080
              27           83           806            2,556,210      129,568         47,198
              28           85           799            2,557,905      129,572         50,416
              29           87           780            2,558,243      124,761         50,420
              30           89           782            2,560,199      124,765         50,424



             different k-mer values, respectively. The performance of each assembler was measured by a metric
             called the N50 length. A higher N50 length indicates better performance of the assembly
             tools [30].
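The N50 length can be illustrated with a short Python sketch. It assumes the commonly used definition: sort the contigs from longest to shortest and report the length of the contig at which the running total first reaches half of the total assembled bases. The function name `n50` and the toy contig lengths are hypothetical and are not taken from the study; individual assemblers may also apply their own filters (for example, a minimum contig length) before reporting N50.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths (common definition)."""
    if not lengths:
        raise ValueError("need at least one contig length")
    ordered = sorted(lengths, reverse=True)  # longest contig first
    half_total = sum(ordered) / 2            # half of all assembled bases
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            # This contig is the one that pushes coverage to 50% or more.
            return length


# Hypothetical contig lengths for two assemblies with the same total size.
fragmented = [80, 70, 50, 40, 30, 20]
contiguous = [150, 60, 40, 20, 10, 10]

print("fragmented N50:", n50(fragmented))
print("contiguous N50:", n50(contiguous))
```

Both toy assemblies contain the same number of bases, but the second concentrates them in one long contig and therefore has the higher N50, which is why a higher N50 is read as a more contiguous assembly.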
                Hash lengths of 59, 75, and 89 performed best with the Velvet software, whereas
             hash lengths of 85, 87, and 89 performed best with the SOAPdenovo2 software.
             As the results above show, the present study suggests that the hash length of 89 provided