Page 259 - Big Data Analytics for Intelligent Healthcare Management
CHAPTER 10 COMPUTATIONAL BIOLOGY APPROACH ON GENETIC DISORDER
Table 10.2 Statistical Results of the Assembly of Xylella fastidiosa Bacteria Using SOAPdenovo2 Software [42]
SL No.  Hash Length  Total Sequence  Total Base  Maximum Sequence Length  N50
1       31           43,137          6,405,006   21,741                   100
2       33           42,071          6,348,017   21,741                   100
3       35           4903            2,679,512   25,983                   5145
4       37           4372            2,666,958   28,153                   5685
5       39           3770            2,648,446   28,176                   6819
6       41           3261            2,631,326   31,352                   8185
7       43           2919            2,620,635   37,054                   10,093
8       45           2517            2,605,134   42,933                   12,236
9       47           2123            2,589,443   44,936                   13,327
10      49           1779            2,574,451   72,921                   15,169
11      51           1569            2,567,126   71,165                   17,259
12      53           1469            2,565,632   71,167                   18,351
13      55           1391            2,564,930   71,169                   19,978
14      57           1304            2,563,311   71,171                   22,817
15      59           1204            2,560,261   75,854                   24,957
16      61           1137            2,558,702   78,458                   28,870
17      63           1060            2,550,790   79,618                   31,384
18      65           1009            2,549,665   80,014                   32,528
19      67           971             2,549,749   80,194                   35,625
20      69           933             2,549,926   87,107                   39,461
21      71           890             2,549,199   87,111                   39,465
22      73           869             2,550,037   112,854                  40,295
23      75           837             2,549,531   129,552                  41,360
24      77           829             2,551,292   129,556                  41,364
25      79           821             2,552,692   129,560                  44,880
26      81           804             2,553,587   129,564                  45,080
27      83           806             2,556,210   129,568                  47,198
28      85           799             2,557,905   129,572                  50,416
29      87           780             2,558,243   124,761                  50,420
30      89           782             2,560,199   124,765                  50,424
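different k-mer values, respectively. The performance of each assembler was measured by a metric
called the N50 length: the contig length at which contigs of that length or longer account for at
least half of the total assembled bases. A higher N50 length indicates a more contiguous assembly
and hence better performance of the assembly tool [30].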
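The N50 calculation can be illustrated with a minimal Python sketch. It assumes the common definition of N50 (sort the contig lengths in descending order and report the length at which the running total first reaches half of all assembled bases). The function name n50 and the example lengths are hypothetical and are not taken from the study; assemblers such as SOAPdenovo2 or Velvet may apply additional filters (for example, a minimum contig length) before reporting this value.

```python
def n50(contig_lengths):
    """Return the N50 of a collection of contig lengths.

    Assumes the common definition: the length L such that contigs of
    length >= L together contain at least half of all assembled bases.
    """
    if not contig_lengths:
        raise ValueError("at least one contig length is required")

    total_bases = sum(contig_lengths)
    running_total = 0
    # Walk from the longest contig to the shortest, accumulating bases.
    for length in sorted(contig_lengths, reverse=True):
        running_total += length
        # Compare doubled running total to avoid fractional halves.
        if 2 * running_total >= total_bases:
            return length


# Hypothetical contig lengths (total 290 bases; half is 145).
# 80 + 70 = 150 reaches the halfway point, so N50 is 70.
example_lengths = [80, 70, 50, 40, 30, 20]
print(n50(example_lengths))
```

Because the longest contigs are counted first, an assembly made of fewer, longer contigs reaches the halfway point sooner and yields a larger N50.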
With the Velvet software, hash lengths of 59, 75, and 89 showed better performance, whereas with
the SOAPdenovo2 software, hash lengths of 85, 87, and 89 showed better performance.
As the above results show, the present study suggests that a hash length of 89 provided