123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121 |
- = Logs of language model for Vietnamese (vi) =
- - Generated by BuildLangModel.py
- - Started: 2016-02-13 03:37:17.480303
- - Maximum depth: 3
- - Max number of pages: 40
- == Parsed pages ==
- Chữ_Quốc_ngữ (revision 22887853)
- 1651 (revision 21455247)
- 1773 (revision 21354755)
- 1815 (revision 21361292)
- 1838 (revision 21361314)
- 1865 (revision 21361338)
- 1869 (revision 21361342)
- 1888 (revision 21389506)
- 1902 (revision 21354811)
- 1918 (revision 21354828)
- 1919 (revision 21354829)
- 1938 (revision 21354849)
- 1945 (revision 21354857)
- 22 tháng 2 (revision 21376086)
- 26 tháng 11 (revision 22579845)
- 28 tháng 12 (revision 22475308)
- A (revision 22549334)
- ASCII (revision 22528409)
- Alexandre de Rhodes (revision 22859954)
- Antonio Barbosa (revision 22145269)
- B (revision 22836557)
- BBC (revision 22863903)
- Biên khảo (revision 22531516)
- Bán nguyên âm (revision 22655600)
- Bình luận (revision 22117664)
- Bảng chữ cái Bồ Đào Nha (revision 22887853)
- Bảng chữ cái Hy Lạp (revision 21362081)
- Bảng chữ cái Latinh (revision 22442448)
- Bắc Kỳ (revision 22393289)
- Bồ Đào Nha (revision 22620858)
- C (revision 21341881)
- Cao Xuân Dục (revision 22620201)
- Chính tả (revision 22187359)
- Chính tả tiếng Việt (revision 20897580)
- Chữ Hán (revision 22889609)
- Chữ Nôm (revision 22781506)
- Chữ cái (revision 22169220)
- Công giáo (revision 22173119)
- D (revision 21447691)
- == End of Parsed pages ==
- - Wikipedia parsing ended at: 2016-02-13 03:42:06.560479
- 101 characters appeared 222814 times.
- First 55 characters:
- [ 0] Char n: 11.262308472537633 %
- [ 1] Char h: 8.881398834902654 %
- [ 2] Char t: 7.022898022565907 %
- [ 3] Char c: 6.365398942615815 %
- [ 4] Char i: 6.198443544840091 %
- [ 5] Char g: 5.591210606155808 %
- [ 6] Char a: 3.5998635633308496 %
- [ 7] Char u: 2.8499106878382867 %
- [ 8] Char m: 2.615185760320267 %
- [ 9] Char o: 2.6012728105056238 %
- [10] Char đ: 2.222032726848403 %
- [11] Char r: 2.1102803234985235 %
- [12] Char à: 2.0447548179198796 %
- [13] Char v: 1.9437737305555307 %
- [14] Char l: 1.9119085874316697 %
- [15] Char á: 1.7539292863105551 %
- [16] Char p: 1.6453185167897888 %
- [17] Char b: 1.541195795596327 %
- [18] Char ư: 1.4397659033992478 %
- [19] Char s: 1.3760356171515256 %
- [20] Char y: 1.280440187779942 %
- [21] Char e: 1.2454334108269678 %
- [22] Char d: 1.1251537156552103 %
- [23] Char ế: 1.071745940560288 %
- [24] Char k: 1.0695019163966357 %
- [25] Char â: 0.9658280000359044 %
- [26] Char ữ: 0.9604423420431392 %
- [27] Char ê: 0.8374698178749989 %
- [28] Char ệ: 0.7459136319979893 %
- [29] Char ô: 0.7073164163831717 %
- [30] Char ạ: 0.6727584442629277 %
- [31] Char ộ: 0.6705144200992756 %
- [32] Char ố: 0.6476253736300233 %
- [33] Char ó: 0.6072329386842837 %
- [34] Char ả: 0.5484395055965963 %
- [35] Char ủ: 0.5475418959311353 %
- [36] Char q: 0.5138815334763525 %
- [37] Char ợ: 0.48560682901433483 %
- [38] Char ờ: 0.4851580241816044 %
- [39] Char ể: 0.4748355130288043 %
- [40] Char ớ: 0.4676546357051173 %
- [41] Char ấ: 0.418286104104769 %
- [42] Char ị: 0.40212913012647317 %
- [43] Char ầ: 0.3904602044754818 %
- [44] Char ọ: 0.3801376933226817 %
- [45] Char ề: 0.3787912788244904 %
- [46] Char ơ: 0.3590438661843511 %
- [47] Char í: 0.35679984202069887 %
- [48] Char ụ: 0.35276059852612496 %
- [49] Char ậ: 0.3469261357006292 %
- [50] Char ì: 0.32762752789322036 %
- [51] Char ă: 0.3253835037295682 %
- [52] Char ứ: 0.29665999443482005 %
- [53] Char ồ: 0.29665999443482005 %
- [54] Char x: 0.2939671654384374 %
- The first 55 characters have an accumulated ratio of 0.9603301408349568.
- 1494 sequences found.
- First 512 (typical positive ratio): 0.9321889118082535
- Next 512 (512-1024): 0.009604423420431392
- Rest: 0.0068905733918831966
- - Processing end: 2016-02-13 03:42:07.174723
|