Skip to main content

Hindi/Marathi - Devanagari characters and their uses frequencies

Character -Use Frequency- % Use frequency
256675 6.4440754424
200819 5.0417562531
196683 4.9379179517
 े 178697 4.4863619337
 ् 145168 3.6445837882
125064 3.1398533209
124261 3.1196932251
 ी 122363 3.0720420897
114875 2.884048569
112860 2.8334600348
 ि 111272 2.7935917508
 ं 109402 2.7466435826
101555 2.5496370179
 ो 87486 2.1964210934
82563 2.0728243918
76621 1.9236447043
69420 1.7428565977
58501 1.4687244861
 ै 57906 1.4537864325
57749 1.4498447949
51867 1.3021714658
46947 1.178650082
46739 1.1734280398
44371 1.1139770973
 ु 38982 0.9786810126
31487 0.7905117501  
29446 0.7392704606
28289 0.7102228507
 ू 23857 0.5989531814
23764 0.596618326
23033 0.5782658602
21582 0.5418370944
21063 0.5288070947
18110 0.4546691585
17347 0.4355133016
16911 0.4245670977
 ँ 16500 0.4142485431
15676 0.3935612219
14101 0.3540193155
12879 0.3233398174
11804 0.2963508971
11470 0.2879655024
11322 0.2842498185
10503 0.2636880272
8720 0.2189240785
7938 0.1992912082
6350 0.1594229242
 ौ 5632 0.141396836
4622 0.116039804
 ृ 4536 0.1138806904
4196 0.1053446598
3449 0.0865904985
 ़ 2983 0.0748911154
 ॉ 2898 0.0727571078
2374 0.0596015783
2177 0.0546557017
1901 0.0477264534
1709 0.0429061067
1347 0.0338177447
943 0.0236749319
584 0.0146618878
449 0.0112725816
271 0.0068037185
 ः 226 0.0056739497
134 0.0033642003
28 0.0007029672
28 0.0007029672
24 0.0006025433
22 0.0005523314
17 0.0004268015
 ॅ 12 0.0003012717
 ॄ 11 0.0002761657
8 0.0002008478
7 0.0001757418
7 0.0001757418
6 0.0001506358
4 0.0001004239
3 7.53179169273503E-005
2 5.02119446182336E-005
2 5.02119446182336E-005
2 5.02119446182336E-005
1 0.000025106
1 0.000025106
 ॊ 1 0.000025106
1 0.000025106
1 0.000025106
 ऀ 0 0
0 0
0 0
0 0
0 0
0 0
 ऻ 0 0
 ऺ 0 0
0 0
 ॆ 0 0
 ॎ 0 0
 ॏ 0 0
 ॗ 0 0
 ॖ 0 0
 ॕ 0 0
 ॔ 0 0
 ॓ 0 0
 ॒ 0 0
 ॑ 0 0
0 0
0 0
 ॢ 0 0
 ॣ 0 0
0 0
0 0
0 0
0 0
0 0
0 0
0 0
0 0
0 0
0 0
0 0
0 0
0 0
ॿ 0 0
0 0
0 0
0 0
0 0

Total: 3041617 76.3627521769
Total Number of Characters: 3983116
Total Number SPACE characters: 941499 (23.637247823%)
Total Number Non Space Unicode characters: 3041617 (76.3627521769%)


Roman/English Character frequencies:
Char Uses freq %uses freq
e 1109679 9.4019327323
t 801072 6.7872105877
a 699276 5.9247277035
o 662934 5.6168142985
h 636070 5.3892047638
n 608327 5.154147761
s 540348 4.5781848157
i 535384 4.5361265321
r 491537 4.1646258147
d 387383 3.2821643986
l 360469 3.0541312308
u 250914 2.1259089787
m 214609 1.8183090621
f 201207 1.7047584745
w 188613 1.5980537961
c 176679 1.4969410732
y 172307 1.4598986043
g 162269 1.3748500445
p 129631 1.0983193717
b 126068 1.0681312845
v 82693 0.7006296627
k 64679 0.5480031678
x 9039 0.0765843726
j 7510 0.0636296756
q 6877 0.0582664819
z 4539 0.0384574032
8630113 73.1200120922

Totalchars:11802669
non alphabatical chars: 3172556 (26.879987908%)
Non space english chars: 8630113 (73.120012092)

Comments

Popular posts from this blog

Publishing business basics

Basic Steps:
1. Decide name for the company
2. Register the company with ministry - you will need an attorney (Lawyer for that)
3. Register with Registrar of News Papers in India if it's a magazine/News paper. 
4. Study the relevant acts in general or get them known from the lawyer
5. Start publishing

Following are details regarding the same (not that well written) :

-----
Some starts and books;
* Start Your Own Self-Publishing Business (Entrepreneur Magazine's Start Up) by Entrepreneur Press 
* How To Start And Run A Small Book Publishing Company: A Small Business Guide To Self-Publishing And Independent Publishing by Peter I. Hupalo * Art & Science Of Book Publishing by Herbert S., Jr. Bailey * This Business of Books: A Complete Overview of the Industry from Concept Through Sales by Claudia Suzanne
Raja Rammohun Roy National Agency for ISBN
West Block-I, Wing-6, 2nd Floor,
Sector -I, R.K. Puram,
New Delhi-110066


Some new things and the initiatives in the area : Pothi.com

Starting it is …

काही सुंदर अशी मराठी गाणी

>Suhasya tuze manasi mohi
http://www.esnips.com/displayimage.php?pid=22969903

>Jenvha Tuzya batanna udawi mujor wara
http://www.esnips.com/displayimage.php?pid=22969806

>Bhay ithale sampat nahi
http://www.esnips.com/displayimage.php?pid=22969806

>Pahile na mi tula
http://www.esnips.com/displayimage.php?pid=22969877

>Te sparsha chandanyanche
http://www.esnips.com/displayimage.php?pid=2086815

Installing SyntaxNet on ubuntu - Deep learning - tensorflow

1. Install Java8 (Java7(deprecated)
2. Install Brazel:
$ echo "deb [arch=amd64] http://storage.googleapis.com/bazel-apt stable jdk1.8" | sudo tee /etc/apt/sources.list.d/bazel.list $ curl https://storage.googleapis.com/bazel-apt/doc/apt-key.pub.gpg | sudo apt-key add - sudo apt-get update && sudo apt-get install bazel 3. sudo apt-get install swig
4. sudo pip install -U protobuf==3.0.0b2 5. sudo pip install asciitree 6. sudo pip install numpy Then you must have git installed : sudo apt-get install git and then built and test

git clone --recursive https://github.com/tensorflow/models.git cd models/syntaxnet/tensorflow ./configure cd .. bazel test syntaxnet/... util/utf8/... # On Mac, run the following: bazel test --linkopt=-headerpad_max_install_names \ syntaxnet/... util/utf8/...