compact-dictionaries
📚 Compact dictionaries in English that automatically update weekly
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (11.9%) to scientific vocabulary
Repository
📚 Compact dictionaries in English that automatically update weekly
Statistics
- Stars: 2
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
Compact Dictionaries
Compact dictionaries in English that automatically update weekly.
Copyright © 2023 Teal Dulcet
Preprocessed free dictionaries/thesauruses in JSON Lines format that are automatically updated weekly. The dictionaries include the part of speech, definitions (senses), forms of the word, synonyms, antonyms, pronunciation and more information.
All dictionaries are provided uncompressed and in a compact JSON format with minimal single character keys and no whitespace. They include much more information then would be found in a traditional paper dictionary or thesaurus, including the full list of meanings for each word. While the definitions are currently in English, they are available with words in over 100 languages. The dictionaries are designed so that applications can directly download them, without developers needing to release an entire software update. This allows users to enjoy much more frequent updates and thus more accurate information.
❤️ Please visit tealdulcet.com to support this project and my other software development.
The dictionaries are hosted on GitLab because while it now has a 100 MiB file size limit for regular files, it has no maximum file size for Git Large File Storage (LFS) files, just a 10 GiB repository size limit. In contrast, GitHub has a 100 MiB file size limit and strict bandwidth limits on Git LFS files. Commits older than one month (previously one year) are automatically squashed to keep the repository size under that limit. Please see the CHANGELOG for the full history. The dictionaries are now updated on GitHub as it has no limit for CI minutes for public repositories. In contrast, GitLab has a 400 CI minutes/month limit.
Dictionary comparison
| Dictionary | License | Updated | Download |
| --- | --- | --- | --- |
| Wiktionary | 🅭🅯🄎
CC BY-SA 3.0
GFDL | Weekly
| Acholi (ach)
⬇️ dictionary-ach.json
1.672KiB (1.712KB) – 20 wordsChecksums (click to show)
MD5: 1dcf62a00b6fa714ae1903bae8a42a3a
SHA1: adacb763ea2a478bad9ad695dc77e0a9a832ff0e
SHA256: 8c834bb585696b60d56be982d4a01a1f539176266bc04c4526ddc558db316389
⬇️ dictionary-af.json
986.7KiB (1.010MB) – 8,518 wordsChecksums (click to show)
MD5: 849ad46081781370ce78626d39b130ee
SHA1: 90456cbe5a91838aaf75fbd0055070327f51c57b
SHA256: 9dcd7dbfb30a6e681b39f8ac7458e8d534e390d4479c7839c702016aa9dad8d1
⬇️ dictionary-ang.json
6.839MiB (7.171MB) – 55,978 wordsChecksums (click to show)
MD5: 7cc4fc459d39e72d8c52e27412ab0445
SHA1: ac5e6d6d9e839de2c3e8d106792bab4d47080b73
SHA256: d77e4727c90e374e6848f39b5cf98d8f3ac57caf58c425867df98ea3a13851a3
⬇️ dictionary-an.json
165.1KiB (169.0KB) – 1,383 wordsChecksums (click to show)
MD5: e802170c185700493897249cd45a7009
SHA1: eb6812111efe2292ca4fb50951a1329e22b6d7cb
SHA256: 779c4c34d483504d662665d5994e6af406a394f1fb684f678e5be261e9ab7050
⬇️ dictionary-ar.json
28.61MiB (30.00MB) – 55,472 wordsChecksums (click to show)
MD5: 3dc3faf4022ac9fd204fc4ed1dfe4d62
SHA1: ff120e40b86815fe607ba0dfb4434b1477c3e66f
SHA256: 9a00ae04c78d9b7a7bd9d141c012f8910e5750e0522ebfdeb642fc96dacfbe6a
⬇️ dictionary-ast.json
3.141MiB (3.293MB) – 31,436 wordsChecksums (click to show)
MD5: 04cc1e494ce4b499001cd7e652cdb233
SHA1: a2a0d1a752737142e06d4b075c167ddcc407677a
SHA256: e66a76fa39b2621644d9e7cf56e49c6534fd1e540d1c177393fd5680ede991f8
⬇️ dictionary-az.json
14.38MiB (15.08MB) – 12,662 wordsChecksums (click to show)
MD5: 2f52412be215c14dfb19b84eb31ef514
SHA1: 3818f89aa7081d32d032c6888c5d4bf9dd5705d6
SHA256: 6f4efb9f5acbabab9e7452f8a7cf267f4ad5df9506b674bc4d991a531815e94a
⬇️ dictionary-be.json
2.004MiB (2.101MB) – 6,820 wordsChecksums (click to show)
MD5: 792fd6b2ee5cd6aba7a45b50891e1e64
SHA1: 6ea0dcd12ab37e6cd18c9d564c80ffe88d80e28f
SHA256: 379fe41a926ff78285f2c71852ae9521154f3aae86b1495ad7cde119bdc2884a
⬇️ dictionary-bg.json
13.24MiB (13.88MB) – 48,914 wordsChecksums (click to show)
MD5: fa55d7b5146dc386a2c5414d9fda653a
SHA1: 42a3ec71fecb373dfd6ed3f63773d69e2f1fbbd4
SHA256: ec7b79014d482045817bdd38d0a1618eee57c6a774e88bdf48a834d098c35d93
⬇️ dictionary-bn.json
2.041MiB (2.140MB) – 8,432 wordsChecksums (click to show)
MD5: ea6faef81f60abe3f31c06fb328353d2
SHA1: f6b07e9a18bfd2021e52559604a62a2eb5648f76
SHA256: 55994f4c44936136c3cc5792c63ea041e58254589a2ebc1a58047557ac4aed16
⬇️ dictionary-bo.json
326.6KiB (334.4KB) – 3,285 wordsChecksums (click to show)
MD5: 618563db0dac3890595f748c98f2a840
SHA1: 2695075f223e6826e6e09baf68bc3174fa75aba5
SHA256: 5117457a87ccbabcfbd61a57026355707d9500a7d63295ace1723cae465e3cf7
⬇️ dictionary-br.json
186.7KiB (191.2KB) – 2,038 wordsChecksums (click to show)
MD5: 8abcc35f2adf7ee5a8645bde3a54e36e
SHA1: 60f8c47c8d4e259bf56da4a203854675e9fba9d3
SHA256: 977ea339d969c12b292e2dc148e36803203bdd78a340e5100975a39006e498d4
⬇️ dictionary-brx.json
5.063KiB (5.185KB) – 69 wordsChecksums (click to show)
MD5: 18ba6b9763ba8cbb4064900412e6865a
SHA1: 3f3cb7c1e37b8736815a1da41f41b5288c76b8c5
SHA256: 1ffa06511cfd2913a9c7a1e90e39787506175674897d8eb624c89259e25a29a6
⬇️ dictionary-ca.json
19.63MiB (20.58MB) – 184,123 wordsChecksums (click to show)
MD5: deab6412e8c24ae967127005d3d7f899
SHA1: 6f460e628cc563207838943864d0a19944e37611
SHA256: f7679c9ce382e7f56a5c2cd6099f544559d3dff0707bfa974c54660304619cea
⬇️ dictionary-cak.json
6.407KiB (6.561KB) – 111 wordsChecksums (click to show)
MD5: 89da1c82961f0ada9c866c4aee197fe5
SHA1: 38aaff5bad8b8153728ea8fa29c5dbbaa1b9d16a
SHA256: 8c3ba643b4b7cdbf08bd2b1f93d4ce78123ab30c09a2fd3680d1ca49c87f9322
⬇️ dictionary-ckb.json
115.9KiB (118.7KB) – 953 wordsChecksums (click to show)
MD5: 9a37508dfab8d9cdbc99e58f31eda71a
SHA1: 8dacf3bb860812e34c8cfb3ce225dc969964e1df
SHA256: 01f0acbf8cb41a94e7be875679c118f509e814d76824f0caef97ccc09f015e13
⬇️ dictionary-cmn.json
6.486MiB (6.802MB) – 65,507 wordsChecksums (click to show)
MD5: fc6ab7c1080a402fce0873a0d5099518
SHA1: a2921f1366db8642aff179c90457f801113ea684
SHA256: be38ef92132d1b6f0128db4136b64f5b0ca6a78b15ed18b60e3b52d734f14be6
⬇️ dictionary-cs.json
11.56MiB (12.13MB) – 61,916 wordsChecksums (click to show)
MD5: 17acc5c7171b1f2ae9be17a2a57e7f68
SHA1: 8714690f61938bc077d7d1bb24d7727560f025eb
SHA256: 3e766a42bdb66bdb24fc9549341f3c064b77b7af5439d9f16960eadaefaa1c2c
⬇️ dictionary-cy.json
2.997MiB (3.142MB) – 19,496 wordsChecksums (click to show)
MD5: ee769a251bd57d61bbf4d942d533419d
SHA1: 178bb79c80112204fa73114767090601990d047a
SHA256: 2bf0495c0dcf2ef1f858a16c4cd121883ba7b52a79c3ce7603627db98196eb5c
⬇️ dictionary-da.json
5.098MiB (5.346MB) – 48,756 wordsChecksums (click to show)
MD5: 2aa47ad43217b2e60f7a57ca6d5c4f4b
SHA1: 8dc32aae9179f19660789c337c8a26211040bfc2
SHA256: 580dfe345255c894b9f3cd19aaf897d7280c00a1e769ca37b758e15c38cb159e
⬇️ dictionary-de.json
41.93MiB (43.97MB) – 319,626 wordsChecksums (click to show)
MD5: 473bd11687ea00a63ccfb44f950a933b
SHA1: e561c1f83dc970c442712fbfddca0b28971b724b
SHA256: afe90f412d2a14804eb23dcd13c1b76adfa25de27afd1b7d783f280c16a410d0
⬇️ dictionary-dsb.json
477.3KiB (488.7KB) – 3,371 wordsChecksums (click to show)
MD5: b78041812e2f7482c334069f71466480
SHA1: 5fe12637d855d2eb88ca5a341430f844946a1382
SHA256: 5a409c04035d838465b8919c1bd220b710671958f5422b5b6f875870c9ecebb4
⬇️ dictionary-el.json
16.55MiB (17.35MB) – 77,280 wordsChecksums (click to show)
MD5: e74469e0e501c05928f851794f087ca2
SHA1: cdd3e1827819a86b27ceff2232d70116f1884533
SHA256: 843c8ba250c878b744a09567c972fd65b4c15093b6699e0d88a55393d6975b7e
⬇️ dictionary-en.json
113.2MiB (118.7MB) – 1,017,248 wordsChecksums (click to show)
MD5: 9c89adab3377883063cc25cdebf5990e
SHA1: e604d379a32014a3d7de757e5b88649630498a6b
SHA256: 6d5a1fd39014be154ba577380d92bffd11a45af0011c3860f57cce7a94f929f2
⬇️ dictionary-eo.json
13.82MiB (14.49MB) – 129,750 wordsChecksums (click to show)
MD5: b062aaa0d03b5552cec60ef48fbc993e
SHA1: 0f8ca85adfa971bb4e19526811483a7548e63c18
SHA256: 1a65ab988772d7225203c5b8d50b67ddaaaf2491f89e679262f639b1ac7e7988
⬇️ dictionary-es.json
77.04MiB (80.78MB) – 734,834 wordsChecksums (click to show)
MD5: 79697f26541046166229f4cba4acb22d
SHA1: fd9679058726beed1f5c233d50b39569d661cb1e
SHA256: cd51b0fa2772a2c051738f3ec008101770aa1a13dfd122ff8ff4ac7ee2ece058
⬇️ dictionary-et.json
3.067MiB (3.216MB) – 11,428 wordsChecksums (click to show)
MD5: 38ef2084034adee0089de301d2e8bc86
SHA1: c14b2ea273655b23faa9b0330064e3f17bc5995e
SHA256: 5aec22406ecb531d7bd8080db9650a25335b3647188bc483ab2d130e1038588b
⬇️ dictionary-eu.json
3.141MiB (3.293MB) – 9,450 wordsChecksums (click to show)
MD5: 04fe74d7532cae37e21611afd9c1189f
SHA1: 527057c652b5e391d606889a890f6dfaea9c1ad4
SHA256: 6b6bc8fea7798dda3a2406c60479ce8193dd1d46399e178efe61737db2a5692d
⬇️ dictionary-fa.json
2.087MiB (2.188MB) – 14,027 wordsChecksums (click to show)
MD5: 4b25833a6d5567a9f550c2a3c70b8109
SHA1: 9b4c2e39d78d9a814cf30523a6f91b6e29fae335
SHA256: c03937d0951555cc128d572a0933674aa84a540de8f3fc075381c312687ecdfd
⬇️ dictionary-ff.json
146.0KiB (149.5KB) – 1,830 wordsChecksums (click to show)
MD5: 043882facf9664ea709257c96cea4f4b
SHA1: 707462e3b18a2f7b190a8bce5acebcdd6fc3cc59
SHA256: b62eaa1e71ebf619f13690bb27233f9528ea742a3c84346ee1d61dd9166a7050
⬇️ dictionary-fi.json
501.9MiB (526.3MB) – 242,170 wordsChecksums (click to show)
MD5: b108a33d926381332ebbf8dd59ac1e4c
SHA1: 82c13d5c87125ae1e1a4de81e717bf48680b6831
SHA256: 2ea240c14c5c94a14f5690cbcb5e685451ae0ddd926252def3e614a31c0ff158
⬇️ dictionary-fr.json
41.16MiB (43.16MB) – 364,482 wordsChecksums (click to show)
MD5: d274f5f4457f31f8c7a42533e4770401
SHA1: d9f886e55022a64f12749602e4c4c5e5bf7873f4
SHA256: c4878b4f7b52a1ec2e1f1ff7b23aec5704de7344bcfc11c0802214707073408a
⬇️ dictionary-fur.json
154.5KiB (158.2KB) – 1,976 wordsChecksums (click to show)
MD5: 8abab374f875f03003bffea2dac473b1
SHA1: 6cbf4447bd8fcd9c8e3c68c8f4549ba8a48163bd
SHA256: 4f6e990af162e7b4e8a5d05b8eba8844906fd30f96b6fd6712f1e3c9ad53f27d
⬇️ dictionary-fy.json
281.3KiB (288.0KB) – 3,194 wordsChecksums (click to show)
MD5: 5002017e890bcbb609603c461b3672d9
SHA1: 21a2ccf1dcebf26e3c96fbe5fac3eed51d9711e5
SHA256: 16df72569e9fd4d9cc98b2bb54b73f2fafdad6927804d51abb226d0c0a9ceca0
⬇️ dictionary-ga.json
4.829MiB (5.064MB) – 28,111 wordsChecksums (click to show)
MD5: e08317f5c7befe3ec50e63140545aa1a
SHA1: b6b224eef48c25133c63125cdcb1d90011210b1f
SHA256: 49489d3dae01500a06786ebf16361316c8c201e1b7ffd995e0b1a50432911f72
⬇️ dictionary-gd.json
1.425MiB (1.495MB) – 14,639 wordsChecksums (click to show)
MD5: f8bd5ea7873b0a1bd672e682b5584018
SHA1: 91f39036c85631da15c33d5717b397bd61b92dc4
SHA256: bdea07db9371ad1a6a89f33baf6909365999b75e7ddc014c264534bd1fdc2c2d
⬇️ dictionary-gl.json
21.25MiB (22.28MB) – 198,497 wordsChecksums (click to show)
MD5: a081004badc6e4b2a9095461c0ce974a
SHA1: f253804a9fee66b4093afeac014b0afacae09cd5
SHA256: 1cdf0c9ec7b3b3f90469a5b9a86b42cf7cafd96931024ee0eca805ec09e32568
⬇️ dictionary-gn.json
88.78KiB (90.91KB) – 791 wordsChecksums (click to show)
MD5: 45db420ad1d52237c5e7aa284b9b645a
SHA1: 062567ae62c004e53841b9165cf9710228404496
SHA256: 407f069bafd6c84142b034b7cce326ad453699d186a96ece9afcfdbd1a1c316c
⬇️ dictionary-grc.json
25.95MiB (27.22MB) – 48,261 wordsChecksums (click to show)
MD5: 0106d97f1182285671fefb229815cd62
SHA1: ebc3cc6722a2064681c5acf3360e872f921c7118
SHA256: 12130d2ad101697dd4f232d44de098118d75ffefadf1abb18b6f622868f67cdc
⬇️ dictionary-gu.json
829.8KiB (849.7KB) – 5,768 wordsChecksums (click to show)
MD5: cb70e94d8c8fb6374d39392cfcad0a28
SHA1: f47c2af3cfedda576507d211b0d0d399deab1904
SHA256: 2833c7824c1e7ef50082aecdc543831833b1ccd6580966bcca6f70111c0205a3
⬇️ dictionary-he.json
3.610MiB (3.785MB) – 11,192 wordsChecksums (click to show)
MD5: 4d432349500c1a8cacd543570ba1e1b7
SHA1: 0b50a0083bb74052e12525b59d3385c98353156e
SHA256: 1e80683e23efd37f81a8a3f625db19dee756f2ab84356e166fcf2b82a88d9414
⬇️ dictionary-hi.json
5.606MiB (5.878MB) – 28,522 wordsChecksums (click to show)
MD5: e7f69e382ac4480164e3509834e8ccbe
SHA1: 356d73eb987494e94b8084ded6cdc3cc76ce7c77
SHA256: 1b59afdc533d79144dc3c2f8623e0a4593a171b4d7c62499ab1420562c7114a0
⬇️ dictionary-hsb.json
258.2KiB (264.4KB) – 1,348 wordsChecksums (click to show)
MD5: d21ee02da879268ec3402ad0cbfc2caa
SHA1: 5cc921b036e85fc9132d16dc7e1ac7dea8b52ecb
SHA256: 1535746629adc95c3dbf5958aefa6fe7bf378562951ac4bd1e4f76a22bbff4f0
⬇️ dictionary-hu.json
38.38MiB (40.24MB) – 69,330 wordsChecksums (click to show)
MD5: 933f76931e7ea4b7b218d1a94658a286
SHA1: aa3354f808d74c50aec4db83fd525e27b688d7c1
SHA256: 804b23a527844f10060001103c99f27215d9b9607c2c88e26814dcdf253c79cd
⬇️ dictionary-hy.json
18.03MiB (18.90MB) – 18,806 wordsChecksums (click to show)
MD5: 25dfc9bb49649c890dab56aac167bbc9
SHA1: 60c45ae242fcbd024b9b733b71d5aff4977c3029
SHA256: c6f77959d952e587babf66b388597baf32c44598fecf600a64dc853e5311966c
⬇️ dictionary-ia.json
238.2KiB (243.9KB) – 3,461 wordsChecksums (click to show)
MD5: 08d82dc2d102e279c2b4bf5edb15dee1
SHA1: f2aa5c120d3d86aed21d8baf49f65b009daf1639
SHA256: 64c38c699e93e7729cd9778c05d114d72efe1b7e07d8d2b4871f02b5720528e5
⬇️ dictionary-id.json
2.933MiB (3.076MB) – 22,997 wordsChecksums (click to show)
MD5: ff48c5ad3f5d09a238293aa78aa511e2
SHA1: 06e447e561eb84295356197b0c0f1562b7fa82d7
SHA256: 363242ab66063cd5347207fbe70b61a0dbbde9730d8efcf93cf8592861ba59fd
⬇️ dictionary-is.json
4.457MiB (4.674MB) – 21,616 wordsChecksums (click to show)
MD5: cfdf58048530708bfc859d937959a56c
SHA1: 60439445ca5edf40bff3a92f70748ab0cfe8cca2
SHA256: 67fa81837989b9ce50e1bcd36a598fc78fbbaa56139b47c9a4cf3bb0335e7f76
⬇️ dictionary-it.json
55.86MiB (58.57MB) – 570,840 wordsChecksums (click to show)
MD5: 73de260b0a4582d2a99b585665ad7c73
SHA1: 8ce02edd8a8c055e10bb50b6aaee8024fbde8460
SHA256: 8ff3631fea356427d863f1d46139d54fb37f9d84fc7f3cc0063f8a51fcf01d44
⬇️ dictionary-ja.json
22.29MiB (23.37MB) – 108,767 wordsChecksums (click to show)
MD5: dbd6da2508d82b77417c3f60e1475632
SHA1: 141d4a8264da1b43b1850803e4e1bc4023e52f31
SHA256: 9ef3efb5ed968179603815d822a1be7b1d3a4aac92be83684aa72e31b1468f8b
⬇️ dictionary-kab.json
19.59KiB (20.06KB) – 272 wordsChecksums (click to show)
MD5: d392e04eb85d0a542b14b0144b7e9008
SHA1: 153ac169d1aa85273b087f580d5f96c1c0cc876a
SHA256: d4f388eb61becba3dc3bbb58834e9878c6ed55f01d097098fa0322137ac45592
⬇️ dictionary-ka.json
16.43MiB (17.22MB) – 20,484 wordsChecksums (click to show)
MD5: 3c6d1b56936a32c83a2bd1728189fd4f
SHA1: 4209a80b70c5b10c8f33d6a4451bf38160bd5705
SHA256: 23ff2272dc113c1aeb7d9547f9d34483ebe03be06771dae32dc84908dd5150ea
⬇️ dictionary-kk.json
2.624MiB (2.752MB) – 10,090 wordsChecksums (click to show)
MD5: 8b5945c639684c370b5d70512a3147e6
SHA1: 02f15156886e573d45067cf59bdee1f9f7a2b7ce
SHA256: 55e0e26c5b025835b1acf08b67d43118dcf5074997033613f40a3ad6849cf40c
⬇️ dictionary-km.json
1.063MiB (1.115MB) – 9,100 wordsChecksums (click to show)
MD5: e1989fc821136cacec5761ab84eb41c6
SHA1: b388a44024ef364adb111ef7db7ec3cd10ae1eba
SHA256: f937359986f5b84a1704e41899965cf72d0b2c6ee3f31e5114121118f35070e0
⬇️ dictionary-kn.json
639.6KiB (654.9KB) – 2,067 wordsChecksums (click to show)
MD5: 92294433db0a933a9a296fbf5e0c7113
SHA1: c6014f6dfc3d965510acfa2c76ec10785013f799
SHA256: 60b6bed78403888fca6289d1c946df65957c00817e56d56cae0bb8f485db5048
⬇️ dictionary-ko.json
9.617MiB (10.08MB) – 40,192 wordsChecksums (click to show)
MD5: 69f1e400c7c40dd98da0276794e55ae0
SHA1: 30c14120e67e25c7750bdaddf5c8e8f953ba0697
SHA256: d1e80650e908adda0c4c43654ede5850fbedeb8c831d8c96060e8c1ba1128ec1
⬇️ dictionary-la.json
105.7MiB (110.9MB) – 826,939 wordsChecksums (click to show)
MD5: 50e6532b712daded2c752961a370b993
SHA1: 7f36b019b8b5e82cd6fea3d9e8305d10a2140574
SHA256: 325d06a9c82c579ddd70944c87dc42d08133507597cbeaa3f141eec611868a53
⬇️ dictionary-lij.json
166.2KiB (170.2KB) – 1,647 wordsChecksums (click to show)
MD5: 058cc325ba935aa0a829d008135c0628
SHA1: 76b8e8fe4ed943b81098d25f070186a0b8d6353d
SHA256: 33ff20a283454caf0020022b0f99cc99a17b5043074c46e88f2ec4cdee308b2a
⬇️ dictionary-lo.json
266.8KiB (273.2KB) – 2,245 wordsChecksums (click to show)
MD5: 5985a8204c765f0ead044f4f400eef45
SHA1: e186de9eedcfff747a6957634176124f9d69320e
SHA256: 5c8acd5bfd88e168b271d60798762b0485e415f004aa2858fedeac77bf4a7586
⬇️ dictionary-ltg.json
102.1KiB (104.5KB) – 557 wordsChecksums (click to show)
MD5: 7b095fc60c454b98747f1291049346cd
SHA1: 2164c59ac51a283949e167d302aaa1b0d756c4f8
SHA256: 80ab6a331804a0a3c48d80fd2517d00e2cbb3576ed2e70c6ff2dd956425c717c
⬇️ dictionary-lt.json
5.704MiB (5.981MB) – 26,377 wordsChecksums (click to show)
MD5: 83b567063b895d3d0887dca8637acde3
SHA1: e0be8d826bee7dd760146e12288a002583ddf980
SHA256: 2d34b516accf59fefab2f9885b13358f7804637479031209c5fae636f5927caf
⬇️ dictionary-lv.json
13.24MiB (13.88MB) – 121,503 wordsChecksums (click to show)
MD5: 6009b75c9f4d88b7a0ca8127be738e1c
SHA1: 918588b6e1d408c7ac60c7499fab8d4f6633a800
SHA256: 0ea743463e2497598bc7bbbc7f0f15fca1c0ee6085cf2f83b246f73782a73c4e
⬇️ dictionary-mk.json
24.56MiB (25.75MB) – 63,240 wordsChecksums (click to show)
MD5: 299570a0ac7aea444a198e7651c83353
SHA1: d704635e1f33d3e128c9f5c9d3a3d1f7e908e484
SHA256: 4e1bfb38b085d0e78b18445938ad1e63465bf36077f875f8ac18289d3cd4ae2e
⬇️ dictionary-ml.json
1.626MiB (1.705MB) – 9,760 wordsChecksums (click to show)
MD5: 25fb08b619c844091b99d4ee5f96d4e0
SHA1: d45f11712bd07d9e586809c80edf1af54d97243e
SHA256: c989afc5f91066ba6c2c51c5229ea36f075c5da25c3131b9ce14e8e34bc4823c
⬇️ dictionary-mr.json
1.410MiB (1.479MB) – 4,106 wordsChecksums (click to show)
MD5: b023eea9db53308ee89cde5feb9d2653
SHA1: 785c698982d37d57b2f51a7179dc1a50e505164a
SHA256: 7002aea2d82d2fd4a83e79764d643ca84345e0a101dd6f722350695841a2b42a
⬇️ dictionary-ms.json
1.277MiB (1.339MB) – 10,982 wordsChecksums (click to show)
MD5: 5359e060d232d0cdb282a99b8da195e2
SHA1: f58be737041cdeab5841a7c8bd270f599eea6381
SHA256: d82cfec420bd2171280ae6d0a0da94543c99d881d9cc135314afc19b7ce578d9
⬇️ dictionary-mul.json
2.607MiB (2.734MB) – 21,980 wordsChecksums (click to show)
MD5: 0f54d2f821283a898d0f78b4ece05077
SHA1: 4a6430ddaee149a7727ec89d95a046847fa55175
SHA256: fa74d31c312ac4d563169c77edf9e98929ecce0c700be97e6fd5b8dd51d157a5
⬇️ dictionary-my.json
1.119MiB (1.173MB) – 7,896 wordsChecksums (click to show)
MD5: bbfa37ed7f2f49a6a607677191a73e29
SHA1: e1f3e92e69cd8bb99e53b8a94cdd85d3f4ce5a13
SHA256: 5d9caee68e2393b575a67eb0970146fbc3d7945d6212976e70e1beb65bba2079
⬇️ dictionary-nb.json
6.173MiB (6.473MB) – 69,106 wordsChecksums (click to show)
MD5: ffef6cd62728600f0b2ef1b330ea8e8b
SHA1: 9ff8ff3a89cadd4199de57885808341f98d2002a
SHA256: a4ec2cfb03645656fea70b77e707950cef210986b5177f6dab963a807b89c904
⬇️ dictionary-ne.json
2.037MiB (2.136MB) – 1,967 wordsChecksums (click to show)
MD5: 75eda6ec30aa30f6a7abbf3171fa22f0
SHA1: 6f3f720bb24bf6bdf94b1a529c13f5e70eff7380
SHA256: 33161664335ce4673ed920ab20c302959ec32f0b3828ab545ac1bc75732029c8
⬇️ dictionary-nl.json
15.42MiB (16.16MB) – 121,189 wordsChecksums (click to show)
MD5: 1d7b8d46c33dc9f5f7b4f0bddbc52bcf
SHA1: 88ad13504191b763dce66eb9451e191440dec8c2
SHA256: 46c22742fb321293447b3fce9dd6e8177c3311d334195c189342e61a851bdbe0
⬇️ dictionary-nn.json
5.210MiB (5.463MB) – 56,195 wordsChecksums (click to show)
MD5: ff8c40b5eb0832aee749c2de627b90cc
SHA1: 95fece11043fc6846679b429975bddbd960e7671
SHA256: 26df50d6f45f1100a7d6fc1b33f83d4b1a6a403c330ad240f8bedc0cbf83b574
⬇️ dictionary-oc.json
1.440MiB (1.510MB) – 6,516 wordsChecksums (click to show)
MD5: 3ca751991a7f8e0eeada57b6cf194fb7
SHA1: d5d2bb7e69e903949baef38cd76f57e3fb4c0278
SHA256: 25061ef67947fd0702845ac7adf3e5be16e1b086de3eb0588b72d8f614d56b27
⬇️ dictionary-pa.json
1.381MiB (1.448MB) – 7,527 wordsChecksums (click to show)
MD5: 6eff6934115cdb20e5c8c47683f09e7e
SHA1: 36b65e62a0c28b0ccd34af5ca29f6135c0bedfb4
SHA256: 6ec1f41d09aaca4275c8a887d7a757ab2ab62c1321cefd6ab2dd4cc57d7248a3
⬇️ dictionary-pl.json
34.95MiB (36.64MB) – 150,556 wordsChecksums (click to show)
MD5: a6cd4c2c553a2dda080a81fa64daaf08
SHA1: a483a1e4885f26a44dac83fa85466c34331b4ca8
SHA256: ad3e5f2ab1a9e4ce026fe2e8d6195733e1e2091a95f467cdc9777143c219bf3b
⬇️ dictionary-pt.json
38.13MiB (39.99MB) – 372,534 wordsChecksums (click to show)
MD5: 223d7fd04c03b3c176b99592e42f3b39
SHA1: e41878ef9141888d9a1e76c470fd1c445ad8f86a
SHA256: 4bfb8e0316b2bd942d3ed589a6a852a48fd2c41b9876121a710e7498bb51bcbb
⬇️ dictionary-rm.json
231.3KiB (236.8KB) – 2,148 wordsChecksums (click to show)
MD5: 9e014c744e0f5673809e60d2aa3b6436
SHA1: 51e8002b95bbb24541da0fd52e5a0fa90bc0fcf0
SHA256: 00f9e616c6c4598383114ee5804a23ded2e06890cf2b51e7bb7291573750645c
⬇️ dictionary-ro.json
19.34MiB (20.28MB) – 114,167 wordsChecksums (click to show)
MD5: 7e5d4ce8db9e46471e22c3abe1ebd455
SHA1: 652681126dd57e35c17123fd07c966b7e60d443f
SHA256: aad6193247c270e0d2a3530c13e5732b3a90d10aaa02e3850f43acf1e0666a6a
⬇️ dictionary-ru.json
96.67MiB (101.4MB) – 407,438 wordsChecksums (click to show)
MD5: a71ac4bbb5a267fefc6aae885b2e98a6
SHA1: 51fd053f050ea8c812515ca1b94cecaa8d6e0553
SHA256: 0cb3f94f626db32fa07ab36003d09636c2af213907d4dbf8c0c0128dbf8a18e8
⬇️ dictionary-sat.json
157.6KiB (161.4KB) – 719 wordsChecksums (click to show)
MD5: 4efea4d43ed9c0e7777bfc5c3e949dae
SHA1: 291077ccf6c0a5f1ac7d7c7d71925109c96555c5
SHA256: d3bab4a8245f8bee57708cb4371b865a43edb391c8089d942964ca055665192b
⬇️ dictionary-sc.json
134.4KiB (137.6KB) – 1,259 wordsChecksums (click to show)
MD5: f58c709d9d15eb20d9fcc5102f91d038
SHA1: 156efa1f6a6b69a4f4cd8f6f09493b6a58190146
SHA256: bcab588061f3125c6f91bf61b2bbc2b395a3ba00f4d74e13a2f7c4b5a1a51f96
⬇️ dictionary-scn.json
339.9KiB (348.1KB) – 2,865 wordsChecksums (click to show)
MD5: a0d0d4b454c57ab5001b81885b6ae1ac
SHA1: c2711194cfe0b80e66c632264e098c8d5f3d0fb9
SHA256: b8ea8baf9a3e0bf63d245efcb6cf8fea614d03d5e6c7f4ef4292cefddae836c2
⬇️ dictionary-sco.json
422.8KiB (432.9KB) – 4,703 wordsChecksums (click to show)
MD5: 219d7c2f64d7e0fdb5ba6917089366ad
SHA1: 6ecc1f674d80645c3cd9ae1fdfc10c1b7297eefa
SHA256: 554f9071bd0578a979b630451ebe61cc62e14a02c7a1a3a46e1640bceec944df
⬇️ dictionary-sh.json
17.42MiB (18.26MB) – 62,447 wordsChecksums (click to show)
MD5: 1ef5bc2e62cbc3026e0042ce4c2bd21c
SHA1: d102de23168be7c1e2ea74c1ce147a67f58b9fe9
SHA256: a3c71dc8cea318df1515e4371b9e9a13c87e80d7c86ed7a8b6315d0794a8f014
⬇️ dictionary-si.json
101.2KiB (103.6KB) – 1,024 wordsChecksums (click to show)
MD5: b1ec9efc31d966bd2f90db296e0f2ee4
SHA1: 8d78d28dba97c713e4ede22f386be2c09e04b830
SHA256: 7f24bf30ad9b33d53f399adb9c71eb4bc6af0d1567940789a7c2b18683c41e93
⬇️ dictionary-sk.json
3.150MiB (3.303MB) – 14,953 wordsChecksums (click to show)
MD5: 67fb079be826fd62ea13f6a164ffa964
SHA1: 628c6a6d5a3b19c1fcf8b5cbc67a5c36e123da10
SHA256: 2726072869cd48132763961d6568152caad2f3d6ca154b25902cb53f5132f21c
⬇️ dictionary-skr.json
39.30KiB (40.25KB) – 335 wordsChecksums (click to show)
MD5: 6f2d5ff21076dff8abc32fbfeab32989
SHA1: a379fe5f00cf69de26d5b6e2c7112482c83b4d47
SHA256: 10e4a27181bd136abcf6c7ac20a178bbf1613dce184ab880bd55e5123f78c5e3
⬇️ dictionary-sl.json
1.181MiB (1.238MB) – 6,860 wordsChecksums (click to show)
MD5: 6cd6f016f71396073c237a32ff094819
SHA1: c0121ffa0fd878373dc3825af4740c6c6a98990a
SHA256: 92e311e9f19b974a4cc307e31de018673a298115f6dabee5d0d062aae00f8ddf
⬇️ dictionary-sq.json
2.404MiB (2.521MB) – 18,249 wordsChecksums (click to show)
MD5: d68403ba69cfa3ff7e3b43140618fe10
SHA1: c1adde5ecb3f611f782efeda64b3a784a4bb89bc
SHA256: c4a2aaec154eb4d3cdaa9132ec4db021201656c043b6983e6f8152b926ec4312
⬇️ dictionary-sv.json
27.14MiB (28.46MB) – 286,389 wordsChecksums (click to show)
MD5: 2b450a19e7e27151d404013c77f82115
SHA1: ba38a9b4fe15ee3cfd4c5c129a74a91f0716dedf
SHA256: 86d7aa2adb3253a5d3cd91a30ab756d9192c9264752dbf5a5fdff3225682e344
⬇️ dictionary-szl.json
447.3KiB (458.1KB) – 2,296 wordsChecksums (click to show)
MD5: 461e965553478d7f4c0ad8a87dc6c070
SHA1: 5bafb5e033af0f16c4369b4777afe265520789c1
SHA256: 0499624935d43b3fcdf40e85a38fd26ce87b2125237f3b82129fc7dfb82f1c22
⬇️ dictionary-ta.json
8.219MiB (8.618MB) – 8,585 wordsChecksums (click to show)
MD5: 2b5f006eaa7c3650f99a2e28a644cfef
SHA1: b728e72ee79522ff187724c95512e6756db4e507
SHA256: d30d68b78b1fb2612399bc9b55b99c37038461dd94da03c5c31bda142eb8af59
⬇️ dictionary-te.json
2.622MiB (2.749MB) – 19,782 wordsChecksums (click to show)
MD5: f9f8e4a4fe606d1ded36dd646baa933b
SHA1: 5f7e6f9951927f5c6506ca9f258b4c39eb59b43c
SHA256: 0bef30693d9431e9e324f96b1d4b0cb841777a275cb5593e6a7b0920dc84a672
⬇️ dictionary-tg.json
263.2KiB (269.5KB) – 1,961 wordsChecksums (click to show)
MD5: eb8dc8f9287aaa74b3a7fc99d0f70a54
SHA1: cd4609a2a0f0fc9c8d5e3c12e8275b591cb27fb3
SHA256: 2933eecd495037a1824a0f57fafd9915d369fd66e20c7c16f615ba2e1fd3f4ba
⬇️ dictionary-th.json
2.904MiB (3.045MB) – 16,455 wordsChecksums (click to show)
MD5: 7b6dfbc375a29b0a110db46ee7c20d37
SHA1: 1fdfa8d84367df78e5edbc577de9d2c6565e87df
SHA256: 69eb952e8d7b5c79d3a71a86c01f3dba63b0e3bbcd9ea4e4b1b017f073349e95
⬇️ dictionary-tl.json
5.351MiB (5.611MB) – 26,130 wordsChecksums (click to show)
MD5: 64d0a7ad0020f3f2b645f5bdf324e10d
SHA1: 0c02cb54544edb362a4c21cadbe9884b277047e6
SHA256: 628b8f0a3d44e0ae9553b33514bc1299a71c7cff1dd8b05b8799bc2a355b34d6
⬇️ dictionary-tr.json
38.06MiB (39.91MB) – 33,585 wordsChecksums (click to show)
MD5: 0e6b3c44fa0f3a6890f40d2e6c9a57f1
SHA1: 77dc4823f7ceabde984c6ffccb40860d3c51a022
SHA256: cf8dd678716baca564bbbaf5020658f90430f19eb1648bd6e4d4d39e1082a739
⬇️ dictionary-trs.json
1.579KiB (1.617KB) – 26 wordsChecksums (click to show)
MD5: 31ceddf6a3aba9a4f4226e7e461104fa
SHA1: 256a91073127fdfc4d7cc80474d89774059d4ab2
SHA256: bf7250386c3c47cad204f03e2ba8485d26d7af9b3f9ec90c47f12350134e02eb
⬇️ dictionary-uk.json
16.52MiB (17.32MB) – 47,205 wordsChecksums (click to show)
MD5: c95dc7205aa97f8606aa8454756f8fa7
SHA1: b65617bd1a17175053510b7be9c0ffe024ca4a47
SHA256: 067a2dca41eca39731e5639ce761cefb698eb7e8faa62c7500bfac7dc803a8e3
⬇️ dictionary-ur.json
1.207MiB (1.265MB) – 7,030 wordsChecksums (click to show)
MD5: 1b19d1a14ad02841f52683c97de1f1e6
SHA1: 3fc753b44bb999a9e98504b0e1d0563f85dac2c7
SHA256: 396fd2cc23267c2619100a3bbc175dbe192a3be00343e8e91dc1a9f3a5220d28
⬇️ dictionary-uz.json
2.288MiB (2.399MB) – 3,746 wordsChecksums (click to show)
MD5: 4e71a4e18fd0ef4a931db8bf1ee9b760
SHA1: 35a87599c332f882b4248a4527fa7df889316508
SHA256: f3621e5f5af5ebb3b92f687208bc1d6c806a5f46bdc76b2f561bcab9a11e152a
⬇️ dictionary-vi.json
1.554MiB (1.630MB) – 11,799 wordsChecksums (click to show)
MD5: 05012e8f942c1737e57bc141d8fce07c
SHA1: d3e929fc68f0a016d9d8e37534815e24f595f0aa
SHA256: da18badd8a5bcd10600b6c11db20e7d5dafd5422442cfaa24c80d773fe3ce3b7
⬇️ dictionary-wo.json
41.31KiB (42.30KB) – 665 wordsChecksums (click to show)
MD5: 311b435158b9bcf8f277e7a4c89bb089
SHA1: c6d2c993e6b505c4fae4aea62bee7c63171170ac
SHA256: a265c47c8911f650c0d8b5ea17999cfaf7c04d1e8cfa4ec402146fc7b194dfc3
⬇️ dictionary-xh.json
375.4KiB (384.4KB) – 3,210 wordsChecksums (click to show)
MD5: 9089cc67280bd904986ac7d02a7b3908
SHA1: f3b07a147bdd4a0bbc0725134922458df13cb30a
SHA256: 0dde7ce6683a91cd38b6d7d79eaf4bfaac360a4014387364cb102f49d0b8014b
⬇️ dictionary-zam.json
38B – 1 wordChecksums (click to show)
MD5: 9a51d9bb061770d2bd703c129ea2ada7
SHA1: df4d4cf9f7feb21743c349628b2d582dcd7f10f7
SHA256: 128f4ee65779ef8b2eab83bafba6d87b595cc8b6d24aff01d5b2f751b69a2f63
⬇️ dictionary-zh.json
25.56MiB (26.81MB) – 152,834 wordsChecksums (click to show)
MD5: 8025ae40ec64ada01d5a0da0ada26fa3
SHA1: 8a791dac487d1be87d5c19c9fc95cfce0688b2e3
SHA256: ba887e5d68f82de073902ed30e478134770a2951030981bb6e8f61d4522360c5
Dictionaries
Wiktionary
Uses the English Wiktionary dictionary data. It is created from the Wiktionary dumps, which is converted to a JSON Lines format by kaikki.org using their open source Wiktextract tool. See the Wiktextract paper for more information. The resulting over 15 GiB file for all languages combined is then preprocessed to create a minimal dictionary for each language using the scripts in this repository. The English Wiktionary currently includes words in over 4,400 languages, so the scripts automatically select the around 100 languages supported by Mozilla (Firefox and/or Thunderbird) or those with 50,000 words or more. This includes most modern languages, as well as Latin. The underlining Wiktionary dump files are updated monthly, but kaikki.org updates the extracted JSON files weekly to incorporate improvements made to their Wiktextract tool. If users notice any errors in the data, they should correct them by directly editing Wiktionary and this will automatically be included in the next monthly update.
Licensed under both the Creative Commons Attribution-ShareAlike 3.0 Unported License (CC BY-SA 3.0) and the GNU Free Documentation License (GFDL), so users must attribute it to Wiktionary.
JSON format
Uses the JSON Lines format, where each line is a JSON object for a word in the dictionary. Each JSON object may have the following keys:
* "" (empty string) - String with the word (required)
* "p" - Array of strings with the parts of speech (POS) (required)
* "d" - Array of strings with the definitions of the word (required)
* "f" - Array of strings with the forms of the word (optional)
* "s" - Array of strings with the synonyms of the word (optional)
* "n" - Array of strings with the antonyms of the word (optional)
* "i" - String with the International Phonetic Alphabet (IPA) pronunciation (optional)
* "a" - String with the filename for the pronunciation audio file in OGG Vorbis format (.ogg), add the https://upload.wikimedia.org/wikipedia/commons/ prefix to get the full URL (optional)
* "w" - Array of strings with the titles of the Wikipedia pages about the word, possibly prefixed with a language ID (optional)
Words, forms, synonyms and antonyms with any whitespace characters are excluded, as well as some POS categories that are not words, such as "character", "symbol", "prefix" and "suffix".
JSON format
See above for the specific format of each dictionary.
Contributing
Merge requests welcome! Ideas for contributions:
- Improve the performance of the update scripts.
- Reduce the size of the dictionaries.
- Provide localized versions of the dictionaries.
- Add more dictionaries.
Owner
- Name: Teal Dulcet
- Login: tdulcet
- Kind: user
- Location: Portland, Oregon
- Website: https://www.tealdulcet.com/
- Repositories: 31
- Profile: https://github.com/tdulcet
👨💻 Computer Scientist, BS, CRTGR, MS @Thunderbird Council member
Citation (CITATION.cff)
cff-version: 1.2.0
title: Compact Dictionaries
message: >-
If you use this dataset, please cite it using the metadata
from this file.
type: dataset
authors:
- given-names: Teal
family-names: Dulcet
orcid: 'https://orcid.org/0009-0008-6616-2631'
repository-code: 'https://github.com/tdulcet/compact-dictionaries'
repository: 'https://gitlab.com/tdulcet/compact-dictionaries'
abstract: >-
Preprocessed free English dictionaries/thesauruses in JSON
Lines format that are automatically updated weekly.
license: GPL-3.0
references:
- authors:
- family-names: Ylonen
given-names: Tatu
orcid: 'https://orcid.org/0000-0003-4180-1229'
title: "Wiktextract: Wiktionary as Machine-Readable Structured Data"
type: software
GitHub Events
Total
- Watch event: 2
- Push event: 17
- Pull request event: 2
Last Year
- Watch event: 2
- Push event: 17
- Pull request event: 2
Issues and Pull Requests
Last synced: 6 months ago
All Time
- Total issues: 0
- Total pull requests: 2
- Average time to close issues: N/A
- Average time to close pull requests: about 12 hours
- Total issue authors: 0
- Total pull request authors: 1
- Average comments per issue: 0
- Average comments per pull request: 0.0
- Merged pull requests: 2
- Bot issues: 0
- Bot pull requests: 2
Past Year
- Issues: 0
- Pull requests: 1
- Average time to close issues: N/A
- Average time to close pull requests: about 15 hours
- Issue authors: 0
- Pull request authors: 1
- Average comments per issue: 0
- Average comments per pull request: 0.0
- Merged pull requests: 1
- Bot issues: 0
- Bot pull requests: 1
Top Authors
Issue Authors
Pull Request Authors
- dependabot[bot] (2)