Package Exports
- @danskify/dictionary
- @danskify/dictionary/dist/index.js
Readme
@danskify/dictionary
About
This package was created by the Danskify project (https://danskify.com) as part of an open-data initiative to provide accessible English–Danish vocabulary resources.
It converts and repackages the Wiktionary dataset originally compiled by
Matthias Buchmeier and contributors into a JSON format suitable for modern web applications.
Data processing and enhancements
This dataset is not a raw copy of the original Wiktionary export. The source English–Danish dictionary compiled by Matthias Buchmeier and other Wiktionary contributors was used as a starting point and then significantly refined by the Danskify project.
Processing steps include:
- Data cleaning: removing malformed, duplicate, or incomplete entries.
- Quality filtering: dropping low-confidence translations based on semantic similarity using Xenova/distiluse-base-multilingual-cased-v2.
- Category pruning: excluding entries classified as article, interjection, abbreviation, prefix, suffix, and proverb.
- Normalization: converting data from .txt to JSON, standardizing field names, and adding optional metadata (e.g., wordCount, form).
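The cleaning, pruning, and similarity-filtering steps above can be sketched roughly as follows. This is a minimal illustration, not the Danskify pipeline itself: the Entry shape (apart from wordCount and form), the curate function, the embed callback, and the 0.5 threshold are all assumptions made for the example. In practice, embed would wrap an embedding model such as Xenova/distiluse-base-multilingual-cased-v2; here it is left as a plain function so the sketch stays self-contained.

```typescript
// Hypothetical entry shape. Only `wordCount` and `form` are named in this README;
// every other field name is an assumption made for illustration.
interface Entry {
  english: string;
  danish: string;
  category: string;
  wordCount?: number;
  form?: string;
}

// Word classes the README lists as excluded.
const EXCLUDED_CATEGORIES = new Set([
  "article", "interjection", "abbreviation", "prefix", "suffix", "proverb",
]);

// Cosine similarity between two equal-length embedding vectors.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) throw new Error("Vectors must have the same length");
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) return 0; // treat zero vectors as dissimilar
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Hypothetical embedding callback: maps a word or phrase to a vector.
type Embed = (text: string) => number[];

// Apply cleaning, category pruning, de-duplication, and similarity filtering.
// `minSimilarity` is an assumed threshold; the README does not state the real one.
function curate(entries: Entry[], embed: Embed, minSimilarity = 0.5): Entry[] {
  const seen = new Set<string>();
  const kept: Entry[] = [];
  for (const entry of entries) {
    const english = entry.english.trim();
    const danish = entry.danish.trim();
    if (!english || !danish) continue; // incomplete entry
    if (EXCLUDED_CATEGORIES.has(entry.category)) continue; // pruned word class
    const key = `${english}\u0000${danish}\u0000${entry.category}`;
    if (seen.has(key)) continue; // duplicate entry
    seen.add(key);
    // Drop pairs whose embeddings are not similar enough.
    if (cosineSimilarity(embed(english), embed(danish)) < minSimilarity) continue;
    kept.push({ ...entry, english, danish });
  }
  return kept;
}
```

Passing the embedding function in as a parameter keeps the filtering logic independent of any particular model runtime, so the same sketch can be exercised with precomputed vectors.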
As a result, this dataset represents a curated derivative work of the Wiktionary material, not an official subset or mirror.
License and Provenance
Data derived from:
English–Danish Wiktionary dataset
Compiled by User: Matthias Buchmeier and contributors
Version 20200401
Licensed under the Creative Commons Attribution–ShareAlike 3.0 Unported License (CC BY-SA 3.0).
© 2002–2020 Wiktionary contributors
© 2025 Danskify contributors (data cleaning, filtering, and JSON conversion)
This dataset was heavily curated and transformed from the original Wiktionary export.
Processing steps included data normalization, removal of malformed and duplicate entries, semantic similarity filtering (using Xenova/distiluse-base-multilingual-cased-v2), and exclusion of certain word classes such as article, interjection, abbreviation, and prefix. These modifications aim to improve translation quality and consistency while preserving the open-data spirit of the original work.
This derivative dataset is distributed under the same CC BY-SA 3.0 license.
This package was created for and is used by Danskify.com. No endorsement by Wiktionary or the Wikimedia Foundation is implied.
License selection
The original Wiktionary dataset was dual-licensed under CC BY-SA 3.0 or the GNU Free Documentation License.
This derivative package intentionally adopts CC BY-SA 3.0 Unported only, as allowed by the “or alternatively” clause.
🪶 Attribution (for UIs)
Translation data © Wiktionary contributors (Matthias Buchmeier et al.), CC BY-SA 3.0 — en.wiktionary.org