Kontext. Kern. Karte. (Context. Core. Card.)
The name Kardenwort evokes "carding"—the process of combing and straightening tangled fibers. In the same way, our toolkit processes raw, complex texts, meticulously organizing them into simple, clear, and context-rich vocabulary lists for language learners.
This philosophy is deeply inspired by Rich Hickey's influential talk, "Simple Made Easy." Kardenwort embodies the principle that achieving simplicity is hard work. The tool undertakes the difficult task of parsing linguistic complexity to produce a result that is truly simple: an untangled, organized list. This foundational simplicity, in turn, makes the process of creating cards and learning vocabulary easy for the end-user.
Our approach is built on a simple but powerful mantra: Kontext. Kern. Karte.
- Kontext (Context): Words cannot be learned in a vacuum. Kardenwort ensures that every vocabulary item retains its original sentence and surrounding phrases, allowing you to understand usage, nuance, and grammar naturally.
- Kern (Core): Our engine drills down to find the lexical kernel (the lemma or base form) of every word. It intelligently handles complex grammar to give you the most accurate and useful information.
- Karte (Card): The final output is a perfectly structured dataset, ready to become a rich learning card in Anki or a clear list in GoldenDict. It's not just a word; it's a data-filled canvas for learning.
All you need is Plain Text in the AI era.
- A Personal Journey: The "Why"
- Core Philosophy: Our Approach to Language Learning
- Key Advantages of the Kardenwort Method
- Who Can Benefit?
- The Vision: A Bridge Between Cultures
- Our Ecosystem
- License & Creator
Several years ago, I faced the most significant challenge of my life: my first serious attempt to learn a foreign language, German, while living abroad. It was incredibly difficult. With a 12-year background as a systems administrator and the last 7 years deeply involved in software development, I turned to the tools I knew best. I created Anki templates, manually entered thousands of words using tools like VocabSieve, and delved into sentence mining.
This journey led me to a simple truth: everything is the word, and context is everything. It became clear that different skills must be trained in separate sessions, and consistency is paramount. I realized that a solid grasp of grammar is a powerful accelerator. I created a set of five tables that map the entire German grammar up to the C1 level, which I use as my guide. This process ignited a lasting interest in computational linguistics, right as the AI boom began. I passed my German B1 exam.
Throughout this time, I observed my colleagues' struggles and tried to address them in my tools. I arrived at the conclusion that the most effective methods have been known for a long time. The most reliable things are simple. The key is to reduce cognitive load wherever it isn't the primary goal of the activity.
My first native language is Russian, my second is Ukrainian. I am learning German for life in Germany and English for my work in IT. This multicultural experience fuels my motivation. I strive to make this utility accessible to a broad audience, to be able to share it with a colleague and explain the approach, to help someone facing the same life circumstances many of us find ourselves in today. I believe in mutual prosperity; if everyone in society thrives, so do I. Together, we can lift each other up towards a brighter future.
We are building not just tools, but a philosophy—a set of high-level concepts for language acquisition that prioritizes clarity, efficiency, and learner autonomy.
Our philosophy is supported by three core principles:
- Openness and Free Access: This is an open-source development that can be used anywhere without restrictions, fostering a community of learners and developers.
- Your Data in Your Hands: The project does not require registration or subscriptions. You are not dependent on third-party services that might one day charge a fee for access to the very cards you spent hours creating.
- Maximum Flexibility: The output of the utility is a starting point, not a rigid system. You get full control over your learning materials to adapt them to your personal needs.
First, there was the word. Words are the fundamental building blocks of language. You can combine them in countless ways, and even without perfect grammar, a strong vocabulary allows you to communicate. Acquiring words is the most challenging and time-consuming part of learning a language; it requires constant, sustained effort. This program is dedicated to improving comprehension by breaking down language to its core component: the word. Understanding words in their varied contexts is the essential first step before we can effectively produce speech.
German is largely a synthetic language, meaning it relies heavily on inflections (changes in word endings) and compounding (joining words together) to convey grammatical relationships. This contrasts with analytic languages like English, which use word order and helper words. This synthetic nature is why a single German word can be so intimidating and hard to find in a dictionary.
The real challenge isn't looking up a translation; it's correctly deconstructing an inflected or compound word to find its base form (lemma). Kardenwort was created specifically to fight this complexity. It automates the process of decomposition, allowing you to understand any text by breaking it down into its constituent parts.
This Russian aphorism ("простой как пень") captures our core design principle. The tools and methods must be incredibly simple to integrate into a regular study routine. They should not add extra cognitive load to the primary task: analyzing foreign language material. The idea is to set it up once and then just use it. This approach is as fundamental as annotating words in the margins of a physical book—a practice Kardenwort digitizes and automates.
I am a firm believer in the Zettelkasten method, which uses unique identifiers (ZID) to link notes. This inspiration is reflected in our workflow: Kardenwort generates a unique, timestamp-based ID for each export. This ZID is used as the filename and the Anki deck name, creating a clear link between a specific text and the vocabulary derived from it.
Our process deliberately separates the act of reading from the act of vocabulary acquisition. When you read, you can fully immerse yourself in the material without constantly stopping to look up words and create flashcards. Later, in a dedicated session, you can efficiently work through the pre-processed, context-rich vocabulary list. This division of labor makes both activities more focused and effective.
Many language learning apps, unlike many dedicated reading applications (such as LWT, Lute, VocabSieve, LinguaCafe, alexandria-reader, LingQ, Readlang, LanguageCrush, lemmatize, Smart Book - KursX, KOReader, and others), are e-readers that work well for novels but fail with complex, non-linear content. Technical books, academic papers, and articles are filled with essential diagrams, code snippets, formulas, and footnotes. E-readers strip this context away.
Kardenwort acts as a companion to your original source material, not a replacement. You can read from a physical book, a PDF, or any other source. Our tool decouples the vocabulary work from the medium, allowing you to preserve the full, rich context of the original. The ZID ensures you can always trace your vocabulary back to the source text.
The accuracy of NLP analysis is a compromise. Kardenwort bridges this gap by giving you direct control. Through a simple rules file (lemma_override.tsv
), you can "train" the system by providing corrections for your specific texts. Once a rule is added, it is saved forever and automatically reapplied in the future. This allows you to achieve perfect processing for your materials without repeating work.
While other platforms exist, Anki is superior for serious learners for several key reasons:
- Full Control & Customization: You control every aspect of your cards and study algorithm.
- Offline First: Your data is yours. It lives on your machine, accessible anytime, anywhere.
- Open Ecosystem: A massive community provides add-ons to tailor your workflow.
- Overcoming Complexity: Anki's initial learning curve can be steep. Our pre-configured
20250913123501-kardenwort-anki-templates
project eliminates this barrier, providing a powerful, ready-to-use template out of the box.
Decks themselves are cheap to create and ultimately disposable. Like equipment at a gym, they are tools to build strength. The knowledge should end up in your head, not locked in a proprietary system.
Kardenwort is completely independent of the internet and resource-hungry AI models.
- Privacy: You are not sending your data to any third-party service for processing or to train their models. No one knows what you are reading. This is crucial when working with copyrighted, confidential, or sensitive materials.
- Reliability: It delivers consistent, high-quality results without relying on a network connection or powerful hardware.
(Note: While Kardenwort itself is offline, your data will be sent to Anki's servers if you choose to use their synchronization service.)
The modern world is filled with distractions. Language learning apps often compound this with complex interfaces and notifications. Kardenwort offers a focused, streamlined alternative to distracting apps:
- Measurable Progress: Learning is broken down into small, manageable steps—one card at a time—providing a clear sense of accomplishment.
- Gamified Feedback Loop: The process of finding a complex word and "catching" it by adding a rule to your override file is highly engaging, creating a positive feedback loop.
- Distraction-Free Environment: Work offline, away from the noise of the internet. You can even print your decks and study on paper.
- Set It and Forget It: The system is designed to be configured once. You spend your time learning, not fiddling with settings in a complicated UI.
- Auditory-Centered Learning: We learn living languages. The cards are built around audio to prepare you for producing and understanding real speech. This also enhances reading comprehension, even when reading silently.
- The Autonomous Learner: Learn on your own terms. Use AI to generate texts on topics you love, then use Kardenwort to extract the vocabulary. Explore the same topic from multiple angles to build a rich, multi-faceted vocabulary.
- The Methodical Learner: Our approach is similar to the time-tested Grammar-Translation Method. It provides a solid foundation, especially at the beginner stages, and can be integrated into any other methodology, including the popular communicative approach.
Currently, Kardenwort fully supports English (en) and German (de). We plan to expand support to include Russian (ru) and Ukrainian (uk). Our goal is to foster linguistic exchange, helping people from different cultures understand each other better, live together happily, and build a brighter, more developed future.
Kardenwort is a suite of integrated tools designed to work together seamlessly:
20250913122858-kardenwort
: The core intelligent engine for text processing and vocabulary extraction.20250913123240-kardenwort-anki-csv-importer
: The bridge that automatically imports the generated vocabulary files into Anki.20250913123501-kardenwort-anki-templates
: The powerful and feature-rich Anki card template that brings your vocabulary to life.
This project was created by and is maintained by Denis Novikov (voothi).
It is licensed under the MIT License. See the LICENSE
file in the project repository for full details.