Automated vocabulary mining
Upload content.
Get a deck.
Start studying.
Drop any PDF, video, or audio file. Saikutsu extracts vocabulary, builds cloze cards from real sentences, and schedules reviews with FSRS. Minutes, not hours.
We're in private beta. Waitlist members get 50% off Pro for 3 months at launch. Beta tester? Sign in
青豆は、その風景を黙って見つめていた。
scenery; landscape; view
1-4 or space for good
The problem
Mining shouldn't take longer than reading.
Look up a word, copy the sentence, format the card, add it to Anki. Repeat 50 times. By then, you've forgotten what you were reading.
30–45 min
spent per session creating cards manually. Most people give up before finishing.
5 tools
PDF reader, dictionary, spreadsheet, Anki, tokenizer. For podcasts, add even more.
0 flow
Context switching between reading and card creation destroys immersion entirely.
Features
Everything you need to mine smarter.
Any input, one pipeline
PDF, video, or audio — novels, podcasts, anime, lectures. We extract or transcribe, tokenize, and build your deck.
FSRS v5 scheduling
State-of-the-art spaced repetition built in. 20–40% fewer reviews for the same retention. No Anki setup required.
Word/definition + i+1 cards
Choose your card type. Word/definition for straight vocabulary. i+1 for sentences with exactly one unknown word — the fastest path to comprehension.
Smart deduplication
Already know a word? We skip it. Every new deck checks your entire collection so you never study the same word twice.
Frequency ranking
Words ranked by document frequency and language-wide frequency. Learn high-value words first.
Deep Anki integration
Import existing decks — we skip words you know. Export new decks as .apkg. No duplicates, ever.
How it works
Three steps. That's it.
Upload your content
Drag and drop any PDF, video, or audio file. Novels, podcast episodes, anime, lectures. Japanese, Spanish, French, German, Italian, and Portuguese.
We do the mining
Transcription, tokenization, dictionary lookups, frequency analysis, i+1 sentence detection, deduplication against your existing cards. Usually under 5 minutes.
Start studying
Your deck is ready. Review in-app with FSRS scheduling, or export to Anki. Learning in minutes instead of hours.
Pricing
Free to start. Upgrade when you need to.
We're in private beta right now. Join the waitlist for early access and get 50% off Pro for 3 months at launch.
Free
$0/mo
- Unlimited reviews
- Unlimited decks
- Unlimited Anki import
- 1 upload / month
- 1 Anki export / month
Pro Monthly
$10/mo
- Everything in Free, plus:
- Unlimited uploads
- Unlimited Anki exports
- Priority support
Pro Annual
$90/yr
$7.50/mo — save $30 a year
- Everything in Free, plus:
- Unlimited uploads
- Unlimited Anki exports
- Priority support
Already a beta tester? Sign in
Compatibility
What about my other tools?
Saikutsu isn't here to replace your workflow. It fits right into it.
Works alongside Anki
Already have thousands of Anki cards? Good — keep them. Import your existing decks so we know what you've already learned, and we'll skip those words when mining new content. When you're done, export your new cards as .apkg and review them in Anki if you prefer.
Saikutsu handles the mining. Anki handles the reviewing. Use one, use both — your vocabulary stays in sync either way.
Works alongside Yomitan
Yomitan is great for what it does — hover over a word while you're reading, get an instant definition, and mine it on the spot. That's realtime, one-at-a-time mining, and it's perfect for immersion reading.
Saikutsu is for everything else. Drop an entire PDF, a podcast episode, or a movie file and extract hundreds of vocabulary words at once. Batch mining and realtime mining solve different problems — use both.
FAQ
Common questions.
Which languages are supported? +
We currently support Japanese, Spanish, French, German, Italian, and Portuguese. All languages are available on every plan, including Free. We're actively adding more.
How does tokenization work for different languages? +
For Japanese, we use MeCab-compatible tokenization with full dictionary lookup. For European languages (Spanish, French, German, Italian, Portuguese), we use lemmatization to reduce words to their base dictionary form — so conjugated verbs, plural nouns, and declined adjectives all map back to the root word you actually need to learn. No more creating separate cards for 'hablar,' 'hablando,' and 'hablo.'
I already use Anki. Why would I switch? +
You don't have to. Import your existing decks and we'll skip words you already know. Export new decks as .apkg. No duplicates. Use both tools together.
What's FSRS? +
Free Spaced Repetition Scheduler — a modern, open-source algorithm more efficient than SM-2. It learns your memory patterns and optimizes review timing. 20-40% fewer reviews for the same retention.
What formats are supported? +
Video: MP4, MKV, WebM, MOV. Audio: MP3, M4A, WAV, OGG, FLAC. Documents: PDF with selectable text. Files over 25MB are automatically chunked for transcription.
What's an i+1 sentence? +
A sentence where you know every word except one. It's the ideal context for learning — enough comprehension to guess meaning from context, with exactly one new word to acquire. We find these automatically across your content.
What happens with words I already know? +
We check every word against your entire card collection across all decks. Known words are skipped automatically. No duplicates, no wasted reviews.
Can I use scanned PDFs? +
Not yet. We need digital text to extract vocabulary. If your PDF has selectable text, you're good. We're working hard to get OCR working to support every type of PDF, but the quality is not up to snuff quite yet.
Found a bug? +
Email kamdyn@saikutsu.com or visit our contact page. We're a small team building this because we use it ourselves.
Ready to automate your mining?
We're in private beta. Join the waitlist and be first in line when we open up. Waitlist members get 50% off Pro for 3 months.