Automated vocabulary mining

Upload content.
Get a deck.
Start studying.

Drop any PDF, video, or audio file. Saikutsu extracts vocabulary, builds cloze cards from real sentences, and schedules reviews with FSRS. Minutes, not hours.

We're in private beta. Waitlist members get 50% off Pro for 3 months at launch.

採掘 — Murakami Ch.3
Card 12 of 83 | Deck: Murakami Ch.3

青豆は、その風景を黙って見つめていた。

風景 (ふうけい)

scenery; landscape; view

Press 1-4 to grade, or Space for Good

FSRS v5 | cloze (self-graded) | 14% complete

The problem

Mining shouldn't take longer than reading.

Look up a word, copy the sentence, format the card, add it to Anki. Repeat 50 times. By then, you've forgotten what you were reading.

30–45 min

spent per session creating cards manually. Most people give up before finishing.

5 tools

PDF reader, dictionary, spreadsheet, Anki, tokenizer. For podcasts, add even more.

0 flow

Context switching between reading and card creation destroys immersion entirely.

Features

Everything you need to mine smarter.

Any input, one pipeline

PDF, video, or audio — novels, podcasts, anime, lectures. We extract or transcribe, tokenize, and build your deck.

FSRS v5 scheduling

State-of-the-art spaced repetition built in. 20–40% fewer reviews for the same retention. No Anki setup required.

Word/definition + i+1 cards

Choose your card type. Word/definition for straight vocabulary. i+1 for sentences with exactly one unknown word — the fastest path to comprehension.

Smart deduplication

Already know a word? We skip it. Every new deck checks your entire collection so you never study the same word twice.

Frequency ranking

Words are ranked by how often they appear in your document and by how common they are in the language as a whole. Learn high-value words first.
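
As a rough illustration of that ranking idea (a sketch, not Saikutsu's actual implementation), the snippet below sorts words by their count in the document and breaks ties with a general frequency list. The function name `rank_words`, the `language_rank` table, and the `unseen_rank` fallback are all hypothetical.

```python
from collections import Counter

def rank_words(doc_tokens, language_rank, unseen_rank=100_000):
    """Order words so the most useful ones come first.

    doc_tokens: list of (already lemmatized) words from one document.
    language_rank: hypothetical map of word -> position in a general
        frequency list (1 = most common word in the language).
    unseen_rank: assumed fallback rank for words missing from that list.
    """
    doc_counts = Counter(doc_tokens)
    return sorted(
        doc_counts,
        # Higher in-document count first, then more common in the language,
        # then alphabetical so the order is deterministic.
        key=lambda w: (-doc_counts[w], language_rank.get(w, unseen_rank), w),
    )

tokens = ["風景", "黙る", "風景", "見つめる", "黙る", "風景", "稀有"]
language_rank = {"風景": 2500, "黙る": 1800, "見つめる": 3200}  # made-up ranks
ranked = rank_words(tokens, language_rank)
print(ranked)
```

A real ranking would probably blend the two signals into one score rather than use the language list only as a tie-breaker; the tuple key is just the simplest version to read.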

Deep Anki integration

Import existing decks — we skip words you know. Export new decks as .apkg. No duplicates, ever.

How it works

Three steps. That's it.

01

Upload your content

Drag and drop any PDF, video, or audio file. Novels, podcast episodes, anime, lectures. Japanese, Spanish, French, German, Italian, and Portuguese.

podcast_ep42.mp3 18.7 MB

02

We do the mining

Transcription, tokenization, dictionary lookups, frequency analysis, i+1 sentence detection, deduplication against your existing cards. Usually under 5 minutes.

Audio transcribed
Tokenized — 312 words (47 already known, skipped)
Found 83 i+1 sentences
Building cards...

03

Start studying

Your deck is ready. Review in-app with FSRS scheduling, or export to Anki. Learning in minutes instead of hours.

312 cards ready
Start Review →

Pricing

Free to start. Upgrade when you need to.

We're in private beta right now. Join the waitlist for early access and get 50% off Pro for 3 months at launch.

Free

$0/mo

  • Unlimited reviews
  • Unlimited decks
  • Unlimited Anki import
  • 1 upload / month
  • 1 Anki export / month

Pro Monthly

$10/mo

  • Everything in Free, plus:
  • Unlimited uploads
  • Unlimited Anki exports
  • Priority support

Pro Annual (save 25%)

$90/yr

$7.50/mo — save $30 a year

  • Everything in Free, plus:
  • Unlimited uploads
  • Unlimited Anki exports
  • Priority support

Compatibility

What about my other tools?

Saikutsu isn't here to replace your workflow. It fits right into it.

Works alongside Anki

Already have thousands of Anki cards? Good — keep them. Import your existing decks so we know what you've already learned, and we'll skip those words when mining new content. When you're done, export your new cards as .apkg and review them in Anki if you prefer.

Saikutsu handles the mining. Anki handles the reviewing. Use one, use both — your vocabulary stays in sync either way.

Works alongside Yomitan

Yomitan is great for what it does: hover over a word while you're reading, get an instant definition, and mine it on the spot. That's real-time, one-at-a-time mining, and it's perfect for immersion reading.

Saikutsu is for everything else. Drop an entire PDF, a podcast episode, or a movie file and extract hundreds of vocabulary words at once. Batch mining and real-time mining solve different problems, so use both.

FAQ

Common questions.

Which languages are supported?

We currently support Japanese, Spanish, French, German, Italian, and Portuguese. All languages are available on every plan, including Free. We're actively adding more.

How does tokenization work for different languages?

For Japanese, we use MeCab-compatible tokenization with full dictionary lookup. For European languages (Spanish, French, German, Italian, Portuguese), we use lemmatization to reduce words to their base dictionary form — so conjugated verbs, plural nouns, and declined adjectives all map back to the root word you actually need to learn. No more creating separate cards for 'hablar,' 'hablando,' and 'hablo.'
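
To make the lemmatization step concrete, here is a toy sketch. It assumes a tiny hand-written lookup table (`LEMMAS`, hypothetical) where a real system would use a full morphological analyzer; it is not Saikutsu's actual code.

```python
# Toy lemma table: inflected form -> dictionary form.
# A real pipeline would get this from a morphological analyzer instead.
LEMMAS = {
    "hablo": "hablar",
    "hablando": "hablar",
    "hablar": "hablar",
    "casas": "casa",
}

def lemmatize(token):
    """Return the base form of a token, or the lowercased token if unknown."""
    word = token.lower()
    return LEMMAS.get(word, word)

def unique_lemmas(tokens):
    """Collapse inflected forms so each base word appears once, in first-seen order."""
    return list(dict.fromkeys(lemmatize(t) for t in tokens))

cards = unique_lemmas(["Hablo", "hablando", "hablar", "casas"])
print(cards)
```

Three surface forms of 'hablar' collapse into a single entry, which is why one card is enough.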

I already use Anki. Why would I switch?

You don't have to. Import your existing decks and we'll skip words you already know. Export new decks as .apkg. No duplicates. Use both tools together.

What's FSRS?

Free Spaced Repetition Scheduler: a modern, open-source algorithm that is more efficient than SM-2. It learns your memory patterns and optimizes review timing, typically needing 20–40% fewer reviews for the same retention.
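
For the curious, the core of FSRS is a forgetting curve. The sketch below uses the power-law curve published for FSRS v4.5 and v5, where a card's "stability" is the number of days after which recall probability drops to 90%. Whether Saikutsu uses exactly these constants or a tuned variant is an assumption; the function names are illustrative.

```python
DECAY = -0.5
FACTOR = 19 / 81  # chosen so recall probability is 0.9 when elapsed days == stability

def retrievability(elapsed_days, stability):
    """Estimated probability of recalling a card after elapsed_days."""
    return (1 + FACTOR * elapsed_days / stability) ** DECAY

def next_interval(stability, desired_retention=0.9):
    """Days to wait so that recall probability falls to desired_retention."""
    return stability / FACTOR * (desired_retention ** (1 / DECAY) - 1)

# A card with 10 days of stability, checked after 10 days and after 30 days.
print(round(retrievability(10, 10), 3))
print(round(retrievability(30, 10), 3))
# Accepting lower retention stretches the interval.
print(round(next_interval(10, desired_retention=0.8), 1))
```

The scheduler's real work is updating stability after each review from your grading history; this only shows how an interval falls out once stability is known.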

What formats are supported?

Video: MP4, MKV, WebM, MOV. Audio: MP3, M4A, WAV, OGG, FLAC. Documents: PDF with selectable text. Files over 25 MB are automatically chunked for transcription.
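
One simple way to plan that chunking, shown purely as an illustration: split the recording into equal time segments so that each one stays under the size limit. This sketch assumes a roughly constant bitrate and reads "25 MB" as 25,000,000 bytes; `plan_chunks` is a hypothetical helper, and the actual cutting would be done by an audio tool at the returned timestamps.

```python
import math

LIMIT_BYTES = 25_000_000  # assumption: "25 MB" read as decimal megabytes

def plan_chunks(size_bytes, duration_s, limit=LIMIT_BYTES):
    """Return (start, end) times in seconds for equal-length segments.

    Assumes roughly constant bitrate, so equal durations give roughly
    equal sizes, each at or below the limit.
    """
    n = max(1, math.ceil(size_bytes / limit))
    step = duration_s / n
    return [(i * step, min(duration_s, (i + 1) * step)) for i in range(n)]

# A 60 MB, one-hour recording needs three 20-minute segments.
chunks = plan_chunks(60_000_000, 3600.0)
print(chunks)

# An 18.7 MB file is under the limit and stays in one piece.
single = plan_chunks(18_700_000, 2400.0)
print(single)
```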

What's an i+1 sentence?

A sentence where you know every word except one. It's the ideal context for learning — enough comprehension to guess meaning from context, with exactly one new word to acquire. We find these automatically across your content.
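
In code, the definition is short. This is a minimal sketch that assumes sentences are already tokenized and lemmatized, and that `known` is the set of words you have cards for; `find_i_plus_one` is an illustrative name, not part of any Saikutsu API.

```python
def find_i_plus_one(sentences, known):
    """Return (tokens, target_word) for sentences with exactly one unknown word.

    sentences: list of token lists (already tokenized and lemmatized).
    known: set of words the learner already knows.
    """
    results = []
    for tokens in sentences:
        unknown = {t for t in tokens if t not in known}
        if len(unknown) == 1:
            results.append((tokens, unknown.pop()))
    return results

known = {"el", "gato", "come", "la", "en", "casa"}
sentences = [
    ["el", "gato", "come", "pescado"],            # one unknown word: i+1
    ["el", "perro", "bebe", "agua"],              # three unknown words: too hard
    ["el", "gato", "come", "en", "la", "casa"],   # nothing new: skipped
]
matches = find_i_plus_one(sentences, known)
print(matches)
```

Counting unknown words as a set means a sentence that repeats its one new word still qualifies.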

What happens with words I already know?

We check every word against your entire card collection across all decks. Known words are skipped automatically. No duplicates, no wasted reviews.
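
Conceptually this is a set lookup across all of your decks. The sketch below is illustrative only: `filter_known` and the deck names are hypothetical, and it assumes words have already been reduced to their dictionary form.

```python
def filter_known(candidates, decks):
    """Split candidate words into new ones and a count of known ones skipped.

    candidates: words mined from a new upload, in document order.
    decks: map of deck name -> set of words that already have cards.
    """
    known = set().union(*decks.values())
    fresh, seen, skipped = [], set(), 0
    for word in candidates:
        if word in known:
            skipped += 1          # already in some deck
        elif word not in seen:
            seen.add(word)        # first time this new word appears
            fresh.append(word)
    return fresh, skipped

decks = {
    "Anki import": {"casa", "gato"},
    "Podcast ep. 41": {"comer"},
}
fresh, skipped = filter_known(["gato", "pescado", "comer", "playa", "pescado"], decks)
print(fresh, skipped)
```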

Can I use scanned PDFs?

Not yet. We need digital text to extract vocabulary, so a PDF with selectable text will work. We're building OCR support for scanned PDFs, but the quality isn't good enough yet.

Found a bug?

Email kamdyn@saikutsu.com or visit our contact page. We're a small team building this because we use it ourselves.

Ready to automate your mining?

We're in private beta. Join the waitlist and be first in line when we open up. Waitlist members get 50% off Pro for 3 months.