Google’s 10,000 English Words:
https://github.com/first20hours/google-10000-english
GitHub:
Search for “child-friendly word list” or “clean word list” for community-curated options.
Automated Filtering (for Offline Use)
You can write a simple script to remove words that match patterns or are on a “blocklist.” Here’s a basic approach:
Blocklist: Maintain a list of offensive words and remove them.
Pattern Matching: Remove words that match certain patterns (e.g., spelling variants of slurs or vulgarities that an exact-match blocklist would miss).
Length/Complexity: For very young children, you might also filter by word length or reading level.
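As a rough illustration, the three filters above could be combined in a short Python script along these lines. This is only a sketch: the blocklist entries, the regex patterns, and the length limit are mild placeholders chosen for the example, and the function names (is_child_friendly, filter_words) are made up here. A real blocklist would normally be loaded from a curated file.

```python
import re

# Placeholder data for illustration only (assumptions, not a real blocklist).
BLOCKLIST = {"darn", "heck"}

# Placeholder patterns that catch variants an exact match would miss,
# e.g. "darned" or "heckle".
BLOCKED_PATTERNS = [re.compile(r"dar+n"), re.compile(r"^heck")]

# Arbitrary example cutoff for very young readers.
MAX_LENGTH = 8


def is_child_friendly(word, blocklist=BLOCKLIST, patterns=BLOCKED_PATTERNS,
                      max_length=MAX_LENGTH):
    """Return True if the word passes all three filters."""
    w = word.strip().lower()
    if not w:
        return False  # skip blank lines
    if w in blocklist:
        return False  # exact blocklist match
    if any(p.search(w) for p in patterns):
        return False  # pattern match
    if len(w) > max_length:
        return False  # too long for the target age group
    return True


def filter_words(words, **kwargs):
    """Return the normalized (lowercase, stripped) words that pass the filters."""
    return [w.strip().lower() for w in words if is_child_friendly(w, **kwargs)]
```

To use it offline, you could read a downloaded word list one word per line, pass the lines to filter_words, and write the result to a new file. Filtering by reading level is not covered here, since it would need an external graded word list.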