Bitbucket is a code hosting site with unlimited public and private repositories. We're also free for small teams!

Close

Katakanizer

Small script to convert Croatian writing (which is phonetic) into similar-sounding katakana.

Primarily written for playing with text-to-speech engines, which traditionally have very poor support for Croatian (but there are many engines that support Japanese very well). Since it's easier to convert from phonetic to phonetic language, Japanese was an obvious target language.

Initially I converted to hiragana, since it doesn't really matter which script one converts to when it comes to these engines. But, katakana is what foreign words are supposed to be written in, so the hiraganized string is converted into katakana.

Spaces are intentionally not converted into cdots, since that breaks at least Google's TTS engine (inserting too much pauses).

Again, it mostly works for Croatian writing system. It may or may not work even for Croatian, much less any other writing system. This is a toy project.

(c) 2013 Ivan Vučica. See LICENSE.md for license information.

Recent activity

Tip: Filter by directory path e.g. /media app.js to search for public/media/app.js.
Tip: Use camelCasing e.g. ProjME to search for ProjectModifiedEvent.java.
Tip: Filter by extension type e.g. /repo .js to search for all .js files in the /repo directory.
Tip: Separate your search with spaces e.g. /ssh pom.xml to search for src/ssh/pom.xml.
Tip: Use ↑ and ↓ arrow keys to navigate and return to view the file.
Tip: You can also navigate files with Ctrl+j (next) and Ctrl+k (previous) and view the file with Ctrl+o.
Tip: You can also navigate files with Alt+j (next) and Alt+k (previous) and view the file with Alt+o.