This is very cool, looking forward to it! I've been doing the same thing with Spanish Wikipedia articles for a while, using a few lines of Bash + Regex. I was using Apertium for it. https://apertium.org/ It's definitely worse than most ML-based solutions, but it works reliably and fast; you can run it entirely offline. With Spanish translations, the main problem I was facing is lack of vocabulary, so I created - Source: Hacker News / 9 months ago
I used to keep track of the state of machine translation some years back. I think the way you measure the success of an automated translation is edit distance, i.e. How many manual edits you need to make to a translated text before you reach some acceptable state. I suppose it's somewhat subjective, but it is possible to construct a benchmark and allow for multiple correct results. The best resources I knew back... - Source: Hacker News / almost 2 years ago
Apertium is one of them. We make open-source rule-based machine translation systems, and our core tools are in C++. A few of our proposed ideas involve modifying those C++ tools with new features or improvements to existing features. Source: about 3 years ago
Do you know an article comparing Apertium to other products?
Suggest a link to a post with product alternatives.
This is an informative page about Apertium. You can review and discuss the product here. The primary details have not been verified within the last quarter, and they might be outdated. If you think we are missing something, please use the means on this page to comment or suggest changes. All reviews and comments are highly encouranged and appreciated as they help everyone in the community to make an informed choice. Please always be kind and objective when evaluating a product and sharing your opinion.