summaryrefslogtreecommitdiffstats
path: root/ToDo
diff options
context:
space:
mode:
authorSuren A. Chilingaryan <csa@dside.dyndns.org>2005-08-07 21:41:32 +0000
committerSuren A. Chilingaryan <csa@dside.dyndns.org>2005-08-07 21:41:32 +0000
commit9d4628df369b92016b7fc3bfc7fed6d06ff2ca9a (patch)
tree9d78892cf391d2cdb5da216cfec78a8b565751c7 /ToDo
parent94ca629ceec7b0dc9f6f724b2e15923d3ec1d5b3 (diff)
downloadlibrcc-9d4628df369b92016b7fc3bfc7fed6d06ff2ca9a.tar.gz
librcc-9d4628df369b92016b7fc3bfc7fed6d06ff2ca9a.tar.bz2
librcc-9d4628df369b92016b7fc3bfc7fed6d06ff2ca9a.tar.xz
librcc-9d4628df369b92016b7fc3bfc7fed6d06ff2ca9a.zip
- Russian autoengine is renamed to LibRCD
- Fix Learning with Language Autodetection switched on - Attempt to perform rccFS with Language Autodetection switched off, if failed with default behaviour. - Systematization of translation engine: + Rearangement of the translation modes: OFF, TO_ENGLISH, SKIP_RELATED, SKIP_PARRENT, FULL. + New class types: TRANSLATE_LOCALE, TRANSLATE_CURRENT, TRANSLATE_FROM. - Detect "Unicode" locales for foreign languages - "out" class is assumed to be TRANSLATE_LOCALE - Respect RCC_CLASS_KNOWN - Check for Latin UTF-8 prior to running any charset detection engine.
Diffstat (limited to 'ToDo')
-rw-r--r--ToDo8
1 files changed, 8 insertions, 0 deletions
diff --git a/ToDo b/ToDo
index 214495f..db0515f 100644
--- a/ToDo
+++ b/ToDo
@@ -10,6 +10,14 @@
- Revise locking subsystem
- Libtranslate can leave translated message partly in old language. This causes problems
because of recoding from UTF8 to Current language. (With UTF-8 encoding should be Okey).
+ - Lating languages. If in the string all characters < 0x7F then we have one of the Latin
+ languages?
+ - Statistic approach of language detection.
+ - LibRCD autolearning using db4
+ + Charset detection
+ + Language detection (same as charsets, but for UTF8...)
+ * Consider word recognition based on probability
+ + Autolearning is triggered by large enough dictionary words
1.x: