Improve the ability to search chat history for Asian regional languages, such as Chinese and Japanese. Telegram's chat history search function is based on words, and is suitable for languages such as English and Russian that are separated by spaces. But for languages that do not use spaces for word segmentation, you need to match the entire sentence to get the search results, which makes the search function of some languages almost unusable. I hope Telegram can improve this feature so that it can search languages that do not use spaces to separate words.
The issue is a little stranger than I had initially thought. It's always been my experience that when I try to search for anything on Telegram, I only get partial results, often missing the message that I was trying to find. I had chalked it up to Telegram not being able to run partial searches, but I just ran a few tests and even that doesn't seem entirely true.
When I am searching for text in Chinese or Japanese, I get almost no result back, as entire sentences are written with no spaces in between words. My experience with Korean is a lot better, as they DO use spaces to break up words, but they don't break up everything. Most notably, (I guess I can only write in English here,,, words in brackets are expressed as a single, unbroken chain of character in Korean)
BE verb is expressed through a suffix added to the noun itself [Airplane is] flying
Some compund nouns are not separated by spaces [Airplane ticket]
In this scenario, searching for ”airplane” will return neither of those examples.
....except, that it SOMETIMES does, and I cannot figure out why only SOME of those are returned when searched. All of this ends up giving the users a very inconsistent search experience
Mingye Wang
In addition to allowing naive substring search, true word search can be quite easily implemented with SQLite's FTS tokenizers. The ICU tokenizer works well for a lot of languages. It might oversplit a bit, but that doesn't mess up searching.
D
Deleted Account
Even though O have my conversations in English mostly, I'd like to see this coming. I can confirm this on Telegram Desktop for Windows.
D
Deleted Account
Should also improve the ability to search files for Asian regional languages
Kevin Kelly
Telegram's search engine can't work well to Chinese Japanese or Korean characters, because the engine identifies the strings by spaces and punctuations, but Asian characters no spaces between words therefore Asians use the engine difficultly.
I have two available suggestions for the search engine: overhead the engine, search for characters directly; or do not overhead the search engine, add the invisible spaces between Chinese Japanese and Korean characters.
M
Mia Zoe
I would also like it for Spanish, it would be easier to search for conversations / chat in keywords or related to what you wrote in a message
Jervis
Will there ever be an update on this suggestion? This is a very inconvenient issue for us Asian users, and I would be more than happy to see it added.
Log in here to report bugs or suggest features. Please enter your phone number in the international format and we will send a confirmation message to your account via Telegram.
When I am searching for text in Chinese or Japanese, I get almost no result back, as entire sentences are written with no spaces in between words. My experience with Korean is a lot better, as they DO use spaces to break up words, but they don't break up everything. Most notably, (I guess I can only write in English here,,, words in brackets are expressed as a single, unbroken chain of character in Korean)
BE verb is expressed through a suffix added to the noun itself
[Airplane is] flying
Some compund nouns are not separated by spaces
[Airplane ticket]
In this scenario, searching for ”airplane” will return neither of those examples.
....except, that it SOMETIMES does, and I cannot figure out why only SOME of those are returned when searched. All of this ends up giving the users a very inconsistent search experience
I have two available suggestions for the search engine: overhead the engine, search for characters directly; or do not overhead the search engine, add the invisible spaces between Chinese Japanese and Korean characters.
It would be great for Russian, too. I guess simple entry strict search of any entered symbol sequence as an option would solve the problem.