Support arbitrary languages #33
Comments
Note that Photon cannot assume that queries are entered in Western scripts; Chinese, Arabic or Thai, for example, should work fine as well (Nominatim supports this too). |
How do you guess the user's language in Nominatim? By analyzing HTTP headers or the IP address? |
Nominatim accepts an |
For Nominatim: the accept-language parameter is checked first and then the HTTP Accept-Language header, but this is used only to select the right name:* tag to output. It is not used in any way to rank the query matches or results. |
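The precedence described above can be sketched in a few lines of Python. This is only an illustration of the idea, not Nominatim's actual code: the function names `parse_accept_language`, `preferred_languages` and `display_name` are made up for this example, and the fallback to the plain `name` tag is an assumption.

```python
def parse_accept_language(header):
    """Parse an HTTP Accept-Language header into language codes, best first."""
    weighted = []
    for position, part in enumerate(header.split(",")):
        pieces = part.strip().split(";")
        lang = pieces[0].strip().lower()
        q = 1.0  # a missing q-value means full preference
        for param in pieces[1:]:
            key, _, value = param.strip().partition("=")
            if key.strip() == "q":
                try:
                    q = float(value)
                except ValueError:
                    q = 0.0
        if lang and lang != "*" and q > 0:
            # Sort by descending q, then by position in the header.
            weighted.append((-q, position, lang))
    return [lang for _, _, lang in sorted(weighted)]


def preferred_languages(param=None, header=None):
    """The explicit parameter wins; the header is only a fallback."""
    if param:
        return [code.strip().lower() for code in param.split(",") if code.strip()]
    if header:
        return parse_accept_language(header)
    return []


def display_name(tags, languages):
    """Pick the name:* tag to output (assumed fallback: the plain name tag)."""
    for lang in languages:
        # Try the full code first ("en-us"), then the bare language ("en").
        for candidate in (lang, lang.split("-")[0]):
            value = tags.get("name:" + candidate)
            if value:
                return value
    return tags.get("name")


tags = {"name": "München", "name:en": "Munich", "name:ar": "ميونخ"}
print(display_name(tags, preferred_languages(None, "fr;q=0.9, en-US;q=0.8")))
print(display_name(tags, preferred_languages("ar", "en")))
```

Note that the language list only affects which name is returned, never which places match, which is exactly the limitation this ticket is about.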
Thanks @lonvia ! |
Language detection could be done via a simple hack I made some years ago or with other tools like those described here |
sorry, wrong ticket ;-) |
My comment here is relevant in this ticket as well. I'll hopefully have time in the next days to try that. |
I am creating a language detector specific to 'local' names like street names and POIs, i.e. the detector learns from OpenStreetMap data itself. Look here. It is already good for German names (4% of German names are not detected as German), but French and English detection, for example, is really bad (30% error). I'll see how I can improve this. Update 2020: there is a new tool, https://github.com/pemistahl/lingua |
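To make the idea of a detector that learns from name data concrete, here is a minimal sketch of a character-trigram naive Bayes classifier. It is not the detector mentioned above and not how lingua works; the class name `NameLanguageDetector` and the tiny training set are invented for illustration, and real training data would be name:* values extracted from OpenStreetMap.

```python
import math
from collections import Counter, defaultdict


def trigrams(name):
    """Character trigrams of a name, padded so word boundaries count too."""
    padded = "  " + name.casefold() + " "
    return [padded[i:i + 3] for i in range(len(padded) - 2)]


class NameLanguageDetector:
    def __init__(self):
        self.counts = defaultdict(Counter)  # language -> trigram counts
        self.totals = Counter()             # language -> number of trigrams
        self.vocab = set()                  # all trigrams seen in training

    def train(self, samples):
        """samples: iterable of (language, name) pairs."""
        for lang, name in samples:
            for gram in trigrams(name):
                self.counts[lang][gram] += 1
                self.totals[lang] += 1
                self.vocab.add(gram)

    def detect(self, name):
        """Return the language with the highest smoothed log-likelihood."""
        if not self.counts:
            return None
        size = len(self.vocab) + 1  # add-one smoothing, one slot for unseen

        def log_likelihood(lang):
            return sum(
                math.log((self.counts[lang][gram] + 1) / (self.totals[lang] + size))
                for gram in trigrams(name)
            )

        return max(self.counts, key=log_likelihood)


detector = NameLanguageDetector()
detector.train([
    ("de", "Hauptstraße"), ("de", "Bahnhofstraße"),
    ("de", "Schillerstraße"), ("de", "Am Marktplatz"),
    ("fr", "Rue de la Gare"), ("fr", "Rue de l'Église"),
    ("fr", "Avenue de la République"), ("fr", "Place du Marché"),
])
print(detector.detect("Goethestraße"), detector.detect("Rue de la Paix"))
```

With so little training data this only separates very typical names; the error rates quoted above suggest that short, ambiguous names are the hard part.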
One of the biggest problems still to resolve for full multilingual support is the storage size. The current Elasticsearch config lets the index size grow linearly with the number of languages (approx. 30 GB each). I hope the cross_field approach mentioned previously can help. |
@lonvia, running Photon on custom data from a Nominatim database does not support any languages other than en, fr, de and it. How can we get search results in languages such as Arabic? |
Photon should be able to do searches over all available language variants mapped in OSM (i.e. all name:* tags). Bonus points if it can improve search results by guessing the language of the query right and re-weight the results accordingly.
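As a rough illustration of the request, the following sketch matches a query against every name variant of a place and boosts matches in the guessed query language. It is a toy in-memory model, not Photon's implementation: the functions `name_variants`, `score` and `search`, the substring matching and the boost factor of 2.0 are all assumptions made for this example.

```python
def name_variants(tags):
    """Yield (language, value) for name and every name:* tag ('' = default)."""
    for key, value in tags.items():
        if key == "name":
            yield "", value
        elif key.startswith("name:"):
            yield key[len("name:"):], value


def score(tags, query, guessed_lang=None, boost=2.0):
    """Best score over all variants; matches in the guessed language count more."""
    needle = query.casefold()
    best = 0.0
    for lang, value in name_variants(tags):
        if needle in value.casefold():
            weight = boost if guessed_lang and lang == guessed_lang else 1.0
            best = max(best, weight)
    return best


def search(places, query, guessed_lang=None):
    """Return matching places, highest score first (ties keep input order)."""
    scored = [(score(tags, query, guessed_lang), tags) for tags in places]
    hits = [pair for pair in scored if pair[0] > 0]
    hits.sort(key=lambda pair: pair[0], reverse=True)
    return [tags for _, tags in hits]


places = [
    {"name": "Vienna"},
    {"name": "Wien", "name:en": "Vienna", "name:ar": "فيينا"},
]
print([p["name"] for p in search(places, "Vienna", guessed_lang="en")])
print([p["name"] for p in search(places, "فيينا")])
```

The query language is passed in as `guessed_lang`, so any detector can be plugged in; when no guess is available, all variants are weighted equally.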