Elasticsearch sort by text field multilanguage values inside

Elasticsearch sort by text field multilanguage values inside - sorting

I have documents in a few languages. For example Russian. When I sort with text field it is not sorting with Russian letters ordering. Maybe there are some analyzers which can help me? Thanks in advance.

Related

How to value exact match higher than term frequency in elasticsearch?

I have an index that has several title fields.
main_title,
sub_titles,
preferred_titles
etc.
These texts fields also have a suggest field each where I run a custom analyzer that uses edge-n-gram tokenizer so we can search as we type.
I would like to value exact match over term frequency. And exact match in main_title is worth more than exact match in preferred_titles.
Anyone know how I can achieve this? Thanks in advance.
I have tried a bool_query with multi_match_query in the must clause. The multi_match is crossfields with no fields attached with the operator 'and'.
I have both the text fields and the suggest fields in the should cluase. Each text field is in a match_query with a boost and the operator 'and'. Each suggest field is in a match_phrase_query with a boost and the operator 'and'. The issue is that several boosts are added on top of the scores and I end up with very inflated scores.

Elasicsearch- Indexing documents based on change of sequence of characters

I have a set of documents that has to be indexed in the following way:
1. fs$#56s as fs,$#,56,s
2. FSand-hgfh as fsand,-,hgfh
Is there a way to do so?
Thank you!

Check for specific words that is not exist in document

I want to search through my document to count the number of documents that contains specific word and other that does not have that specific word in kibana4.
can anyone help me about this?
Thanks

Use data table visualization
Use the filter agg type.
And give "NOT fieldName:missingValue" , to show the missing documents.
and "fieldName:value" , to show documents having the word.

Elasticsearch: the mapping for a document with multiple languages for the same field?

I have a document type which has a single data item called title. Here are some situations about it:
Title can have multiple languages for a single document. One document
may have just English title. Another document may have two (such as
English and Spanish) or more languages for title. Currently, there
are four languages (English, Spanish, French, and Chinese) for
documents, but there will be more languages in the future when the system grows.
Title is used for search. For a document with both English title and
Spanish title, it should be returned if an English search term is
found in the English title or a Spanish search term is found in the
Spanish title.
Elasticsearch should return ONE document in search results if a
search term is found in different language titles of the same
document.
There can be many updates to some documents. Example updates includes deleting a document, change one language title and leave other language titles intact, delete one language title and leave other language title intact.
What is the right mapping for this document type, with document indexing, changes, delete, search, and system growth in mind?
Any input is deeply appreciated!
Regards.

The mapping of the title should work with object type whose properties are included several language fields and each field can apply different analyzer according to its language.
During searching, you might specify the field to be title.en to quickly find the English title, or title.* to query all the title fields depends on what your need.

Oracle text search default Stop words for non english locales

Oracle documentation lists following default stop words.
http://docs.oracle.com/cd/B28359_01/text.111/b28304/astopsup.htm#CCREF1400
This includes stop words from all languages. But when i query my database to view list of stop words, it only shows English words.
SELECT * FROM ctx_stopwords;
It doesn't list non english stop words. I may be missing something here. I am looking for a query which returns all the default oracle stop words in all languages. Is there a way to achieve this?
Thanks!

Look for scripts in $ORACLE_HOME/ctx/admin/defaults/
drdefus.sql - american
drdefd.sql - german

Develop Reference

ruby bash windows laravel spring algorithm oracle macos go visual-studio

Elasticsearch sort by text field multilanguage values inside - sorting

I have documents in a few languages. For example Russian. When I sort with text field it is not sorting with Russian letters ordering. Maybe there are some analyzers which can help me? Thanks in advance.

Related

How to value exact match higher than term frequency in elasticsearch?

Elasicsearch- Indexing documents based on change of sequence of characters

Check for specific words that is not exist in document

Elasticsearch: the mapping for a document with multiple languages for the same field?

Oracle text search default Stop words for non english locales

Categories

Resources