On this page本页内容
MongoDB Atlas Search
Atlas Search makes it easy to build fast, relevance-based search capabilities on top of your MongoDB data. Atlas Search可以轻松地在MongoDB数据的基础上构建快速、基于相关性的搜索功能。Try it today on MongoDB Atlas, our fully managed database as a service.今天就在MongoDB Atlas上试试吧,这是我们全面管理的数据库即服务。
$text
¶$text
performs a text search on the content of the fields indexed with a text index. 对使用文本索引索引的字段的内容执行文本搜索。A $text
expression has the following syntax:$text
语法如下所示:
Changed in version 3.2.在版本3.2中更改。
The $text
operator accepts a text query document with the following fields:$text
运算符接受包含以下字段的文本查询文档:
$search |
string | OR search of the terms unless specified as a phrase. OR 搜索,除非指定为短语。 |
$language |
string |
|
$caseSensitive |
boolean |
|
$diacriticSensitive |
boolean |
|
The $text
operator, by default, does not return results sorted in terms of the results’ scores. For more information on sorting by the text search scores, see the Text Score documentation.
$text
expression.$text
表达式。$text
query can not appear in $nor
expressions.$text
query can not appear in $elemMatch
query expressions or $elemMatch
projection expressions.$text
query in an $or
expression, all clauses in the $or
array must be indexed.hint()
if the query includes a $text
query expression.$natural
sort order if the query includes a $text
expression.$text
expression, which requires a special text index, with a query operator that requires a different type of special index. For example you cannot combine $text
expression with the $near
operator.If using the $text
operator in aggregation, the following restrictions also apply.
$match
stage that includes a $text
must be the first stage in the pipeline.text
operator can only occur once in the stage.text
operator expression cannot appear in $or
or $not
expressions.$meta
aggregation expression in the $sort
stage.$search
Field¶In the $search
field, specify a string of words that the text
operator parses and uses to query the text index.
The text
operator treats most punctuation in the string as delimiters, except a hyphen-minus (-
) that negates term or an escaped double quotes \"
that specifies a phrase.
To match on a phrase, as opposed to individual terms, enclose the phrase in escaped double quotes (\"
), as in:
If the $search
string includes a phrase and individual terms, text search will only match the documents that include the phrase.
For example, passed a $search
string:
The $text
operator searches for the phrase "ssl certificate"
.
Prefixing a word with a hyphen-minus (-
) negates a word:
pre-market
, is not a negation. If used in a hyphenated word, $text
operator treats the hyphen-minus (-
) as a delimiter. To negate the word market
in this instance, include a space between pre
and -market
, i.e., pre -market
.The $text
operator adds all negations to the query with the logical AND
operator.
The $text
operator ignores language-specific stop words, such as the
and and
in English.
For case insensitive and diacritic insensitive text searches, the $text
operator matches on the complete stemmed word. So if a document field contains the word blueberry
, a search on the term blue
will not match. However, blueberry
or blueberries
will match.
For case sensitive search (i.e. $caseSensitive: true
), if the suffix stem contains uppercase letters, the $text
operator matches on the exact word.
For diacritic sensitive search (i.e. $diacriticSensitive: true
), if the suffix stem contains the diacritic mark or marks, the $text
operator matches on the exact word.
Changed in version 3.2.
The $text
operator defaults to the case insensitivity of the text index:
text
index are case insensitive for Latin characters without diacritic marks; i.e. for [A-z]
.$caseSensitive
Option¶To support case sensitive search where the text
index is case insensitive, specify $caseSensitive: true
.
When performing a case sensitive search ($caseSensitive: true
)
where the text
index is case insensitive, the $text
operator:
text
index for case insensitive and diacritic matches.$text
query operation includes an additional stage to filter out the documents that do not match the specified case.For case sensitive search (i.e. $caseSensitive: true
), if the suffix stem contains uppercase letters, the $text
operator matches on the exact word.
Specifying $caseSensitive: true
may impact performance.
See also参阅
Changed in version 3.2.
The $text
operator defaults to the diacritic insensitivity of the text index:
é
, ê
, and e
.text
index are diacritic sensitive.$diacriticSensitive
Option¶To support diacritic sensitive text search against the version 3 text
index, specify $diacriticSensitive: true
.
Text searches against earlier versions of the text
index are inherently diacritic sensitive and cannot be diacritic insensitive. As such, the $diacriticSensitive
option for the $text
operator has no effect with earlier versions of the text
index.
To perform a diacritic sensitive text search ($diacriticSensitive:
true
) against a version 3 text
index, the $text
operator:
text
index, which is diacritic insensitive.$text
query operation includes an additional stage to filter out the documents that do not match.Specifying $diacriticSensitive: true
may impact performance.
To perform a diacritic sensitive search against an earlier version of the text
index, the $text
operator searches the text
index which is diacritic sensitive.
For diacritic sensitive search, if the suffix stem contains the diacritic mark or marks, the $text
operator matches on the exact word.
See also参阅
The $text
operator assigns a score to each document that contains the search term in the indexed fields. The score represents the relevance of a document to a given text search query. The score can be part of a sort()
method specification as well as part of the projection expression. The { $meta: "textScore" }
expression provides information on the processing of the $text
operation. See $meta
projection operator for details on accessing the score for projection or sort.
The following examples assume a collection articles
that has a version 3 text index on the field subject
:
Populate the collection with the following documents:
The following query specifies a 以下查询指定了一个$search
string of coffee
:$search
字符串coffee
:
This query returns the documents that contain the term coffee
in the indexed subject
field, or more precisely, the stemmed version of the word:
See also参阅
If the search string is a space-delimited string, $text
operator performs a logical OR
search on each term and returns documents that contains any of the terms.
The following query specifies a $search
string of three terms delimited by space, "bake coffee cake"
:
This query returns documents that contain either bake
or coffee
or cake
in the indexed subject
field, or more precisely, the stemmed version of these words:
See also参阅
To match the exact phrase as a single term, escape the quotes.要将准确的短语作为一个术语进行匹配,请跳过引号。
The following query searches for the phrase coffee shop
:
This query returns documents that contain the phrase 此查询返回包含短语coffee shop
:coffee shop
的文档:
See also参阅
A negated term is a term that is prefixed by a minus sign -
. If you negate a term, the $text
operator will exclude the documents that contain those terms from the results.
The following example searches for documents that contain the words coffee
but do not contain the term shop
, or more precisely the stemmed version of the words:
The query returns the following documents:
See also参阅
Use the optional $language
field in the $text
expression to specify a language that determines the list of stop words and the rules for the stemmer and tokenizer for the search string.
If you specify a language value of "none"
, then the text search uses simple tokenization with no list of stop words and no stemming.
The following query specifies es
, i.e. Spanish, as the language that determines the tokenization, stemming, and stop words:
The query returns the following documents:
The $text
expression can also accept the language by name, spanish
. See Text Search Languages for the supported languages.
See also参阅
Changed in version 3.2.
The $text
operator defers to the case and diacritic insensitivity of the text
index. The version 3 text
index is diacritic insensitive and expands its case insensitivity to include the Cyrillic alphabet as well as characters with diacritics. For details, see text Index Case Insensitivity and text Index Diacritic Insensitivity.
The following query performs a case and diacritic insensitive text search for the terms сы́рники
or CAFÉS
:
Using the version 3 text
index, the query matches the following documents.
With the previous versions of the text
index, the query would not match any document.
See also参阅
Case Insensitivity, Diacritic Insensitivity, Stemmed Words, Text Indexes
Changed in version 3.2.
To enable case sensitive search, specify $caseSensitive: true
. Specifying $caseSensitive: true
may impact performance.
The following query performs a case sensitive search for the term Coffee
:
The search matches just the document:
The following query performs a case sensitive search for the phrase Café Con Leche
:
The search matches just the document:
A negated term is a term that is prefixed by a minus sign -
. If you negate a term, the $text
operator will exclude the documents that contain those terms from the results. You can also specify case sensitivity for negated terms.
The following example performs a case sensitive search for documents that contain the word Coffee
but do not contain the lower-case term shop
, or more precisely the stemmed version of the words:
The query matches the following document:
See also参阅
Changed in version 3.2.
To enable diacritic sensitive search against a version 3 text index, specify $diacriticSensitive: true
. Specifying $diacriticSensitive: true
may impact performance.
The following query performs a diacritic sensitive text search on the term CAFÉ
, or more precisely the stemmed version of the word:
The query only matches the following document:
The $diacriticSensitive
option applies also to negated terms. A negated term is a term that is prefixed by a minus sign -
. If you negate a term, the $text
operator will exclude the documents that contain those terms from the results.
The following query performs a diacritic sensitive text search for document that contains the term leches
but not the term cafés
, or more precisely the stemmed version of the words:
The query matches the following document:
The following query performs a text search for the term cake
and uses the $meta
operator in the projection document to append the relevance score to each matching document:
The returned document includes an additional field score
that contains the document’s relevance score:
See also参阅
{ $meta:
"textScore" }
expression in the sort()
without also specifying the expression in the projection. For example,
As a result, you can sort the resulting documents by their search relevance without projecting the textScore
.
{ $meta: "textScore" }
expression in the sort()
, you must also include the same expression in the projection.{ $meta:
"textScore" }
expression in both the projection and sort()
, the projection and sort documents can have different field names for the expression.
score
for the expression and the sort()
uses the field named ignoredName
.In previous versions of MongoDB, if { $meta: "textScore" }
is included in both the projection and sort, you must specify the same field name for the expression.
$meta
expression in both the projection document and the sort expression. The following query searches for the term coffee
and sorts the results by the descending score:
The query returns the matching documents sorted by descending score.
See also参阅
Use the limit()
method in conjunction with a sort()
to return the top n
matching documents.
The following query searches for the term coffee
and sorts the results by the descending score, limiting the results to the top two matching documents:
See also参阅
The following query searches for documents where the author
equals "xyz"
and the indexed field subject
contains the terms coffee
or bake
. The operation also specifies a sort order of ascending date
, then descending text search score:
See also参阅