Elasticsearch第10节基本查询

2019-06-15 本文已影响0人小超_8b2f

一、英文检索

1. term和terms查询

term query 会去 倒排索引中寻找 确切的term，它并 不知道分词器的存在（指对查询参数值进行分词）。这种查询适合keyword、numeric、date类型数据的查询

//
#term：查询某个字段里含有**某个**关键词的文档：
GET /myindex/_search
{
  "query":{
    "term":{"intrest":"xi"}
  }
}
#terms:查询某个字段里含有**多个**关键词的文档
GET /myindex/_search
{
  "query":{
    "terms":{"intrest":["xi", "huan"]} # contains anyone
  }
}
# 分页展示，并且显示版本号
GET /myindex/_search
{
  "from":0,
  "size":3,
  "version":true,      #显示版本号
  "query":{
    "terms":{
      "intrest":["xi", "huan"]
    } 
  }
}

2. match查询

match查询知道分词器的存在，会对查询条件的字段值先进行分词，然后再查询。

//
GET /myindex/_search
{
  "from":0,
  "size":3,
  "version":true,   
  "query":{
    "match":{
      "intrest":"basketball running"  #查询包含basketball 或 running的
    } 
  }
}

mach_all 查所有

GET /myindex/_search
{
  "from":0,
  "size":3,
  "version":true,   
  "query":{
    "match_all":{ } 
  }
}

# content 或 intrest字段中包含"tool"的就会被查到
GET /myindex/_search
{
  "query":{
    "multi_match": {
      "query": "tool",
      "fields": ["content","intrest"]
    } 
  }
}

#短语匹配查询
# 必须包含"is a new",顺序不能变
GET /myindex/_search
{
  "query":{
    "match_phrase": {
      "content": "is a new" 
    } 
  }
}

#前缀匹配查询（不区分大小写）
GET /myindex/_search
{
  "query":{
    "match_phrase_prefix": {
      "content": "Java" 
    } 
  }
}

# _source：限定查询结果的字段
GET /myindex/_search
{
  "_source": ["tile","post_date"], 
  "query":{
    "match_all":{ } 
  }
}


# _source includes excludes：限定查询结果的字段 包含谁和 不包含谁，可以用通配符
GET /myindex/_search
{
  "_source": {
    "includes": ["conte*"],
    "excludes": ["post_date"]
  }, 
  "query":{
    "match_all": {}
  }
}



# sort：排序 
GET /myindex/_search
{
  "query":{
    "match_all": {}
  }, "sort": [
    {
      "post_date": {
        "order": "desc"
      }
    }
  ]
}


# range 范围查询
GET /myindex/_search
{
  "query":{
    "range": {
      "post_date": {
        "gte": "2019-06-10",
        "lte": "2019-06-15",
        "include_lower":true,
        "include_upper":false
      }
    }
  }
}

wildcard:通配符查询

//
# wildcard:通配符查询 
GET /myindex/_search
{
  "query":{
    "wildcard": {
      "content": {
        "value": "like" # li?e   li*
      }
    }
  }
}

fuzzy实现模糊查询

value：查询的关键字
boost：查询的权值，默认是1
min_similarity：设置匹配的最小相似度，默认值是0.5，对于字符串，取[0-1]，对于数值，取值可能大于1，对于日期型，1d(1天) 1m （1分钟）1h （1小时）

prefix_length：指明区分词项的共同前缀长度，默认是0
max_expansions：查询中的词项可以扩展的数目，默认可以无限大

//
#fuzzy模糊查询，baksketball却个s:baketball，但是也查出来了
GET /myindex/_search
{
  "query":{
    "fuzzy": {
      "intrest": "baketball"
    }
  }
}

#fuzzy模糊查询第二种写法
GET /myindex/_search
{
  "query":{
    "fuzzy": {
      "intrest": {"value":"baketball"}
    }
  }
}

highlight ：高亮显示

//
#highlight ： 高亮显示
GET /myindex/_search
{
  "query":{
    "fuzzy": {
      "intrest": {"value":"baketball"}
    }
  },
  "highlight": {
    "fields": {"intrest":{}}
  }
}

二、中文检索

安装中文分词器插件：elasticsearch-analysis-ik（第5节倒排索引、分词器）
搜索方法的调用和上面的一样，只不过将搜索关键字换为中文。

#ik带有2个分词器
#ik_max_word : 会将文本做最细粒度的拆分；尽可能多的拆分出词语。
#ik_smart：会做最粗粒度的拆分；已被分出的词语将不会再次被其它词语占有。
//手动创建mapping
DELETE lib5
#手动创建mapping
PUT /lib5
{
  "settings": {
    "number_of_shards": 3,
    "number_of_replicas": 0
  },
  "mappings": {
    "properties": {
      "id":{"type" : "long"},
      "age" : {"type" : "integer"},
      "birthday" : {"type" : "date","index": false},  #不希望建立倒排索引
      "name" : {"type" : "text", "analyzer" : "ik_max_word" },
      "address" : {"type":"text","analyzer" : "ik_max_word"},
      "price" : {"type" : "double"},
    }
  }
}
GET /lib5/_mapping

Elasticsearch第10节基本查询

一、英文检索

1. term和terms查询

2. match查询

mach_all 查所有

wildcard:通配符查询

fuzzy实现模糊查询

highlight ：高亮显示

二、中文检索

猜你喜欢

热点阅读

Elasticsearch第10节 基本查询

一、英文检索

1. term和terms查询

2. match查询

mach_all 查所有

wildcard:通配符查询

fuzzy实现模糊查询

highlight ： 高亮显示

二、 中文检索

猜你喜欢

热点阅读

Elasticsearch第10节基本查询

highlight ：高亮显示

二、中文检索