Elasticserach Tips

2012 年 2 月 2 日

elasticsearch升级到7.x；改动不小，命令从头再捋一遍；

文档操作

增加一条记录

PUT /website/_doc/1
{
  "title": "My 2 blog entry",
  "text":  "I am starting to get the hang of this...",
  "date":  "2014/01/02"
}

修改

POST /website/_update/1
{
   "doc" : {
      "tags" : [ "testing..." ],
      "views": 0
   }
}

查询

GET /website/_search

GET /website/_source/1

GET /website/_mget 
{
    "ids" : [ "2", "1" ]    
}

GET /_search
{
    "query": YOUR_QUERY_HERE
}

删除

DELETE /website/_doc/1

文档功能API

获取映射信息

GET /website/_mapping

测试分析器

GET /website/_analyze
{
  "field": "tweet",
  "text": "Black-cats" 
}

多层级对象用扁平化的方法来存储，比如

{
  "gb": {
    "tweet": { 
      "properties": {
        "tweet":            { "type": "string" },
        "user": { 
          "type":             "object",
          "properties": {
            "id":           { "type": "string" },
            "gender":       { "type": "string" },
            "age":          { "type": "long"   },
            "name":   { 
              "type":         "object",
              "properties": {
                "full":     { "type": "string" },
                "first":    { "type": "string" },
                "last":     { "type": "string" }
              }
            }
          }
        }
      }
    }
  }
}

会被转换为如下内部对象:

{
    "tweet":            [elasticsearch, flexible, very],
    "user.id":          [@johnsmith],
    "user.gender":      [male],
    "user.age":         [26],
    "user.name.full":   [john, smith],
    "user.name.first":  [john],
    "user.name.last":   [smith]
}

内部对象数组会丢失一部分相关信息，我们需要用嵌套对象(nested object)来处理

查询语句的结构

一个查询语句的典型结构：

{
    QUERY_NAME: {
        ARGUMENT: VALUE,
        ARGUMENT: VALUE,...
    }
}

如果是针对某个字段，那么它的结构如下：

{
    QUERY_NAME: {
        FIELD_NAME: {
            ARGUMENT: VALUE,
            ARGUMENT: VALUE,...
        }
    }
}

一条复合语句

{
    "bool": {
        "must": { "match":   { "email": "business opportunity" }},
        "should": [
            { "match":       { "starred": true }},
            { "bool": {
                "must":      { "match": { "folder": "inbox" }},
                "must_not":  { "match": { "spam": true }}
            }}
        ],
        "minimum_should_match": 1
    }
}

排障

GET /website/_validate/query?explain
{
   "query": {
      "match" : {
         "text" : "really powerful"
      }
   }
}

结果排序

GET /website/_search
{
    "query" : {
        "bool" : {
            "filter" : { "term" : { "_id" : 1 }}
        }
    },
    "sort": { "date": { "order": "desc" }}
}

索引操作

增加

PUT /my_index
{
    "settings": { ... any settings ... },
    "mappings": {
        "type_one": { ... any mappings ... },
        "type_two": { ... any mappings ... },
        ...
    }
}

删除

DELETE /my_index
DELETE /index_one,index_two
DELETE /index_*
DELETE /_all

配置

number_of_shards

每个索引的主分片数，默认值是 5 。这个配置在索引创建后不能修改。

number_of_replicas

每个主分片的副本数，默认值是 1 。对于活动的索引库，这个配置可以随时修改。

重新索引

POST _reindex
{
  "source": {
    "index": "twitter"
  },
  "dest": {
    "index": "new_twitter"
  }
}

释放空间

POST /_all/_forcemerge?only_expunge_deletes=true

M	T	W	T	F	S	S
« Jan
						1
2	3	4	5	6	7	8
9	10	11	12	13	14	15
16	17	18	19	20	21	22
23	24	25	26	27	28	29
30

演道网

Elasticserach Tips

文档操作

增加一条记录

修改

查询

删除

文档功能API

获取映射信息

测试分析器

多层级对象用扁平化的方法来存储，比如

内部对象数组会丢失一部分相关信息，我们需要用嵌套对象(nested object)来处理

查询语句的结构

排障

结果排序

索引操作

增加

删除

配置

重新索引

释放空间

About The Author

shine

文档操作

增加一条记录

修改

查询

删除

文档功能API

获取映射信息

测试分析器

多层级对象用扁平化的方法来存储，比如

内部对象数组会丢失一部分相关信息，我们需要用嵌套对象(nested object)来处理

查询语句的结构

排障

结果排序

索引操作

增加

删除

配置

重新索引

释放空间

Related Posts

About The Author

shine