Making Postgres and Elasticsearch work together like it's 2021

ZomboDB

Last update: Jan 2, 2023

Related tags

Database elasticsearch postgres sql postgresql text-search

ZomboDB brings powerful text-search and analytics features to Postgres by using Elasticsearch as an index type. Its comprehensive query language and SQL functions enable new and creative ways to query your relational data.

From a technical perspective, ZomboDB is a 100% native Postgres extension that implements Postgres' Index Access Method API. As a native Postgres index type, ZomboDB allows you to CREATE INDEX ... USING zombodb on your existing Postgres tables. At that point, ZomboDB takes over and fully manages the remote Elasticsearch index and guarantees transactionally-correct text-search query results.

ZomboDB is fully compatible with all of Postgres' query plan types and most SQL commands such as CREATE INDEX, COPY, INSERT, UPDATE, DELETE, SELECT, ALTER, DROP, REINDEX, (auto)VACUUM, etc.

It doesn’t matter if you’re using an Elasticsearch cloud provider or managing your own cluster -- ZomboDB communicates with Elasticsearch via its RESTful APIs so you’re covered either way.

ZomboDB allows you to use the power and scalability of Elasticsearch directly from Postgres. You don’t have to manage transactions between Postgres and Elasticsearch, asynchronous indexing pipelines, complex reindexing processes, or multiple data-access code paths -- ZomboDB does it all for you.

Features

MVCC-correct text-search and aggregation results
Managed and queried via standard SQL
Works with current Elasticsearch releases (no plugins required)
Query using
- Elasticsearch's Query String Syntax via dsl.query_string()
- ZQL -- ZomboDB's custom query language
- Raw Elasticsearch QueryDSL JSON
- ZomboDB's type-safe query builder SQL syntax
- Any combination of the above, even in combination with standard SQL
Scoring and Highlighting Support
Support for all Elasticsearch aggregations
Automatic Elasticsearch Mapping Generation
- Ability to map custom domains
- Per-field custom mappings
- json/jsonb automatically mapped as dynamic nested objects
- Supports full set of Elasticsearch language analyzers
- Supports Elasticsearch's Similarity Module
Hot-Standby compatible
Support for indexing & searching PostGIS geometry and geography types
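
As a rough illustration of the query forms listed above, the statements below run the same search several ways. This is a sketch rather than an excerpt from the ZomboDB documentation: it assumes the products table and ZomboDB index from the Quick Overview section, and the builder signature dsl.term(field, value) is an assumption that should be checked against the installed version.

```sql
-- Assumes the products table and its ZomboDB index from the Quick Overview.

-- 1. ZQL, passed as a plain string to the ==> operator
SELECT * FROM products WHERE products ==> 'keywords:sports';

-- 2. Elasticsearch Query String Syntax via dsl.query_string()
SELECT * FROM products WHERE products ==> dsl.query_string('keywords:sports');

-- 3. Raw Elasticsearch QueryDSL JSON
SELECT * FROM products WHERE products ==> '{"term": {"keywords": "sports"}}';

-- 4. Query builder function (assumed signature: dsl.term(field, value))
SELECT * FROM products WHERE products ==> dsl.term('keywords', 'sports');

-- 5. Any of the above combined with ordinary SQL predicates
SELECT name, price
  FROM products
 WHERE products ==> 'keywords:sports'
   AND price < 2000
 ORDER BY price;
```

In the last statement the text-search condition and the price filter sit in the same WHERE clause, so the query reads like any other SQL query even though part of it is evaluated by Elasticsearch.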

Current Limitations

Only one ZomboDB index per table
ZomboDB indexes with predicates (i.e., partial indexes) are not supported
CREATE INDEX CONCURRENTLY is not supported

These limitations may be addressed in future versions of ZomboDB.

System Requirements

Product	Version
Postgres	10.x, 11.x, 12.x, 13.x
Elasticsearch	7.x

Sponsorship and Downloads

Please see https://github.com/sponsors/eeeebbbbrrrr for sponsorship details. Your sponsorship at any tier is greatly appreciated and helps keep ZomboDB moving forward.

Note that ZomboDB is only available in binary form for certain sponsor tiers.

When you become a sponsor at a tier that provides binary downloads, please request a download key from https://www.zombodb.com/services/. Please do the same if you sponsor a tier that provides access to ZomboDB's private Discord server.

Quick Overview

Note that this is just a quick overview. Please read the getting started tutorial for more details.

Create the extension:

CREATE EXTENSION zombodb;

Create a table:

CREATE TABLE products (
    id SERIAL8 NOT NULL PRIMARY KEY,
    name text NOT NULL,
    keywords varchar(64)[],
    short_summary text,
    long_description zdb.fulltext, 
    price bigint,
    inventory_count integer,
    discontinued boolean default false,
    availability_date date
);

-- insert some data
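
The placeholder above can be filled with a few rows. The rows below are illustrative sample data chosen so that the search terms used in this overview have something to match; they are an assumption, not part of the original overview.

```sql
-- Illustrative sample rows (hypothetical data) for the products table above.
INSERT INTO products
    (name, keywords, short_summary, long_description,
     price, inventory_count, discontinued, availability_date)
VALUES
    ('Baseball', '{baseball,sports,round}', 'It''s a baseball',
     'Throw it at a person with a big wooden stick and hope they don''t hit it',
     1249, 2, false, '2015-08-21'),
    ('Box', '{wooden,box,square}', 'Just an empty box made of wood',
     'A wooden container that will eventually rot away.',
     17000, 0, true, '2015-07-01'),
    ('Telephone', '{communication,primitive}', 'A device for long-distance communication',
     'Use this to call your friends and family.',
     1899, 200, false, '2015-08-11');
```

The id column is omitted because SERIAL8 assigns it automatically.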

Create a ZomboDB index:

CREATE INDEX idxproducts 
          ON products 
       USING zombodb ((products.*)) 
        WITH (url='http://localhost:9200/');

Query it:

SELECT * 
  FROM products 
 WHERE products ==> '(keywords:(sports OR box) OR long_description:"wooden away"~5) AND price:[1000 TO 20000]';
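
Scoring and counting can be sketched against the same table. zdb.score(ctid) is used the same way in one of the issue reports further down this page. The zdb.count(index, query) signature is an assumption about ZomboDB's aggregate functions and should be verified against the installed version.

```sql
-- zdb.score(ctid) returns the Elasticsearch score of each matching row;
-- it is only meaningful in a query that also uses the ==> operator.
SELECT zdb.score(ctid) AS score, id, name
  FROM products
 WHERE products ==> 'sports box'
 ORDER BY score DESC;

-- Assumed signature: zdb.count(index regclass, query zdbquery) RETURNS bigint
SELECT zdb.count('idxproducts', 'keywords:(sports OR box)');
```

Ordering by the score column puts the most relevant rows first, which is usually what a search interface wants.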

Contact Information

https://www.zombodb.com
Google Group: [email protected]
Twitter: @zombodb
via Github Issues and Pull Requests
https://www.zombodb.com/services/ or [email protected] for commercial support

History

The name is an homage to zombo.com and its long history of continuous self-affirmation.

ZomboDB was started in 2013 by Technology Concepts & Design, Inc. as a closed-source effort to provide transaction-safe text search on top of Postgres tables. While Postgres' "tsearch" features are useful, they're not necessarily adequate for tables that are 200 columns wide with 100M rows, each containing large text content.

Initially designed on top of Postgres' Foreign Data Wrapper API, ZomboDB quickly evolved into an index type so that queries are MVCC-safe and standard SQL can be used to query and manage indices.

Elasticsearch was chosen as the backing search index because of its horizontal scaling abilities, performance, and general ease of use.

ZomboDB was open-sourced in July 2015 and has since been used in numerous production systems of various sizes and complexity.

License

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

Comments

Investigate mapping some "PostGIS" data types to ES "geo shape" type

I'd like to see how difficult it might be to translate some of the basic PostGIS types into ES "geo shape" types, just for indexing purposes.

I'm not aiming for complete compatibility with PostGIS (at least not right now), but being able to index some of its data types in a way that can be queried using ZDB's direct JSON QueryDSL syntax would likely be useful.

enhancement (merged) v4.0

opened by eeeebbbbrrrr 92

Corruption in ES Index

So we were running the below query to verify some of the "health" of our index and noticed this:

db=# select *,case when zdb=estimate and zdb=pg then 'ok' else 'ERROR' end as my_stat from
 (select count(*) as zdb from gc.cv_data where zdb('gc.cv_data',cv_data.ctid)==>'pk_data_id: 1 /TO/ 100000')z
 ,(select zdb_estimate_count as estimate from zdb_estimate_count('gc.cv_data','pk_data_id: 1 /TO/ 100000'))e
 ,(select count(*) as pg from gc.cv_data WHERE pk_data_id BETWEEN 1 AND 100000)p
 ,(select zdb_estimate_count as null_estimate from zdb_estimate_count('gc.cv_data','pk_data_id:NULL'))ne;
  zdb   | estimate |  pg   | null_estimate | my_stat
--------+----------+-------+---------------+---------
 100021 |    99996 | 99996 |             0 | ERROR
(1 row)

Time: 241.582 ms

The zdb count is off!?!

We narrowed down one of the pk_data_id's:

db=# SELECT ctid, xmin, xmax, pk_data_id FROM gc.cv_data WHERE pk_data_id = 97302;
   ctid   |   xmin    |   xmax    | pk_data_id
----------+-----------+-----------+------------
 (8733,4) | 147604115 | 188561882 |      97302
(1 row)

Time: 0.356 ms


db=# SELECT ctid, xmin, xmax, pk_data_id FROM gc.cv_data WHERE zdb('gc.cv_data', ctid) ==> 'pk_data_id: 97302 /TO/ 97302';
     ctid     |   xmin    |   xmax    | pk_data_id
--------------+-----------+-----------+------------
 (8733,4)     | 147604115 | 188561882 |      97302
 (1322553,11) | 188570594 | 188570594 |    9706852
(2 rows)

Time: 15.467 ms
db=# SELECT ctid, xmin, xmax, pk_data_id FROM gc.cv_data WHERE zdb('gc.cv_data', ctid) ==> 'pk_data_id: 97302';
     ctid     |   xmin    |   xmax    | pk_data_id
--------------+-----------+-----------+------------
 (8733,4)     | 147604115 | 188561882 |      97302
 (1322553,11) | 188570594 | 188570594 |    9706852
(2 rows)

Time: 12.808 ms

Autovacuum/vacuum are up-to-date. What's going on here? What other info can we provide? This is BAD!

bug v3.1.15

opened by taspotts 79

WIP: ZomboDB v4.0 (Support Elasticsearch 2.4)

This PR is based on the work done by @pashinin in PR #140.

It's been upgraded to support ES 2.4.1 along with including all the changes in ZDB v3.1.

I need to fix a few junit/regression tests that are failing and spend some time analyzing the performance of ES 2.4 relative to 1.7.5.

opened by eeeebbbbrrrr 47

[question] how to use zdb.score with an index on a function

ZomboDB version: 3000.0.0-alpha3
Postgres version: 12.4
Elasticsearch version: 7.10.1

Problem Description:

When using zdb.score() with a zombodb index built on a function, it returns 0 for each record.

Table Schema/Index Definition:

To reproduce the problem:

drop table if exists test_zombodb cascade;
create table test_zombodb ( id integer not null primary key);

insert into test_zombodb (id) values (1);

drop type if exists test_zombodb_es cascade;
create type test_zombodb_es as (id integer, text text);

drop function if exists index(test_zombodb);

create function index(test_zombodb test_zombodb) returns test_zombodb_es as $$
  select row(test_zombodb.id, 'some text')::test_zombodb_es
$$ language sql immutable strict;

drop index if exists idx_es_test_zombodb;

create index idx_es_test_zombodb
  on test_zombodb using zombodb(index(test_zombodb))
  with (url='http://elasticsearch:9200/');

select zdb.score(ctid), index(test_zombodb) from test_zombodb where index(test_zombodb) ==> 'text'

Output from select zdb.index_mapping('index_name');:

{"105155.107344.117161.117170": {"mappings": {"properties": {"id": {"type": "integer"}, "text": {"type": "text", "copy_to": ["zdb_all"], "analyzer": "zdb_standard", "fielddata": true, "index_prefixes": {"max_chars": 5, "min_chars": 2}}, "zdb_all": {"type": "text", "analyzer": "zdb_all_analyzer"}, "zdb_cmax": {"type": "integer"}, "zdb_cmin": {"type": "integer"}, "zdb_ctid": {"type": "long"}, "zdb_xmax": {"type": "long"}, "zdb_xmin": {"type": "long"}, "zdb_aborted_xids": {"type": "long"}}, "date_detection": false, "dynamic_templates": [{"strings": {"mapping": {"type": "keyword", "copy_to": "zdb_all", "normalizer": "lowercase", "ignore_above": 10922}, "match_mapping_type": "string"}}, {"dates_times": {"mapping": {"type": "date", "format": "strict_date_optional_time||epoch_millis||HH:mm:ss.S||HH:mm:ss.SX||HH:mm:ss.SS||HH:mm:ss.SSX||HH:mm:ss.SSS||HH:mm:ss.SSSX||HH:mm:ss.SSSS||HH:mm:ss.SSSSX||HH:mm:ss.SSSSS||HH:mm:ss.SSSSSX||HH:mm:ss.SSSSSS||HH:mm:ss.SSSSSSX", "copy_to": "zdb_all"}, "match_mapping_type": "date"}}, {"objects": {"mapping": {"type": "nested", "include_in_parent": true}, "match_mapping_type": "object"}}], "numeric_detection": false}}}

bug (merged) v3000.0.0-alpha3

opened by mathroc 41

_all field misbehaves when used across index links

I have four tables, each with a ZDB index. I added "index links" and created a view for searching, as described in INDEX OPTIONS. Here is the situation. When I search like this:

select * from test_view where zdb ==> 'John'

Everything is fine: I get all records where John is present, regardless of index. But if I change the select to look like this:

select * from test_view where zdb ==> 'John Doe'

Then I get only records where John and Doe are present in the same index and not the records where John is in one index and Doe in another.

One way to solve this is to create a materialized view and build an index on it, but refreshing a materialized view of 2.5M records takes approximately 5 minutes, which is really bad.

So the question is: can I create a view where a ZDB query will search for records where the words can be in any of the linked indexes, without explicitly defining the fields to search?

ZomboDB 3.1.3, PostgreSQL 9.5.5, Elasticsearch 1.7.6.

bug v3.1.7 v3.1.11

opened by Real-Gecko 39

Returning large results is slow

ZomboDB version: zombodb_debian-buster_pg12-3000.0.0-alpha1_amd64 (self-built)
Postgres version: 12.4
Elasticsearch version: 7.7.1

Problem Description:

We'd like to know if there is a way to activate debug logging to know what's happening behind the scenes.

Our problem is that we're trying to run a long query and we don't know where most of the time is spent. We tried to execute the ES subqueries independently and they're fast so I don't think ES is the problem. Plus, the same query is running up to 80 times slower in zombo than in ES. After the ES queries are done, postgres runs but using only ~5% of the cpu.

We already tried zdb.log_level but it doesn't seem to change much.

bug (merged) v3000.0.0-alpha3

opened by Jasopaum 37

Using additional conditions in WHERE together with zombodb

ZomboDB version: v3000.0.11
Postgres version: 13.4
Elasticsearch version: 8.1.2

Problem Description: Hello again! I'm trying to use a zombodb index together with other btree indexes in one query. In EXPLAIN, I expect to see one of the two options below:

Bitmap Index Scan between zombodb and btree indexes;
Index Scan by zombodb index and then Filter instead of using btree index.

However, I see a completely different situation instead. When trying to search only the zombodb index:

EXPLAIN
SELECT * FROM test_tbl
  WHERE test_fts_idx_func(col_a) ==> 'foo bar';

Result:

Index Scan using test_fts_idx on "test_tbl"  (cost=0.00..529103525.16 rows=6151442944 width=364)
  Index Cond: ("col_a" ==> '{""query_string"":{""query"":""foo bar""}}'::zdbquery)
JIT:
  Functions: 4
  Options: Inlining true, Optimization true, Expressions true, Deforming true

Everything is great now. When trying to use zombodb and btree index simultaneously (built by column col_b):

EXPLAIN
SELECT * FROM test_tbl
  WHERE test_fts_idx_func(col_a) ==> 'foo bar'
    AND col_b = 'foo bar';

Result:

Gather  (cost=1000.71..623028.95 rows=304674 width=364)
  Workers Planned: 48
  ->  Parallel Index Scan using test_fymd_idx on "test_tbl"  (cost=0.71..591561.55 rows=6347 width=364)
        Index Cond: ("col_b" = 'foo bar'::text)
        Filter: ("col_a" ==> '{""query_string"":{""query"":""foo bar""}}'::zdbquery)
JIT:
  Functions: 6
  Options: Inlining true, Optimization true, Expressions true, Deforming true

The zombodb index is not used. The same happens with additional WHERE conditions on columns that are not covered by any other index:

EXPLAIN
SELECT * FROM test_tbl
  WHERE test_fts_idx_func(col_a) ==> 'foo bar'
    AND col_c = 5;

Result:

Gather  (cost=1000.00..505959721.55 rows=351657483 width=364)
  Workers Planned: 48
  ->  Parallel Seq Scan on "test_tbl"  (cost=0.00..470792973.25 rows=7326198 width=364)
        Filter: (("col_c" = 5) AND ("col_a" ==> '{""query_string"":{""query"":""foo bar""}}'::zdbquery))
JIT:
  Functions: 4
  Options: Inlining true, Optimization true, Expressions true, Deforming true

* test_fts_idx_func() here is used to index only one column according to the documentation.

As we can see, the zombodb index is not involved in the query at all. Actually, the question is, is it even possible to use zombodb and btree indices together? Or, at least, is it possible to use a zombodb index with an additional condition in WHERE? The documentation says that this seems to be possible. However, I cannot achieve these results.

bug (merged) v3000.0.12

opened by hyperion-cs 34

Intermittent index out of bounds error on write

ZomboDB version: v3000.0.8
Postgres version: 13.7
Elasticsearch version: 7.17.0

Problem Description: We have an intermittent error that occurs on write.

Error Message (if any):

error: code=Some(200), {"error":null,"errors":true,"items":[{"update":{"error":{"type":"illegal_argument_exception","reason":"failed to execute script","caused_by":{"type":"script_exception","reason":"runtime error","script_stack":["java.base/jdk.internal.util.Preconditions.outOfBounds(Preconditions.java:64)","java.base/jdk.internal.util.Preconditions.outOfBoundsCheckIndex(Preconditions.java:70)","java.base/jdk.internal.util.Preconditions.checkIndex(Preconditions.java:266)","java.base/java.util.Objects.checkIndex(Objects.java:359)","java.base/java.util.ArrayList.remove(ArrayList.java:504)","ctx._source.zdb_aborted_xids.remove(ctx._source.zdb_aborted_xids.indexOf(params.XID));","                                                                               ^---- HERE"],"script":"ctx._source.zdb_aborted_xids.remove(ctx._source.zdb_aborted_xids.indexOf(params.XID));","lang":"painless","position":{"offset":79,"start":0,"end":86},"caused_by":{"type":"index_out_of_bounds_exception","reason":"Index -1 out of bounds for length 2"}}}}}]}
[1]     at Parser.parseErrorMessage (/home/brent/projects/steel-ui/steelhead-deploy/gosteelhead/node_modules/

Table Schema/Index Definition: -- complex, not sure of a mwe.

CREATE FUNCTION steelhead.search_recipe_node_idx(steelhead.recipe_node) RETURNS steelhead.search_recipe_node_idx_type IMMUTABLE STRICT LANGUAGE sql AS $$
SELECT ROW (
           $1.id,
           $1.name,
           $1.description_markdown::text,
           $1.operator_input_json_schema::text,
           $1.derived_from,
           $1.treatment_id,
           'recipe node'
           )::steelhead.search_recipe_node_idx_type;
$$;

CREATE INDEX search_recipe_node_idx 
      ON steelhead.recipe_node 
      USING zombodb (steelhead.search_recipe_node_idx(recipe_node))
         WITH (url='http://elasticsearch:9200/');
CREATE TYPE steelhead.search_work_order_idx_type AS (
    id integer, 
    name text, 
    id_in_domain text,
    customer_name text,
    received_order_id integer,
    recipe_id integer,
    product_id integer,
    created_at timestamptz,
    creator_id integer,
    custom_inputs text,
    entity text
);  

CREATE FUNCTION steelhead.search_work_order_idx(steelhead.work_order) RETURNS steelhead.search_work_order_idx_type IMMUTABLE STRICT LANGUAGE sql AS $$
SELECT ROW (
           $1.id,
           $1.name,
           concat('WO',$1.id_in_domain::text),
           (select name from steelhead.customer where id = $1.customer_id),
           $1.received_order_id,
           $1.recipe_id,
           $1.product_id,
           $1.created_at,
           $1.creator_id,
            $1.custom_inputs,
           'work orders'
           )::steelhead.search_work_order_idx_type;
$$;

CREATE INDEX search_work_order_idx 
      ON steelhead.work_order 
      USING zombodb (steelhead.search_work_order_idx(work_order))
         WITH (url='http://elasticsearch:9200/');
GRANT EXECUTE ON FUNCTION steelhead.search_work_order_idx to steelhead_authed;

ALTER INDEX search_work_order_idx SET (options='parts_transfer:(id=<steelhead.parts_transfer.search_parts_transfer_idx>to_work_order_id),
        part_number:(parts_transfer.part_number_id=<steelhead.part_number.search_part_number_idx>id),
        creator:(creator_id=<steelhead.user.search_user_idx>id),
        received_order:(received_order_id=<steelhead.received_order.search_received_order_idx>id),
        ro_creator:(received_order.creator_id=<steelhead.user.search_user_idx>id),
        recipe:(recipe_id=<steelhead.recipe_node.search_recipe_node_idx>id),
        product:(product_id=<steelhead.product.search_product_idx>id)');

Output from select zdb.index_mapping('index_name');: here is the output from the two indices I would expect to be responsible.

{"122906.123633.124008.130171": {"mappings": {"properties": {"id": {"type": "integer"}, "name": {"type": "text", "copy_to": ["zdb_all"], "analyzer": "zdb_standard", "fielddata": true}, "entity": {"type": "text", "copy_to": ["zdb_all"], "analyzer": "zdb_standard", "fielddata": true}, "zdb_all": {"type": "text", "analyzer": "zdb_all_analyzer"}, "zdb_cmax": {"type": "integer"}, "zdb_cmin": {"type": "integer"}, "zdb_ctid": {"type": "long"}, "zdb_xmax": {"type": "long"}, "zdb_xmin": {"type": "long"}, "derived_from": {"type": "integer"}, "treatment_id": {"type": "integer"}, "zdb_aborted_xids": {"type": "long"}, "description_markdown": {"type": "text", "copy_to": ["zdb_all"], "analyzer": "zdb_standard", "fielddata": true}, "operator_input_json_schema": {"type": "text", "copy_to": ["zdb_all"], "analyzer": "zdb_standard", "fielddata": true}}, "date_detection": false, "dynamic_templates": [{"strings": {"mapping": {"type": "keyword", "copy_to": "zdb_all", "normalizer": "lowercase", "ignore_above": 10922}, "match_mapping_type": "string"}}, {"dates_times": {"mapping": {"type": "keyword", "fields": {"date": {"type": "date", "format": "strict_date_optional_time||epoch_millis||HH:mm:ss.S||HH:mm:ss.SX||HH:mm:ss.SS||HH:mm:ss.SSX||HH:mm:ss.SSS||HH:mm:ss.SSSX||HH:mm:ss.SSSS||HH:mm:ss.SSSSX||HH:mm:ss.SSSSS||HH:mm:ss.SSSSSX||HH:mm:ss.SSSSSS||HH:mm:ss.SSSSSSX"}}, "copy_to": "zdb_all"}, "match_mapping_type": "date"}}, {"objects": {"mapping": {"type": "nested", "include_in_parent": true}, "match_mapping_type": "object"}}], "numeric_detection": false}}}

{"122906.123633.124067.130198": {"mappings": {"properties": {"id": {"type": "integer"}, "name": {"type": "text", "copy_to": ["zdb_all"], "analyzer": "zdb_standard", "fielddata": true}, "entity": {"type": "text", "copy_to": ["zdb_all"], "analyzer": "zdb_standard", "fielddata": true}, "zdb_all": {"type": "text", "analyzer": "zdb_all_analyzer"}, "zdb_cmax": {"type": "integer"}, "zdb_cmin": {"type": "integer"}, "zdb_ctid": {"type": "long"}, "zdb_xmax": {"type": "long"}, "zdb_xmin": {"type": "long"}, "recipe_id": {"type": "integer"}, "created_at": {"type": "keyword", "fields": {"date": {"type": "date"}}, "copy_to": ["zdb_all"]}, "creator_id": {"type": "integer"}, "product_id": {"type": "integer"}, "id_in_domain": {"type": "text", "copy_to": ["zdb_all"], "analyzer": "zdb_standard", "fielddata": true}, "custom_inputs": {"type": "text", "copy_to": ["zdb_all"], "analyzer": "zdb_standard", "fielddata": true}, "customer_name": {"type": "text", "copy_to": ["zdb_all"], "analyzer": "zdb_standard", "fielddata": true}, "zdb_aborted_xids": {"type": "long"}, "received_order_id": {"type": "integer"}}, "date_detection": false, "dynamic_templates": [{"strings": {"mapping": {"type": "keyword", "copy_to": "zdb_all", "normalizer": "lowercase", "ignore_above": 10922}, "match_mapping_type": "string"}}, {"dates_times": {"mapping": {"type": "keyword", "fields": {"date": {"type": "date", "format": "strict_date_optional_time||epoch_millis||HH:mm:ss.S||HH:mm:ss.SX||HH:mm:ss.SS||HH:mm:ss.SSX||HH:mm:ss.SSS||HH:mm:ss.SSSX||HH:mm:ss.SSSS||HH:mm:ss.SSSSX||HH:mm:ss.SSSSS||HH:mm:ss.SSSSSX||HH:mm:ss.SSSSSS||HH:mm:ss.SSSSSSX"}}, "copy_to": "zdb_all"}, "match_mapping_type": "date"}}, {"objects": {"mapping": {"type": "nested", "include_in_parent": true}, "match_mapping_type": "object"}}], "numeric_detection": false}}}

bug (merged) v3000.1.1

opened by bhalonen 33

Elasticsearch 5 support

Now that the awesome Lucene 6 release is out, Elasticsearch 5 is being prepared. So parallel to working on Postgres 9.5, ZomboDB support should be started in order to allow using these great releases together to provide the fastest full-text search for Postgres-based applications.

opened by rleonhardt 30

#EXPAND with 2 tables

When a view contains two tables whose indexes are related through ES options, the #expand field is in the secondary table, and the initial criteria are on a field in the secondary table, the record set does not expand.

create schema test_expand;

create table test_expand.data(pk_data bigint, data_family_group bigint, data_first_name text, constraint idx_test_expand_data_pkey primary key (pk_data));

create table test_expand.var(pk_var bigint, var_family_group bigint, var_pets text, constraint idx_test_expand_var_pkey primary key (pk_var));

insert into test_expand.data(pk_data, data_family_group, data_first_name) values(1,1,'mark'); 
insert into test_expand.data(pk_data, data_family_group, data_first_name) values(2,1,'eric'); 
insert into test_expand.data(pk_data, data_family_group, data_first_name) values(3,NULL,'terry'); 


insert into test_expand.var(pk_var, var_family_group, var_pets) values(1,1,'dogs'); 
insert into test_expand.var(pk_var, var_family_group, var_pets) values(2,1,'cats'); 
insert into test_expand.var(pk_var, var_pets) values(3,'minions');

CREATE INDEX es_test_expand_var ON test_expand.var USING zombodb (zdb('test_expand.var'::regclass, ctid), zdb(var.*)) 
	WITH (url='http://###.##.##.##:####/', preference=_primary, shards='3', replicas='0');

CREATE INDEX es_test_expand_data ON test_expand.data USING zombodb (zdb('test_expand.data'::regclass, ctid), zdb(data.*)) 
	WITH (url='http://###.##.##.##:####/',options='pk_data = <var.es_test_expand_var>pk_var', preference=_primary, shards='3', replicas='0');
	
CREATE OR REPLACE VIEW test_expand.consolidated_record_view AS  SELECT data.pk_data
	,data.data_family_group
	,data.data_first_name
	,var.var_family_group
	,var.var_pets
    ,zdb('test_expand.data'::regclass, data.ctid) AS zdb
   FROM test_expand.data
     LEFT JOIN test_expand.var ON data.pk_data = var.pk_var;

SELECT * FROM test_expand.consolidated_record_view;
	 
SELECT * FROM test_expand.consolidated_record_view where zdb==>'( (#expand<data_family_group=<this.index>data_family_group>( ( data_first_name = "MARK" ) AND )) )';
SELECT * FROM test_expand.consolidated_record_view where zdb==>'( (#expand<var_family_group=<this.index>var_family_group>( ( data_first_name = "MARK" ) AND )) )';

SELECT * FROM test_expand.consolidated_record_view where zdb==>'( (#expand<data_family_group=<this.index>data_family_group>( ( var_pets = "DOGS" ) AND )) )';
SELECT * FROM test_expand.consolidated_record_view where zdb==>'( (#expand<var_family_group=<this.index>var_family_group>( ( var_pets = "DOGS" ) AND )) )';

RESULTS:

dev1_db=# SELECT * FROM test_expand.consolidated_record_view;
 pk_data | data_family_group | data_first_name | var_family_group | var_pets |  zdb
---------+-------------------+-----------------+------------------+----------+-------
       1 |                 1 | mark            |                1 | dogs     | (0,1)
       2 |                 1 | eric            |                1 | cats     | (0,2)
       3 |                   | terry           |                  | minions  | (0,3)
(3 rows)

dev1_db=# SELECT * FROM test_expand.consolidated_record_view where zdb==>'( (#expand<data_family_group=<this.index>data_family_group>( ( data_first_name = "MARK" ) AND )) )';
 pk_data | data_family_group | data_first_name | var_family_group | var_pets |  zdb
---------+-------------------+-----------------+------------------+----------+-------
       1 |                 1 | mark            |                1 | dogs     | (0,1)
       2 |                 1 | eric            |                1 | cats     | (0,2)
(2 rows)

dev1_db=# SELECT * FROM test_expand.consolidated_record_view where zdb==>'( (#expand<var_family_group=<this.index>var_family_group>( ( data_first_name = "MARK" ) AND )) )';
 pk_data | data_family_group | data_first_name | var_family_group | var_pets |  zdb
---------+-------------------+-----------------+------------------+----------+-------
       1 |                 1 | mark            |                1 | dogs     | (0,1)
       2 |                 1 | eric            |                1 | cats     | (0,2)
(2 rows)

dev1_db=# SELECT * FROM test_expand.consolidated_record_view where zdb==>'( (#expand<data_family_group=<this.index>data_family_group>( ( var_pets = "DOGS" ) AND )) )';
 pk_data | data_family_group | data_first_name | var_family_group | var_pets |  zdb
---------+-------------------+-----------------+------------------+----------+-------
       1 |                 1 | mark            |                1 | dogs     | (0,1)
       2 |                 1 | eric            |                1 | cats     | (0,2)
(2 rows)

dev1_db=# SELECT * FROM test_expand.consolidated_record_view where zdb==>'( (#expand<var_family_group=<this.index>var_family_group>( ( var_pets = "DOGS" ) AND )) )';
 pk_data | data_family_group | data_first_name | var_family_group | var_pets |  zdb
---------+-------------------+-----------------+------------------+----------+-------
       1 |                 1 | mark            |                1 | dogs     | (0,1)
(1 row)

bug v3.1.2

opened by MarkMatte 29

dsl subquery support

I have a problem with accurate and fast counting. I have a table that keeps track of who is online right now (basically just ids). This table is fairly small: hundreds to thousands of rows.

I also have a table with all of the metadata about all of the users. It is a large table, with millions of rows, and it has a zombodb index on it. When querying these tables individually, it is fine. I can apply a limit in ES and it is generally not a problem.

WHERE table ==> dsl.limit( 100 , dsl.bool(...) )

If I need to join these two tables, I can't apply a limit through Elasticsearch, so it returns everything in the index. On a table with 1M+ rows this is pretty slow, especially when users first trigger the query: there are no filters to pass through to ES, so it basically just returns everything.

SELECT ou.* FROM online_users ou INNER JOIN user_attributes ua on ou.id = ua.user_id WHERE ua ==> '{"bool": {"must": []}}'

What I tried to do was pass IDs of the smaller table as subquery

SELECT ou.* FROM online_users ou INNER JOIN user_attributes ua on ou.id = ua.user_id and ou.organization_id = $1 WHERE ua ==> dsl.bool( dsl.must( dsl.terms( 'user_id' , (SELECT id FROM online_users where organization_id = $1 ) ) ) )

but that doesn't work.

documentation

opened by esatterwhite 26

Function zdb.tally ignores 'stem' parameter for date type

ZomboDB version: 3000.1.7
Postgres version: 14
Elasticsearch version: 7.17.0

Problem Description: Version 3000.1.7 appears to return zdb.tally terms for date fields in a raw format that can simply be cast to date. However, the 'stem' parameter does not appear to work as expected.

Error Message (if any): N/A

Table Schema/Index Definition:

CREATE TABLE termlist_issue (pkey serial8, date_combined date);
CREATE INDEX idxtermlistissue ON termlist_issue USING zombodb ((termlist_issue.*)) WITH (url='http://172.20.40.142:8082/');

INSERT INTO termlist_issue (date_combined) VALUES 
  ('2020-05-10'),('2021-08-01'),('2022-03-13'),('1999-12-31'),('1976-07-04');

Reproduce steps:

SELECT * FROM termlist_issue;
 pkey | date_combined
------+---------------
    1 | 2020-05-10
    2 | 2021-08-01
    3 | 2022-03-13
    4 | 1999-12-31
    5 | 1976-07-04
(5 rows)

-- Correct output, stem is '^.*'
SELECT term::date as term, count, term AS exact_term
  FROM zdb.tally('termlist_issue'::regclass, 'date_combined', 'FALSE', '^.*', ''::zdbquery, 5000, 'term'::termsorderby); 

    term    | count |        exact_term
------------+-------+--------------------------
 1976-07-04 |     1 | 1976-07-04T00:00:00.000Z
 1999-12-31 |     1 | 1999-12-31T00:00:00.000Z
 2020-05-10 |     1 | 2020-05-10T00:00:00.000Z
 2021-08-01 |     1 | 2021-08-01T00:00:00.000Z
 2022-03-13 |     1 | 2022-03-13T00:00:00.000Z
(5 rows)

-- Stem is '^1.*' - expecting two rows
SELECT term::date as term, count, term AS exact_term
 FROM zdb.tally('termlist_issue'::regclass, 'date_combined', 'FALSE', '^1.*', ''::zdbquery, 5000, 'term'::termsorderby);  

    term    | count |        exact_term
------------+-------+--------------------------
 1976-07-04 |     1 | 1976-07-04T00:00:00.000Z
 1999-12-31 |     1 | 1999-12-31T00:00:00.000Z
 2020-05-10 |     1 | 2020-05-10T00:00:00.000Z
 2021-08-01 |     1 | 2021-08-01T00:00:00.000Z
 2022-03-13 |     1 | 2022-03-13T00:00:00.000Z
(5 rows)

-- Attempt to use `.date` subfield just in case
SELECT term::date as term, count, term AS exact_term
 FROM zdb.tally('termlist_issue'::regclass, 'date_combined.date', 'FALSE', '^1.*', ''::zdbquery, 5000, 'term'::termsorderby);

    term    | count |        exact_term
------------+-------+--------------------------
 1976-07-04 |     1 | 1976-07-04T00:00:00.000Z
 1999-12-31 |     1 | 1999-12-31T00:00:00.000Z
 2020-05-10 |     1 | 2020-05-10T00:00:00.000Z
 2021-08-01 |     1 | 2021-08-01T00:00:00.000Z
 2022-03-13 |     1 | 2022-03-13T00:00:00.000Z
(5 rows)
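
For reference, the result the reporter expects can be reproduced outside the database. This is a minimal Python sketch, not ZomboDB's implementation: it assumes the stem is meant to act as a regular expression over the exact_term strings shown above. The helper name filter_terms is hypothetical.

```python
import re

# exact_term values copied from the query output above
TERMS = [
    "1976-07-04T00:00:00.000Z",
    "1999-12-31T00:00:00.000Z",
    "2020-05-10T00:00:00.000Z",
    "2021-08-01T00:00:00.000Z",
    "2022-03-13T00:00:00.000Z",
]


def filter_terms(terms, stem):
    """Keep only the terms that the stem regex matches in full (hypothetical helper)."""
    pattern = re.compile(stem)
    return [t for t in terms if pattern.fullmatch(t)]


if __name__ == "__main__":
    for term in filter_terms(TERMS, "^1.*"):
        print(term)
```

Under that assumption the stem '^1.*' keeps only the 1976 and 1999 terms, which is the two-row result the report expects.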

opened by Shigoki 0

Nested Proximity Search Hit Highlighting Incorrect

ZomboDB version: 3000.1.7 Postgres version: 14 Elasticsearch version: 7.17.0

Problem Description: Searches using nested proximity clauses, e.g. '(foo w/7 (bar w/7 gumby))', retrieve rows correctly.

The function zdb.highlight_document() does not appear to highlight terms according to the nested grouping boundaries.

Error Message (if any): N/A

Table Schema/Index Definition:

CREATE TABLE highlight_issue AS SELECT 'Vallejo has produced film posters for numerous fantasy and action movies, including Knightriders (1981), Q (1982), and Barbarian Queen (1985). He has also illustrated posters for comedies, notably National Lampoons Vacation (1983), European Vacation (1985), Nothing But Trouble (1991) and Aqua Teen Hunger Force Colon Movie Film for Theaters (2007), co-created with Bell.[8]
He created the 1978 Tarzan calendar.[citation needed] His sea serpent paintings hang in the queue of Loch Ness Monster, a rollercoaster at Busch Gardens Williamsburg.' AS t;

CREATE INDEX idxhighlightissue ON highlight_issue USING zombodb ((highlight_issue.*)) WITH (url='http://<es url>/');

Recreate:

SELECT * FROM highlight_issue WHERE highlight_issue ==> 't: ( "film" w/7 ("movies" w/7 "barbarian"))';
t
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
Vallejo has produced film posters for numerous fantasy and action movies, including Knightriders (1981), Q (1982), and Barbarian Queen (1985). He has also illustrated posters for comedies, notably National Lampoons Vacation (1983), European Vacation (1985), Nothing But Trouble (1991) and Aqua Teen Hunger Force Colon Movie Film for Theaters (2007), co-created with Bell.[8]+
He created the 1978 Tarzan calendar.[citation needed] His sea serpent paintings hang in the queue of Loch Ness Monster, a rollercoaster at Busch Gardens Williamsburg.
(1 row)

-- The term "film" is not within 7 of "barbarian" but should highlight as part of nested group
WITH highlights AS MATERIALIZED (SELECT (
     zdb.highlight_document('highlight_issue'::regclass, json_build_object('t',t), 't: ( "film" w/7 ("movies" w/7 "barbarian"))'::TEXT)).* FROM highlight_issue)
SELECT * FROM highlights;

field_name | array_index |  term  |    type    | position | start_offset | end_offset |               query_clause
------------+-------------+--------+------------+----------+--------------+------------+-------------------------------------------
t          |           0 | film   | <ALPHANUM> |        4 |           21 |         25 | t:("film" W/7 ("movies" W/7 "barbarian"))
t          |           0 | movies | <ALPHANUM> |       11 |           66 |         72 | t:("film" W/7 ("movies" W/7 "barbarian"))
(2 rows)

-- Adjusted so that term "film" is within 25 of nested group and all three terms are highlighted
WITH highlights AS MATERIALIZED (SELECT (
     zdb.highlight_document('highlight_issue'::regclass, json_build_object('t',t), 't: ( "film" w/25 ("movies" w/7 "barbarian"))'::TEXT)).* FROM highlight_issue)
SELECT * FROM highlights;

field_name | array_index |   term    |    type    | position | start_offset | end_offset |                query_clause
------------+-------------+-----------+------------+----------+--------------+------------+--------------------------------------------
t          |           0 | film      | <ALPHANUM> |        4 |           21 |         25 | t:("film" W/25 ("movies" W/7 "barbarian"))
t          |           0 | movies    | <ALPHANUM> |       11 |           66 |         72 | t:("film" W/25 ("movies" W/7 "barbarian"))
t          |           0 | barbarian | <ALPHANUM> |       18 |          119 |        128 | t:("film" W/25 ("movies" W/7 "barbarian"))
(3 rows)
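
The grouping behaviour the reporter expects can be sketched in a few lines of Python. This is only an illustration built on the token positions from the output above. The definition of "within N" (positions differ by at most N) and the helper names are assumptions, not ZomboDB's actual matching code.

```python
# Token positions taken from the zdb.highlight_document() output above
POSITIONS = {"film": 4, "movies": 11, "barbarian": 18}


def within(a, b, distance):
    """Assumed meaning of `a w/N b`: the two positions differ by at most N."""
    return abs(a - b) <= distance


def nested_match(outer, outer_distance, inner_a, inner_b, inner_distance,
                 positions=POSITIONS):
    """Evaluate `outer w/X (inner_a w/Y inner_b)` and return the terms to highlight.

    The inner group must match first. The outer term then only has to be
    within reach of any member of the group, not of every member.
    """
    a, b = positions[inner_a], positions[inner_b]
    if not within(a, b, inner_distance):
        return []
    o = positions[outer]
    if not any(within(o, p, outer_distance) for p in (a, b)):
        return []
    return [outer, inner_a, inner_b]


if __name__ == "__main__":
    print(nested_match("film", 7, "movies", "barbarian", 7))
```

With these positions, "film" is 7 away from "movies" and 14 away from "barbarian", so under group semantics all three terms would be highlighted for the w/7 query, not just two.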

opened by Shigoki 0

support dense-vector

It would be nice to support storage and query of postgresql vectors as dense-vectors in ES.

Dense vector is very useful in many applications like image search, machine-learning, or recommendation systems.

Currently, PostgreSQL can store floating-point vectors, but it cannot perform a kNN query based on dot_product, l2_norm, or cosine distances.

opened by nick008a 0
Implement a method to estimate the number of rows in the index
ZomboDB version: 3000.0.12 Postgres version: 14.x Elasticsearch version: 8.3

This issue has already been mentioned in a neighboring issue (by my mistake). Again:

So, I need an extremely fast mechanism (much faster than the zdb.count(...) function) to estimate whether a query in ZDB will find more than 10'000 rows or not (don't be scared of this constant; it's inherent in ES, and I'll explain it next). Directly in ES this is quite simple: you send a _search query with size equal to 0. Like this:

curl -X GET "localhost:9200/16881.2200.527888473.530970621/_search?pretty" -H 'Content-Type: application/json' -d'
{
  "size": 0,
  "query": {
    "match": {
      "some_field": {
        "query": "el paso 5",
        "fuzzy_transpositions": false,
        "auto_generate_synonyms_phrase_query": false,
        "operator": "and"
      }
    }
  }
}
'

Example response (insignificant data removed):

{ "hits" : { "total" : { "value" : 2653 } } }

In short, ES in this case returns either the exact count in hits.total.value or a constant of 10'000 if the number of rows found is >= 10'000. Basically, it's like _count (aka zdb.count(...)), only the counter stops when it reaches 10'000 rows. The 10'000 constant is actually the default value of the track_total_hits parameter (described in the Elasticsearch documentation). Thus, the main difference from _count is that this method of estimating the number of rows returns quickly no matter how many rows are stored.
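
The response handling described above can be sketched in Python. This is a minimal illustration rather than anything ZomboDB does today: the function name interpret_total is hypothetical, and treating a missing "relation" key as an exact count is an assumption made only because the example response above had that data removed.

```python
import json


def interpret_total(response_text):
    """Return (value, is_exact) from an Elasticsearch _search response body."""
    total = json.loads(response_text)["hits"]["total"]
    # Elasticsearch 7+ reports "relation": "eq" for exact counts and "gte"
    # when counting stopped at the track_total_hits threshold.
    # Assumption: a missing "relation" key is treated as exact.
    relation = total.get("relation", "eq")
    return total["value"], relation == "eq"


if __name__ == "__main__":
    print(interpret_total('{ "hits" : { "total" : { "value" : 2653 } } }'))
    print(interpret_total('{"hits": {"total": {"value": 10000, "relation": "gte"}}}'))
```

A caller that only needs to know "more than 10'000 or not" can check the second element of the tuple instead of comparing the value itself.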

At the same time, if I try to make a query in ZDB with size equal to 0, we come across this behavior:

Some(limit) if limit == 0 => {
    // with a limit of zero, we can avoid going to Elasticsearch at all
    // and just return a (mostly) None'd response
    return Ok(ElasticsearchSearchResponse {
        elasticsearch: None,
        limit: Some(0),
        offset: None,
        track_scores,
        should_sort_hits,
        scroll_id: None,
        shards: None,
        hits: None,
        fast_terms: None,
    });
}

Consequently, the problem:

How can I query ES via ZDB with size 0 ?

How do I get the value of hits.total.value from the request result?

How to manage track_total_hits parameter (to change value 10'000 to any other)? I couldn't find anything about it in ZDB.

By the way, this whole problem can be solved as follows: perhaps a function should be added to aggregate functions that returns the estimated number of rows in the request (according to the method I described) ?

It seems to me that it would be logical and useful for many to have a special function for this. Especially for those who work with a very large collection of data. The definition of the function could be this:

FUNCTION zdb.estimate_count(
    index regclass,
    track_total_hits integer DEFAULT 10000,
    query zdbquery
) RETURNS integer
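
To make the proposal concrete, here is a hedged Python sketch of the request body such a function might send. zdb.estimate_count() does not exist in ZomboDB, and the Python name estimate_count_request is invented for illustration. The query argument comes first only because Python requires defaulted parameters to come last.

```python
import json


def estimate_count_request(query, track_total_hits=10000):
    """Build the _search body a hypothetical zdb.estimate_count() could send."""
    return json.dumps({
        # size 0: return no documents, only the hit total
        "size": 0,
        # stop counting once this many matching rows have been seen
        "track_total_hits": track_total_hits,
        "query": query,
    })


if __name__ == "__main__":
    print(estimate_count_request({"match": {"some_field": "el paso 5"}}))
```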
enhancement
opened by hyperion-cs 2
Multi table search in zombo

Zombo used to support multi table search https://github.com/zombodb/zombodb/pull/70

How would you recommend going about doing something similar in the current version?

Thanks.
enhancement

opened by bhalonen 4

Releases(v3000.1.8)

v3000.1.8(Dec 24, 2022)

This is ZomboDB v3000.1.8. It fixes a crashing bug in the find_zdb_index function that seemingly "only" shows up on this Intel CPU:

processor       : 0
vendor_id       : GenuineIntel
cpu family      : 6
model           : 63
model name      : Intel(R) Xeon(R) CPU E5-2670 v3 @ 2.30GHz
stepping        : 2
microcode       : 0x49
cpu MHz         : 2294.686
cache size      : 30720 KB
physical id     : 0
siblings        : 1
core id         : 0
cpu cores       : 1
apicid          : 0
initial apicid  : 0
fpu             : yes
fpu_exception   : yes
cpuid level     : 15
wp              : yes
flags           : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon nopl xtopology tsc_reliable nonstop_tsc cpuid pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm invpcid_single pti ssbd ibrs ibpb stibp fsgsbase tsc_adjust bmi1 avx2 smep bmi2 invpcid xsaveopt arat md_clear flush_l1d arch_capabilities
bugs            : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit
bogomips        : 4589.37
clflush size    : 64
cache_alignment : 64
address sizes   : 45 bits physical, 48 bits virtual

The bug is that ZDB was passing a null pointer off to libc's strlen(), which clearly is bad, not intentional, and undefined behavior. If you happen to see a crash with a backtrace similar to:

Program terminated with signal SIGSEGV, Segmentation fault.
#0  __strlen_avx2 () at ../sysdeps/x86_64/multiarch/strlen-avx2.S:65
65      ../sysdeps/x86_64/multiarch/strlen-avx2.S: No such file or directory.
(gdb) bt
#0  __strlen_avx2 () at ../sysdeps/x86_64/multiarch/strlen-avx2.S:65
#1  0x00007f60588c1106 in ?? () from /usr/lib/postgresql/14/lib/zombodb.so
#2  0x00007f60586b9e42 in ?? () from /usr/lib/postgresql/14/lib/zombodb.so
#3  0x00007f605893ba08 in determine_index_wrapper () from /usr/lib/postgresql/14/lib/zombodb.so

Please upgrade.

Please upgrade anyways and Happy Holidays!

What's Changed

TargetEntry.resname is documented as maybe being null. So we need to account for this by @eeeebbbbrrrr in https://github.com/zombodb/zombodb/pull/794

Full Changelog: https://github.com/zombodb/zombodb/compare/v3000.1.7...v3000.1.8


v3000.1.7(Dec 3, 2022)
This is ZomboDB v3000.1.7, now with Postgres 15 support!

Important Bugs Fixed

The past few versions (v3000.1.4-v3000.1.6) had a serious bug where indexing a table with date or time columns could cause Postgres to crash. This was fixed by upgrading ZomboDB to the latest pgx version (0.6.0).

These versions have been removed from the customer portal at www.zombodb.com.

Bugs Fixed

Issue #790: Proximity queries, while being parsed correctly, were not building the correct Elasticsearch QueryDSL

Other Things

ZomboDB now supports Postgres 15

ZomboDB no longer supports Postgres 10

Paid sponsors at $125/mo or more get access to prebuilt binaries for these operating systems:

Alpine Linux 3.12

Amazon Linux2

Ubuntu Bionic

Ubuntu Focal

Ubuntu Jammy

Ubuntu Kinetic

Debian Bookworm

Debian Bullseye

Debian Buster

Debian Sid

Fedora 35

Fedora 36

Fedora 37

v3000.1.6(Nov 16, 2022)

This is ZomboDB v3000.1.6.

It changes the offset values returned from zdb.highlight_document() to be counted in UTF-16 code units instead of UTF-8. This makes the start/end_offset values compatible with a Java char[] of the original text. Sometimes in software development there's no right answer, so picking the answer that pays your bills is the most prudent.
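
The difference only shows up for text containing non-ASCII characters. The following Python sketch illustrates it under the assumption that the earlier offsets were UTF-8 byte counts. The helper names and the sample text are made up for illustration.

```python
def utf8_offset(text, char_index):
    """Number of UTF-8 bytes that precede the character at char_index."""
    return len(text[:char_index].encode("utf-8"))


def utf16_offset(text, char_index):
    """Number of UTF-16 code units that precede the character at char_index."""
    return len(text[:char_index].encode("utf-16-le")) // 2


# Sample text with two accented letters and one emoji outside the BMP
text = "na\u00efve caf\u00e9 \U0001F600 film"
start = text.index("film")  # index in Unicode code points

if __name__ == "__main__":
    print("code points:", start)
    print("UTF-8 bytes:", utf8_offset(text, start))
    print("UTF-16 units:", utf16_offset(text, start))
```

For pure ASCII text all three numbers are identical, which is why the change is invisible for many documents.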
v3000.1.5(Oct 21, 2022)

This is ZomboDB v3000.1.5. While there are no code changes since v3000.1.4, upstream dependencies have been upgraded, which fixes a bug where v3000.1.4 could silently convert ARRAY[0, 1, 2, 3] into ARRAY[NULL, 1, 2, 3] and ARRAY[false, true] into ARRAY[NULL, true].

This silent data conversion could have potentially happened during CREATE INDEX or INSERT/UPDATE statements.

It's fairly important that if you're on v3000.1.4, you upgrade immediately.

Full Changelog: https://github.com/zombodb/zombodb/compare/v3000.1.4...v3000.1.5
v3000.1.4(Oct 17, 2022)
This is ZomboDB v3000.1.4. It is a bugfix, feature, and performance update.

Bugs Fixed

Issue #759: ZomboDB now properly handles DELETEs to new rows that occur in the transaction before Elasticsearch has had a chance to accept the original INSERT/UPDATE.

Issue #770: The Elasticsearch date_format options used for time with(out) time zone types were incomplete. This adjusts ZomboDB's mapping for these types. As such, you'll need to manually execute SELECT zdb.reapply_mapping('index_name'::regclass); for every USING zombodb index. This operation is fairly fast and doesn't necessitate a reindex.

Issue #771: When using shadow indexes (CREATE INDEX ... WITH (shadow=true)) you no longer need to either specify the url= option and/or have zdb.default_elasticsearch_url set in postgresql.conf.
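
For Issue #770 above, the per-index statements can be generated rather than typed by hand. This is a small Python sketch that only builds the SQL text. The helper name is hypothetical and the index names are borrowed from the issue reports earlier on this page; zdb.reapply_mapping() is the only part taken from the release notes.

```python
def reapply_mapping_statements(index_names):
    """Return one SELECT zdb.reapply_mapping(...) statement per index name."""
    statements = []
    for name in index_names:
        # Escape the name as an SQL string literal before casting to regclass
        literal = "'" + name.replace("'", "''") + "'"
        statements.append(f"SELECT zdb.reapply_mapping({literal}::regclass);")
    return statements


if __name__ == "__main__":
    for stmt in reapply_mapping_statements(["idxtermlistissue", "idxhighlightissue"]):
        print(stmt)
```

The printed statements can then be run in each database that has USING zombodb indexes.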

Performance Enhancements

Issue #766: The zdb.highlight_document() function has been significantly optimized, allowing it to highlight even very large documents (tens of megabytes) significantly faster than before. Small documents will also see benefits.

Issue #757: ZomboDB is now smarter about when to pass the track_scores option to Elasticsearch search queries. When it can elide the option, search results return much faster.

New Features

implement single-arg zdb.highlight_all_fields(ctid) to fetch all highlights by @brncsk in https://github.com/zombodb/zombodb/pull/774

New Contributors

@brncsk made their first contribution in https://github.com/zombodb/zombodb/pull/776

Full Changelog: https://github.com/zombodb/zombodb/compare/v3000.1.3...v3000.1.4

Sponsor ZomboDB

If you'd like pre-built ZomboDB binaries, please consider sponsoring for $75/mo
v3000.1.3(Jul 20, 2022)
This is ZomboDB v3000.1.3. It's a minor bugfix release that fixes an issue related to hit highlighting.

Issues Fixed

Issue #754 - when using a "phrase query" to highlight text indexed by a tokenizer that produces ngrams (such as the zdb.fulltext_with_shingles tokenizer), ZDB would cause a stack overflow

Full Changelog: https://github.com/zombodb/zombodb/compare/v3000.1.2...v3000.1.3
v3000.1.2(Jul 5, 2022)

🚨This is ZomboDB v3000.1.2. It's a fairly important update as it fixes two different bugs in prior version upgrade scripts.🚨

If you currently have a version prior to v3000.1.0 installed, you should upgrade directly to this version, not to an intermediate version. Upgrading to an intermediate version could leave your database in an indeterminate state and delete all existing ZomboDB indices.

Full Changelog: https://github.com/zombodb/zombodb/compare/v3000.1.1...v3000.1.2
v3000.1.1(Jun 29, 2022)
This is ZomboDB v3000.1.1. It fixes an intermittent issue that can cause otherwise successful transactions to abort with an IndexOutOfBoundsException from Elasticsearch when Postgres savepoints/subtransactions are in use.

What's Changed

fix issue #750 by @eeeebbbbrrrr in https://github.com/zombodb/zombodb/pull/752

Full Changelog: https://github.com/zombodb/zombodb/compare/v3000.1.0...v3000.1.1

For prebuilt binaries, please consider sponsoring for only $75/mo.
v3000.1.0(Jun 17, 2022)
This is ZDB v3000.1.0. It's a bump in the minor release as the zdb.highlight_document() function signature has been overloaded to accept either json or jsonb, and this might cause user SQL to break if the argument type is ambiguous.

What's Changed

Highlight big docs by @eeeebbbbrrrr in https://github.com/zombodb/zombodb/pull/746
ZDB can now highlight "big documents". If the document length is over 100MB, ZDB will do simple unicode word segmentation directly, rather than ask Elasticsearch to analyze the text. Note that Postgres' jsonb type has a limit of about 200MB for a single property value, so if you do have a large document, you should use the json type instead.

Issue 747 by @eeeebbbbrrrr in https://github.com/zombodb/zombodb/pull/748
This resolves some problems with queries using joined fields when used by some aggregate functions, such as zdb.tally().

proximity 'w/' and 'wo/' operators should be case-insensitive by @eeeebbbbrrrr in https://github.com/zombodb/zombodb/pull/749
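
To give a feel for what "simple word segmentation" of a big document involves, here is a rough Python approximation. It is not ZDB's code: a plain \w+ regular expression stands in for proper Unicode word segmentation, and the function name is hypothetical. The 1-based positions and character offsets mirror the columns of the zdb.highlight_document() output shown earlier on this page.

```python
import re


def segment_words(text):
    """Yield (term, position, start_offset, end_offset) for each word in text.

    Positions are 1-based and offsets count characters.
    """
    for position, match in enumerate(re.finditer(r"\w+", text), start=1):
        yield match.group(), position, match.start(), match.end()


if __name__ == "__main__":
    sample = "Vallejo has produced film posters"
    for token in segment_words(sample):
        print(token)
```

On the opening words of the sample row from the highlighting issue, this yields "film" at position 4 with offsets 21 to 25, the same values reported there.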

Full Changelog: https://github.com/zombodb/zombodb/compare/v3000.0.12...v3000.1.0
v3000.0.12(May 4, 2022)
This is ZomboDB v3000.0.12. It is a minor bugfix release.

What's Changed

Update docs about row_estimate by @hyperion-cs in https://github.com/zombodb/zombodb/pull/732

Clarification of restrictions on a function that returns a custom type by @hyperion-cs in https://github.com/zombodb/zombodb/pull/736

Issue #730 by @eeeebbbbrrrr in https://github.com/zombodb/zombodb/pull/734

Update CROSS-INDEX-JOINS.md by @bhalonen in https://github.com/zombodb/zombodb/pull/737

fix issue #739 by @eeeebbbbrrrr in https://github.com/zombodb/zombodb/pull/740

New Contributors

@bhalonen made their first contribution in https://github.com/zombodb/zombodb/pull/737

Full Changelog: https://github.com/zombodb/zombodb/compare/v3000.0.11...v3000.0.12
v3000.0.11(Mar 31, 2022)
This is ZomboDB v3000.0.11. It is a minor bugfix release.

What's Fixed

Issue #732: Aggregate function query retargeting now works correctly

Issue #720: zdb.highlight_document() now correctly highlights nested json objects

What's New?

Issue #722: Added a include_source index configuration feature (default: true) to allow turning off the _source field in the backing Elasticsearch index

Thanks!

Thanks to all users and sponsors.

For a mere $75/mo you can receive pre-compiled ZomboDB binaries for a number of Linux distributions.
v3000.0.10(Mar 1, 2022)

This is ZomboDB v3000.0.10. It is a minor bugfix release that resolves issue #718.

Full Changelog: https://github.com/zombodb/zombodb/compare/v3000.0.9...v3000.0.10
v3000.0.9(Feb 21, 2022)

This is ZomboDB v3000.0.9. There are no changes since v3000.0.8 -- only updated artifacts.
v3000.0.8(Feb 3, 2022)
This is ZomboDB v3000.0.8. It is a minor bugfix release that resolves:

Fix type mismatch on non-Intel archs by @alekitto in https://github.com/zombodb/zombodb/pull/709

Issue #713: Invalid parsing of quoted field names

New Contributors

@alekitto made their first contribution in https://github.com/zombodb/zombodb/pull/709

Full Changelog: https://github.com/zombodb/zombodb/compare/v3000.0.7...v3000.0.8
v3000.0.7(Dec 17, 2021)

This is ZomboDB 3000.0.7. It contains a number of bugfixes primarily related to query generation/execution.

Fixed Bugs

83ae670ad8a128154f230d9805aa59ee6d4a73f0: Issue #209 came back in ZDB3k. It is now fixed

0aa1d08f0b1abbd27c15f6b4de63e3460c2061fe: Issue #206 came back in ZDB3k. It is now fixed

d66c446be27fcc8a02f16c086e22cb4fbb95df37 / 934c682bf8f688fea4fa3c1631a35ef026596ec0: Fix bugs related to the detection of nested json fields in a ZQL Query

1c57e4cfa99e351577e0ca81021fb70a41e34014: Fix bugs related to the detection of the number of non-shadow indexes a table has

653f3b2b8e43c6a6b3bd68331bbd4730cc73ae9c: Port the (very) old "pgTap" tests forward to ZDB3k. These add an additional 520+ tests around ZomboDB's query language (ZQL)

Thanks!

Thanks for using ZomboDB!

If you'd like binary artifacts, please become a $75/mo sponsor. Note that in January 2022 the tier will change to $100/mo for all new sponsors. Existing $75/mo sponsors will remain at that tier.
v3000.0.6(Nov 23, 2021)
This is ZomboDB v3000.0.6. It is a minor bugfix release.

Bugs Fixed

Issue #679: Using NOT where its contained clause is a linked field now works correctly (again)

Issue #688: IndexLinks to schema.table.index with non-alpha-numeric characters can be quoted with backticks

Full Changelog: https://github.com/zombodb/zombodb/compare/v3000.0.5...v3000.0.6
v3000.0.5(Nov 11, 2021)

This is a minor update to v3000.0.4 that fixes the SQL upgrade scripts from v3000.0.3.

While it's still tagged here on GitHub, we recommend that nobody use v3000.0.4 as upgrading to it from a previous version can delete your indexes.
v3000.0.4(Nov 10, 2021)
This is ZomboDB v3000.0.4. It is primarily a bugfix release plus support for PostgreSQL 14.

New Features

Postgres 14 support (thanks to pgx v0.2)

Indexing performance improvements

We now properly wait for all active shards to be ready after CREATE INDEX

Resolved Issues

#675: "quoted phrases" properly rewrite to Elasticsearch match_phrase queries

#677: "Cannot open relation with oid=0" when altering tables without a ZomboDB index

#678: Support Postgres 13+'s IncrementalSort node

#679: Queries such as WHERE t ==> 'NOT (joined_field = 42)' now work correctly

#683: Prefix queries are no longer case sensitive

#688: Support back-ticks for quoting fully-qualified table names in index linking options

Note

Ubuntu Xenial has reached its end of life. As such, it is no longer supported by ZomboDB.

Thanks

Thanks to everyone that helped make this release possible, most especially @TCDI and all the Sponsors!
v3000.0.3(Aug 16, 2021)
This is ZomboDB 3000.0.3. It is a minor bugfix and feature release, resolving a number of issues reported since v3000.0.1 (3000.0.2 was an internal release).

Bugs Fixed

#673: Individual proximity search values (Car w/3 Truck) are now run through analysis

#672: The "order_by" argument for the zdb.tally() and zdb.terms() functions is now properly documented and the docs match the implementation

#688: The documentation for zdb.significant_terms() has been updated to match its implementation

New Features

A new index-level configuration option (issue #669) named max_analyze_token_count which can be increased from its default of 10,000 to allow large documents to be highlighted

A new index-level configuration option (issue #666) named nested_objects_limit which controls the complexity of json(b) columns during indexing. The default is 10,000 but can be increased for more complex data structures

Thanks!

Thanks to everyone who reported the above issues.

Binary Downloads

Binary downloads for various Linux distributions are available to sponsors for $75/mo
v3000.0.1(Jul 26, 2021)

This is ZomboDB v3000.0.1. It is a minor release that fixes a bug in the "ZomboDB Search Accelerator" Elasticsearch plugin, and also resolves issue #657.

If you'd like pre-built binaries for ZomboDB, please consider becoming a sponsor.
v3000.0.0(Jun 28, 2021)
ZomboDB v3000.0.0

This is ZomboDB v3000.0.0. It's a (mostly) faithful rewrite from C to Rust, using our Rust framework for creating Postgres extensions pgx.

In general, it is intended to be backwards compatible with ZomboDB 4.0, but there are some user-facing changes and new features.

New Features

Support for more Elasticsearch aggregate functions

A new, default query language: ZQL

Custom hit highlighting functions

Ability to perform cross-index joins

much more!

Upgrading From Previous Releases

If upgrading from ZomboDB v3000.0.0-beta1, you can simply install the new binaries and execute ALTER EXTENSION zombodb UPDATE; in every database.

If upgrading from older versions, you'll need to drop and re-create all your USING zombodb indices as they're not compatible with ZomboDB 3000.

Downloading

Now that we're out of alpha/beta (and many thanks to those that reported bugs!) binary downloads are not available without becoming a sponsor at the $75/mo tier.

Otherwise, building from source (or via the docker-build-system) is, of course, allowed.
v3000.0.0-beta1(Feb 22, 2021)
This is ZomboDB 3000.0.0-beta1. It is a minor bugfix release relative to -alpha4 and if things go as planned will be the last release prior to a final, production-ready release.

-beta1 now supports upgrading the extension (from -alpha4 only), so if you're upgrading from -alpha4, simply install the new packages as normal and run ALTER EXTENSION zombodb UPDATE; in every database with ZomboDB.

Fixed Bugs

Issue #632 - Rows updated prior to creating a USING zombodb index are now visible by the backing Elasticsearch index

Issue #633 - ZQL no longer generates "prefix" queries for nested object properties of type keyword

Issue #634 - Handle a few more parse tree nodes when examining queries for "zombodb-isms"

New Features

A number of additional Elasticsearch aggregate queries have been wrapped as "agg builder" functions. See the AGGREGATE-BUILDER-API docs for more details

Sponsor Our Work!

Please consider sponsoring our work. ZomboDB development requires a tremendous amount of effort and resources. We appreciate your consideration!
zombodb_alpine-3.12_10-3000.0.0-beta1.x86_64.apk(3.12 MB)
zombodb_alpine-3.12_11-3000.0.0-beta1.x86_64.apk(3.12 MB)
zombodb_alpine-3.12_12-3000.0.0-beta1.x86_64.apk(3.13 MB)
zombodb_alpine-3.12_13-3000.0.0-beta1.x86_64.apk(3.13 MB)
zombodb_amazonlinux-2_10-3000.0.0-beta1_1.x86_64.rpm(3.10 MB)
zombodb_amazonlinux-2_11-3000.0.0-beta1_1.x86_64.rpm(3.10 MB)
zombodb_amazonlinux-2_12-3000.0.0-beta1_1.x86_64.rpm(3.11 MB)
zombodb_amazonlinux-2_13-3000.0.0-beta1_1.x86_64.rpm(3.11 MB)
zombodb_centos-8_10-3000.0.0-beta1_1.x86_64.rpm(3.11 MB)
zombodb_centos-8_11-3000.0.0-beta1_1.x86_64.rpm(3.11 MB)
zombodb_centos-8_12-3000.0.0-beta1_1.x86_64.rpm(3.11 MB)
zombodb_centos-8_13-3000.0.0-beta1_1.x86_64.rpm(3.11 MB)
zombodb_debian-bullseye_10-3000.0.0-beta1_amd64.deb(3.13 MB)
zombodb_debian-bullseye_11-3000.0.0-beta1_amd64.deb(3.13 MB)
zombodb_debian-bullseye_12-3000.0.0-beta1_amd64.deb(3.14 MB)
zombodb_debian-bullseye_13-3000.0.0-beta1_amd64.deb(3.13 MB)
zombodb_debian-buster_10-3000.0.0-beta1_amd64.deb(3.13 MB)
zombodb_debian-buster_11-3000.0.0-beta1_amd64.deb(3.13 MB)
zombodb_debian-buster_12-3000.0.0-beta1_amd64.deb(3.14 MB)
zombodb_debian-buster_13-3000.0.0-beta1_amd64.deb(3.13 MB)
zombodb_debian-sid_10-3000.0.0-beta1_amd64.deb(3.13 MB)
zombodb_debian-sid_11-3000.0.0-beta1_amd64.deb(3.13 MB)
zombodb_debian-sid_12-3000.0.0-beta1_amd64.deb(3.14 MB)
zombodb_debian-sid_13-3000.0.0-beta1_amd64.deb(3.13 MB)
zombodb_debian-stretch_10-3000.0.0-beta1_amd64.deb(3.13 MB)
zombodb_debian-stretch_11-3000.0.0-beta1_amd64.deb(3.13 MB)
zombodb_debian-stretch_12-3000.0.0-beta1_amd64.deb(3.14 MB)
zombodb_debian-stretch_13-3000.0.0-beta1_amd64.deb(3.13 MB)
zombodb_fedora-31_10-3000.0.0-beta1_1.x86_64.rpm(3.11 MB)
zombodb_fedora-31_11-3000.0.0-beta1_1.x86_64.rpm(3.11 MB)
zombodb_fedora-31_12-3000.0.0-beta1_1.x86_64.rpm(3.11 MB)
zombodb_fedora-31_13-3000.0.0-beta1_1.x86_64.rpm(3.11 MB)
zombodb_fedora-32_10-3000.0.0-beta1_1.x86_64.rpm(3.11 MB)
zombodb_fedora-32_11-3000.0.0-beta1_1.x86_64.rpm(3.11 MB)
zombodb_fedora-32_12-3000.0.0-beta1_1.x86_64.rpm(3.11 MB)
zombodb_fedora-32_13-3000.0.0-beta1_1.x86_64.rpm(3.11 MB)
zombodb_fedora-33_10-3000.0.0-beta1_1.x86_64.rpm(3.10 MB)
zombodb_fedora-33_11-3000.0.0-beta1_1.x86_64.rpm(3.11 MB)
zombodb_fedora-33_12-3000.0.0-beta1_1.x86_64.rpm(3.11 MB)
zombodb_fedora-33_13-3000.0.0-beta1_1.x86_64.rpm(3.11 MB)
zombodb_ubuntu-bionic_10-3000.0.0-beta1_amd64.deb(3.13 MB)
zombodb_ubuntu-bionic_11-3000.0.0-beta1_amd64.deb(3.13 MB)
zombodb_ubuntu-bionic_12-3000.0.0-beta1_amd64.deb(3.14 MB)
zombodb_ubuntu-bionic_13-3000.0.0-beta1_amd64.deb(3.14 MB)
zombodb_ubuntu-focal_10-3000.0.0-beta1_amd64.deb(3.13 MB)
zombodb_ubuntu-focal_11-3000.0.0-beta1_amd64.deb(3.13 MB)
zombodb_ubuntu-focal_12-3000.0.0-beta1_amd64.deb(3.13 MB)
zombodb_ubuntu-focal_13-3000.0.0-beta1_amd64.deb(3.13 MB)
zombodb_ubuntu-xenial_10-3000.0.0-beta1_amd64.deb(3.13 MB)
zombodb_ubuntu-xenial_11-3000.0.0-beta1_amd64.deb(3.13 MB)
zombodb_ubuntu-xenial_12-3000.0.0-beta1_amd64.deb(3.14 MB)
zombodb_ubuntu-xenial_13-3000.0.0-beta1_amd64.deb(3.13 MB)
v3000.0.0-alpha4(Jan 17, 2021)
ZomboDB 3000.0.0-alpha4

Compared to -alpha3, this release contains a few bugfixes, SQL-level API changes, and some Elasticsearch index mapping changes which will necessitate indexes be re-built.

We don't yet have an ALTER EXTENSION zombodb UPDATE; process in place, so upgrading to this version will require that you install the new extension on your Postgres server and then DROP EXTENSION zombodb; CREATE EXTENSION zombodb; and then re-create any of your indexes.

First of all, THANKS!

Thanks to our current sponsors and customers. Funding goes a long way towards making great software.

Bugfixes

Issue #629 - During SELECT statements, per-tuple errors from Elasticsearch are now properly reported and will cause the statement to raise an ERROR.

New Features

More aggregate builder functions have been added to cover the Box_plot, Geo_Centroid, Median_Absolute_Deviation, Percentiles, String_Stats, Weighted_Avg, Top_Metric, T_Test, and Value_Count aggregates.

SQL API Changes

The arguments to dsl.constant_score() have been reordered such that boost is the first argument.

The return type of zdb.suggest_terms() has been changed to provide a more useful result set.

The optional minimum_should_match argument of various dsl.* functions has changed to be of type integer.

zdb.highlight_document() now understands how to highlight individual array elements and nested json object properties. It also now returns an additional column named array_index.

Elasticsearch Index Mapping Changes

turn fielddata on by default for the 'zdb.fulltext_with_shingles' type

all date/timestamp/time fields now generate a mapping where the field itself is of type "keyword" but contains a subfield (named "date") that is of type "date". zdb.tally() has been updated to understand this, but all other aggregate functions will require one to use the field "fieldname.date" if it's necessary to aggregate on the actual date value as stored in Elasticsearch.
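
The mapping shape described above, and the field-name rule that follows from it, can be sketched in Python. The dictionary only shows the two type settings named in the release notes. Any other mapping properties ZomboDB may emit are omitted, and the helper names are hypothetical.

```python
def date_field_mapping():
    """Mapping shape described above: a keyword field with a `date` subfield."""
    return {"type": "keyword", "fields": {"date": {"type": "date"}}}


def aggregate_field(field_name, mapping):
    """Pick the field name to aggregate on when the real date value is needed."""
    props = mapping.get(field_name, {})
    if props.get("fields", {}).get("date", {}).get("type") == "date":
        return field_name + ".date"
    return field_name


if __name__ == "__main__":
    mapping = {"date_combined": date_field_mapping(), "pkey": {"type": "long"}}
    print(aggregate_field("date_combined", mapping))
    print(aggregate_field("pkey", mapping))
```

Per the note above, zdb.tally() handles this itself, so the ".date" suffix matters only for the other aggregate functions.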

zombodb_alpine-3.12_10-3000.0.0-alpha4.x86_64.apk(3.09 MB)
zombodb_alpine-3.12_11-3000.0.0-alpha4.x86_64.apk(3.09 MB)
zombodb_alpine-3.12_12-3000.0.0-alpha4.x86_64.apk(3.09 MB)
zombodb_alpine-3.12_13-3000.0.0-alpha4.x86_64.apk(3.09 MB)
zombodb_centos-8_10-3000.0.0-alpha4_1.x86_64.rpm(3.07 MB)
zombodb_centos-8_11-3000.0.0-alpha4_1.x86_64.rpm(3.07 MB)
zombodb_centos-8_12-3000.0.0-alpha4_1.x86_64.rpm(3.08 MB)
zombodb_centos-8_13-3000.0.0-alpha4_1.x86_64.rpm(3.08 MB)
zombodb_debian-bullseye_10-3000.0.0-alpha4_amd64.deb(3.09 MB)
zombodb_debian-bullseye_11-3000.0.0-alpha4_amd64.deb(3.09 MB)
zombodb_debian-bullseye_12-3000.0.0-alpha4_amd64.deb(3.10 MB)
zombodb_debian-bullseye_13-3000.0.0-alpha4_amd64.deb(3.10 MB)
zombodb_debian-buster_10-3000.0.0-alpha4_amd64.deb(3.09 MB)
zombodb_debian-buster_11-3000.0.0-alpha4_amd64.deb(3.09 MB)
zombodb_debian-buster_12-3000.0.0-alpha4_amd64.deb(3.10 MB)
zombodb_debian-buster_13-3000.0.0-alpha4_amd64.deb(3.10 MB)
zombodb_debian-sid_10-3000.0.0-alpha4_amd64.deb(3.09 MB)
zombodb_debian-sid_11-3000.0.0-alpha4_amd64.deb(3.09 MB)
zombodb_debian-sid_12-3000.0.0-alpha4_amd64.deb(3.10 MB)
zombodb_debian-sid_13-3000.0.0-alpha4_amd64.deb(3.10 MB)
zombodb_debian-stretch_10-3000.0.0-alpha4_amd64.deb(3.09 MB)
zombodb_debian-stretch_11-3000.0.0-alpha4_amd64.deb(3.09 MB)
zombodb_debian-stretch_12-3000.0.0-alpha4_amd64.deb(3.10 MB)
zombodb_debian-stretch_13-3000.0.0-alpha4_amd64.deb(3.10 MB)
zombodb_fedora-31_10-3000.0.0-alpha4_1.x86_64.rpm(3.07 MB)
zombodb_fedora-31_11-3000.0.0-alpha4_1.x86_64.rpm(3.07 MB)
zombodb_fedora-31_12-3000.0.0-alpha4_1.x86_64.rpm(3.08 MB)
zombodb_fedora-31_13-3000.0.0-alpha4_1.x86_64.rpm(3.08 MB)
zombodb_fedora-32_10-3000.0.0-alpha4_1.x86_64.rpm(3.07 MB)
zombodb_fedora-32_11-3000.0.0-alpha4_1.x86_64.rpm(3.07 MB)
zombodb_fedora-32_12-3000.0.0-alpha4_1.x86_64.rpm(3.08 MB)
zombodb_fedora-32_13-3000.0.0-alpha4_1.x86_64.rpm(3.08 MB)
zombodb_fedora-33_10-3000.0.0-alpha4_1.x86_64.rpm(3.07 MB)
zombodb_fedora-33_11-3000.0.0-alpha4_1.x86_64.rpm(3.07 MB)
zombodb_fedora-33_12-3000.0.0-alpha4_1.x86_64.rpm(3.08 MB)
zombodb_fedora-33_13-3000.0.0-alpha4_1.x86_64.rpm(3.08 MB)
zombodb_ubuntu-bionic_10-3000.0.0-alpha4_amd64.deb(3.09 MB)
zombodb_ubuntu-bionic_11-3000.0.0-alpha4_amd64.deb(3.10 MB)
zombodb_ubuntu-bionic_12-3000.0.0-alpha4_amd64.deb(3.10 MB)
zombodb_ubuntu-bionic_13-3000.0.0-alpha4_amd64.deb(3.10 MB)
zombodb_ubuntu-focal_10-3000.0.0-alpha4_amd64.deb(3.09 MB)
zombodb_ubuntu-focal_11-3000.0.0-alpha4_amd64.deb(3.09 MB)
zombodb_ubuntu-focal_12-3000.0.0-alpha4_amd64.deb(3.10 MB)
zombodb_ubuntu-focal_13-3000.0.0-alpha4_amd64.deb(3.10 MB)
zombodb_ubuntu-xenial_10-3000.0.0-alpha4_amd64.deb(3.09 MB)
zombodb_ubuntu-xenial_11-3000.0.0-alpha4_amd64.deb(3.09 MB)
zombodb_ubuntu-xenial_12-3000.0.0-alpha4_amd64.deb(3.10 MB)
zombodb_ubuntu-xenial_13-3000.0.0-alpha4_amd64.deb(3.10 MB)
v3000.0.0-alpha3(Jan 1, 2021)
ZomboDB 3000.0.0-alpha3

Compared to -alpha2, this release contains a number of bugfixes, performance improvements, and a few new features.

We don't yet have an ALTER EXTENSION zombodb UPDATE; process in place, so upgrading to this version will require that you install the new extension on your Postgres server and then DROP EXTENSION zombodb; CREATE EXTENSION zombodb; and then re-create any of your indexes.

First of all, THANKS!

We want to give a special shout-out to https://github.com/mathroc for all the awesome bug reports. ZomboDB 3000.0.0-alpha3 wouldn't be as good as it is without him.

We also want to thank our current sponsors and customers. Funding goes a long way towards making great software.

And while we're here, 🎉 HAPPY NEW YEAR EVERYONE! 🎉

Bugfixes

Issue #613 - Returning large result sets is no longer "slow". And in conjunction with our commercial "ZomboDB Search Accelerator", it's incredibly fast. More on all of this below.

Issue #614 - Queries with mixed-case field names now parse

Issue #616 - Swapped out HTTP clients for a lot less code and runtime complexity. Thanks ureq!

Issue #618 - Creating a USING zombodb index on a table with existing HOT updates now works correctly

Issue #620 - Improve error message around unexpected uses of the ==> operator

Issue #622 - Tuple updates caused by recursive trigger execution now update properly

Issue #624 - Properly escape encoded JSON values

Issue #626 - ZomboDB's Query Parser (ZQL) now uses an exclusion set for tokenization, allowing non-ascii characters to be used as search values

01b930faad0ebf79d4f3249e07a6f3e2abc3b6e2 - Date range searches now work
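The exclusion-set tokenization mentioned for issue #626 can be pictured with a small Python sketch. It is not ZQL's parser: the EXCLUDED character set and the tokenize function are hypothetical, chosen only to show why splitting on a set of excluded characters lets non-ASCII text through, while an ASCII allow-list truncates it.

```python
import re

# Hypothetical exclusion set: any of these characters ends the current
# token. ZQL's real set is not reproduced here.
EXCLUDED = set(" \t\r\n:()[]{}\"',=<>!~^*?")


def tokenize(text, excluded=EXCLUDED):
    """Split text into tokens, breaking only on excluded characters."""
    tokens, current = [], []
    for ch in text:
        if ch in excluded:
            if current:
                tokens.append("".join(current))
                current = []
        else:
            current.append(ch)
    if current:
        tokens.append("".join(current))
    return tokens


query = "title:caf\u00e9"
# Exclusion set: the accented character stays inside the token.
print(tokenize(query))
# ASCII allow-list for comparison: the token is cut short at the accent.
print(re.findall(r"[A-Za-z0-9_]+", query))
```

With an allow-list, every new script or symbol has to be added explicitly; with an exclusion set, anything that is not syntax is treated as part of a value.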

New Index Configuration Options

max_result_window[integer]: controls the number of docs we retrieve from ES in one request

shadow[boolean]: allow a new index to have different options but not consume storage. See the CROSS-INDEX-JOINS.md documentation for details

nested_object_date_detection[boolean]: should ES do date detection while indexing nested objects?

nested_object_numeric_detection[boolean]: should ES do number detection while indexing nested objects?

nested_object_text_mapping[text/json]: what is the field mapping definition ES should use while indexing nested object properties that are determined to be "strings"?

nested_fields_limit[integer]: what's the maximum number of nested fields allowed in an index?

total_fields_limit[integer]: what's the maximum number of fields allowed in an index?

max_terms_count[integer]: what's the maximum number of terms allowed in a single "terms" query? The default is 65535.

New Features

Specifying the index_name Argument

Any ZomboDB function that requires an index_name as an argument (these are all in the zdb schema) can now take the actual index name, the underlying table name, or a view name that uses a ZomboDB ==> query. ZomboDB will figure out the underlying index to use.

Shadow Indexes

Shadow Indexes are a way to create additional ZomboDB indexes that specify different linking options without using additional storage. The process for using these is described in CROSS-INDEX-JOINS.md.

Index Links (joining across indexes) are Solved within Postgres

If you don't have a license to the "ZomboDB Search Accelerator", ZomboDB can still solve cross-index-joins.

While this works, it is significantly slower without the "ZomboDB Search Accelerator". It may also require that the new max_terms_count index WITH property be set to a value higher than the default.

New Query Language Features

ZomboDB's query language has been given a name: ZQL.

field names can now be quoted with backticks if you have field names that aren't in the set [a-zA-Z0-9_].

a new operator has been added within ZQL itself that maps to the Elasticsearch "match" query: WHERE table ==> 'id:42 or title==>"food for thought"'

field:"phrase with * * * * wildcards" is now optimized into a span query with slop instead of inserting "wildcard" queries (issue #615)
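The backtick-quoting rule for field names listed above could be applied on the client side with a helper like the one below. This is a minimal sketch, assuming the character set [a-zA-Z0-9_] is taken literally from the release note; quote_field is a hypothetical name, and the sketch does not attempt to escape backticks that appear inside a field name, since the notes do not describe how that is handled.

```python
import re

# Field names made up only of these characters need no quoting,
# following the set given in the release note.
_PLAIN_FIELD = re.compile(r"[a-zA-Z0-9_]+")


def quote_field(name):
    """Wrap a field name in backticks when it falls outside the plain set."""
    if _PLAIN_FIELD.fullmatch(name):
        return name
    return "`" + name + "`"


# Build simple field:value clauses for a ZQL query string.
print(quote_field("title") + ":beer")
print(quote_field("first name") + ":beer")
```

Quoting only when needed keeps ordinary queries readable while still allowing column names that contain spaces or punctuation.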

Support for Elasticsearch's "Similarity Module"

ZomboDB now supports defining custom similarity functions, which is handled in a manner similar to defining custom analyzers. See the docs for zdb.define_similarity()

New Aggregation Builder SQL Functions

ZomboDB now provides a number of SQL functions for dynamically building Elasticsearch Aggregate searches. These functions live in the zdb schema and are suffixed with _agg and are documented here

They're designed to make it easy to build arbitrarily-complex aggregates, using ZomboDB's arbitrary aggregate support.

Performance Improvements

Issue #613 brought about a number of significant performance improvements related to returning large result sets (hundreds-of-thousands to millions of rows).

A summary of the changes:

Removed a significant amount of per-tuple processing overhead

Use CBOR instead of JSON as the transport format -- reduces network bandwidth by more than 20%

When creating an index, the docs are now stored in order by the hidden "zdb_ctid" column, if the underlying table doesn't contain json/jsonb columns

This enables ZomboDB to sequentially return heap tuples without also doing a sort in ES, which is much nicer to the disk subsystem and the ES cluster

When ZomboDB can't physically order the ES index by "zdb_ctid", or when a search includes a sort, a limit/offset, or zdb.score(), ZomboDB sorts each returned block of 10k docs by "zdb_ctid". This provides the same benefit, but within blocks of 10k docs rather than across the entire result set.

When a search has no sort, no limit/offset, and doesn't use zdb.score(), the "ZomboDB Search Accelerator" can be used, drastically improving search performance (more than 5x faster).

The max_result_window property can be set per index, which can improve performance by reducing the number of round-trips to ES, at the expense of increased ES memory consumption during search.
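The per-block ordering described in the summary above can be sketched in a few lines of Python. This is an illustration only, not ZomboDB's implementation: sort_in_blocks is a hypothetical name, and each hit is modeled as a (heap block number, tuple offset) pair standing in for "zdb_ctid", which is an assumption about how the value decomposes.

```python
from itertools import islice


def sort_in_blocks(hits, block_size=10_000):
    """Yield hits sorted by ctid within each fetched block.

    Ordering is only guaranteed inside a block, never across blocks,
    which mirrors the "blocks of 10k" behavior described above.
    """
    it = iter(hits)
    while True:
        block = list(islice(it, block_size))
        if not block:
            return
        block.sort()  # tuples compare by block number, then offset
        yield from block


# Tiny block size so the effect is visible with four hits.
hits = [(3, 1), (1, 2), (2, 1), (1, 1)]
print(list(sort_in_blocks(hits, block_size=2)))
```

Sorting per block keeps heap access mostly sequential within each round-trip without asking Elasticsearch to sort the whole result set.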

Additionally, some work has been done on improving indexing performance by reducing per-tuple processing overhead, gaining ~7-10%. Also, the queueing coordination for the background indexer threads has been improved to be more resilient in the face of errors from Elasticsearch.

Thanks!

Thanks to everyone that submitted issues and PRs during -alpha2. Also thanks to our sponsors and customers for their work and support.

Please consider sponsoring our work!
Source code(tar.gz)
Source code(zip)
zombodb_alpine-3.12_pg10-3000.0.0-alpha3.x86_64.apk(3.00 MB)
zombodb_alpine-3.12_pg11-3000.0.0-alpha3.x86_64.apk(3.00 MB)
zombodb_alpine-3.12_pg12-3000.0.0-alpha3.x86_64.apk(3.00 MB)
zombodb_alpine-3.12_pg13-3000.0.0-alpha3.x86_64.apk(3.00 MB)
zombodb_centos-8_pg10-3000.0.0-alpha3_1.x86_64.rpm(2.99 MB)
zombodb_centos-8_pg11-3000.0.0-alpha3_1.x86_64.rpm(2.99 MB)
zombodb_centos-8_pg12-3000.0.0-alpha3_1.x86_64.rpm(2.99 MB)
zombodb_centos-8_pg13-3000.0.0-alpha3_1.x86_64.rpm(2.99 MB)
zombodb_debian-bullseye_pg10-3000.0.0-alpha3_amd64.deb(3.01 MB)
zombodb_debian-bullseye_pg11-3000.0.0-alpha3_amd64.deb(3.01 MB)
zombodb_debian-bullseye_pg12-3000.0.0-alpha3_amd64.deb(3.01 MB)
zombodb_debian-bullseye_pg13-3000.0.0-alpha3_amd64.deb(3.01 MB)
zombodb_debian-buster_pg10-3000.0.0-alpha3_amd64.deb(3.01 MB)
zombodb_debian-buster_pg11-3000.0.0-alpha3_amd64.deb(3.01 MB)
zombodb_debian-buster_pg12-3000.0.0-alpha3_amd64.deb(3.01 MB)
zombodb_debian-buster_pg13-3000.0.0-alpha3_amd64.deb(3.01 MB)
zombodb_debian-sid_pg10-3000.0.0-alpha3_amd64.deb(3.01 MB)
zombodb_debian-sid_pg11-3000.0.0-alpha3_amd64.deb(3.01 MB)
zombodb_debian-sid_pg12-3000.0.0-alpha3_amd64.deb(3.01 MB)
zombodb_debian-sid_pg13-3000.0.0-alpha3_amd64.deb(3.01 MB)
zombodb_debian-stretch_pg10-3000.0.0-alpha3_amd64.deb(3.01 MB)
zombodb_debian-stretch_pg11-3000.0.0-alpha3_amd64.deb(3.01 MB)
zombodb_debian-stretch_pg12-3000.0.0-alpha3_amd64.deb(3.01 MB)
zombodb_debian-stretch_pg13-3000.0.0-alpha3_amd64.deb(3.01 MB)
zombodb_fedora-31_pg10-3000.0.0-alpha3_1.x86_64.rpm(2.99 MB)
zombodb_fedora-31_pg11-3000.0.0-alpha3_1.x86_64.rpm(2.99 MB)
zombodb_fedora-31_pg12-3000.0.0-alpha3_1.x86_64.rpm(2.99 MB)
zombodb_fedora-31_pg13-3000.0.0-alpha3_1.x86_64.rpm(2.99 MB)
zombodb_fedora-32_pg10-3000.0.0-alpha3_1.x86_64.rpm(2.99 MB)
zombodb_fedora-32_pg11-3000.0.0-alpha3_1.x86_64.rpm(2.99 MB)
zombodb_fedora-32_pg12-3000.0.0-alpha3_1.x86_64.rpm(2.99 MB)
zombodb_fedora-32_pg13-3000.0.0-alpha3_1.x86_64.rpm(2.99 MB)
zombodb_fedora-33_pg10-3000.0.0-alpha3_1.x86_64.rpm(2.99 MB)
zombodb_fedora-33_pg11-3000.0.0-alpha3_1.x86_64.rpm(2.99 MB)
zombodb_fedora-33_pg12-3000.0.0-alpha3_1.x86_64.rpm(2.99 MB)
zombodb_fedora-33_pg13-3000.0.0-alpha3_1.x86_64.rpm(2.99 MB)
zombodb_ubuntu-bionic_pg10-3000.0.0-alpha3_amd64.deb(3.02 MB)
zombodb_ubuntu-bionic_pg11-3000.0.0-alpha3_amd64.deb(3.01 MB)
zombodb_ubuntu-bionic_pg12-3000.0.0-alpha3_amd64.deb(3.01 MB)
zombodb_ubuntu-bionic_pg13-3000.0.0-alpha3_amd64.deb(3.02 MB)
zombodb_ubuntu-focal_pg10-3000.0.0-alpha3_amd64.deb(3.01 MB)
zombodb_ubuntu-focal_pg11-3000.0.0-alpha3_amd64.deb(3.01 MB)
zombodb_ubuntu-focal_pg12-3000.0.0-alpha3_amd64.deb(3.01 MB)
zombodb_ubuntu-focal_pg13-3000.0.0-alpha3_amd64.deb(3.01 MB)
zombodb_ubuntu-xenial_pg10-3000.0.0-alpha3_amd64.deb(3.02 MB)
zombodb_ubuntu-xenial_pg11-3000.0.0-alpha3_amd64.deb(3.01 MB)
zombodb_ubuntu-xenial_pg12-3000.0.0-alpha3_amd64.deb(3.01 MB)
zombodb_ubuntu-xenial_pg13-3000.0.0-alpha3_amd64.deb(3.01 MB)
v3000.0.0-alpha2(Nov 24, 2020)
This is ZomboDB 3000, alpha2.

Beyond -alpha1, this release fixes a few bugs and adds some new SQL-level UDFs for inspecting index properties.

Probably the biggest change since alpha1 is that -alpha2 also brings Postgres 13 support!

There is no support for ALTER EXTENSION zombodb UPDATE yet, so to upgrade to alpha2 you'll need to DROP EXTENSION zombodb, install the new extension on the host system, and then CREATE EXTENSION zombodb and re-create your USING zombodb indexes. We'll have ALTER EXTENSION UPDATE support for alpha3 and above.

Changes

1b893805ce19c61d93c2f37363cdae371f902ec1: Any ZomboDB SQL-level function that takes an "index_name" argument can now take a regclass that either specifies an index, a table, or a view. ZomboDB will then determine which underlying ZomboDB index to use.

09761629840132f7336701435826d265251e6ba1: Related to the above, there's now SELECT zdb.determine_index(relation_name) that will return the underlying ZomboDB index it would use

Various documentation cleanup

Issue #608: Using ZomboDB's query language, it is now possible to search boolean fields for field:TRUE or field:FALSE.

Issue #609: CREATE INDEX no longer leaks memory

Issue #602, PR #610: The docker-build-system now also supports building Alpine .apk packages for Postgres 10,11,12,13.

5922601e07e93d7668811b80b31d5194e3cdf3b5: When creating a new index, we wait for yellow status before we can use it (up to 10 minutes)

c8b47f2aafdea7a04f0f7ad33eaccf746e06a31a: Queries that generate a prefix DSL node now supply a rewrite parameter to work around a bug in Elasticsearch

d3e73e879ee0d6fb8f7e9854bab65fb5e48730f4: Escaped characters in query values are now properly unescaped before rewriting to Query DSL

7cd0007ad8c29989af8f71bd1195757f1319cf74, 4633fd418ca7d00950e6d12354fcb0a60ee4515b: Various legacy regression test updates

23ac216ef3191aa99a9bfd13586a78143de5607e: Postgres 13 support

b4471424a69594eb51f51137d0cf6b6dda38274c, 9bbfacfd3d5987fc49b18f87c956bc6cfb627ecd, f88f15231526dc357bf088ccc39208c1614a7ba8: ./docker-build-system/ updated to support Postgres 13, and only distros that also support PG 13

1deab64d071d693869a4b48fbeb08b129b58a50a: .deb and .rpm package names (in their metadata) now include the Postgres version for which they're built, so more than one can be installed on a single system

e05a0ca0ac8fe39ddcf53eaf62f3cedba6694d95: More ZDB UDFs are now parallel_safe

6ffab1b70c772d479bce6ad1e57b4b774d6363e7: The zdb.phrase and zdb.phrase_array type mappings now have "fielddata": true by default

6261414261c2abb8a076bd080c4fbff846424967: The zdb.fulltext_with_shingles mapping now also applies the zdb_truncate_to_fit filter to ensure terms are within the term length limits required by Elasticsearch

64be60552ff350d7c05eb4a6b8cedbc46e4e72f2: Added SELECT zdb.index_field_lists(index_name) UDF to return the field lists defined on an index

2ef5a84ebe14c1de41279f3058aac825d187f7d5: Added SELECT zdb.field_mapping(index_name, field_name) UDF to return the Elasticsearch index mapping definition for the specified field. This is especially useful for determining the shape of dynamically defined "nested object" fields that originate in json/jsonb columns.

93587f44988b5c083e4f7f3edb6feed926072068: Added support for Elasticsearch's "terms suggester" via SELECT zdb.suggest_terms(...)

Thanks!

Thanks to everyone that submitted issues and PRs during -alpha1. Also thanks to our sponsors and customers for their work and support.

Please consider sponsoring our work!
Source code(tar.gz)
Source code(zip)
zombodb_alpine-3.12_pg10-3000.0.0-alpha2.x86_64.apk(2.85 MB)
zombodb_alpine-3.12_pg11-3000.0.0-alpha2.x86_64.apk(2.85 MB)
zombodb_alpine-3.12_pg12-3000.0.0-alpha2.x86_64.apk(2.85 MB)
zombodb_alpine-3.12_pg13-3000.0.0-alpha2.x86_64.apk(2.85 MB)
zombodb_centos-8_pg10-3000.0.0-alpha2_1.x86_64.rpm(2.86 MB)
zombodb_centos-8_pg11-3000.0.0-alpha2_1.x86_64.rpm(2.85 MB)
zombodb_centos-8_pg12-3000.0.0-alpha2_1.x86_64.rpm(2.86 MB)
zombodb_centos-8_pg13-3000.0.0-alpha2_1.x86_64.rpm(2.85 MB)
zombodb_debian-bullseye_pg10-3000.0.0-alpha2_amd64.deb(2.87 MB)
zombodb_debian-bullseye_pg11-3000.0.0-alpha2_amd64.deb(2.87 MB)
zombodb_debian-bullseye_pg12-3000.0.0-alpha2_amd64.deb(2.88 MB)
zombodb_debian-bullseye_pg13-3000.0.0-alpha2_amd64.deb(2.87 MB)
zombodb_debian-buster_pg10-3000.0.0-alpha2_amd64.deb(2.87 MB)
zombodb_debian-buster_pg11-3000.0.0-alpha2_amd64.deb(2.87 MB)
zombodb_debian-buster_pg12-3000.0.0-alpha2_amd64.deb(2.88 MB)
zombodb_debian-buster_pg13-3000.0.0-alpha2_amd64.deb(2.87 MB)
zombodb_debian-sid_pg10-3000.0.0-alpha2_amd64.deb(2.87 MB)
zombodb_debian-sid_pg11-3000.0.0-alpha2_amd64.deb(2.87 MB)
zombodb_debian-sid_pg12-3000.0.0-alpha2_amd64.deb(2.88 MB)
zombodb_debian-sid_pg13-3000.0.0-alpha2_amd64.deb(2.87 MB)
zombodb_debian-stretch_pg10-3000.0.0-alpha2_amd64.deb(2.87 MB)
zombodb_debian-stretch_pg11-3000.0.0-alpha2_amd64.deb(2.87 MB)
zombodb_debian-stretch_pg12-3000.0.0-alpha2_amd64.deb(2.88 MB)
zombodb_debian-stretch_pg13-3000.0.0-alpha2_amd64.deb(2.87 MB)
zombodb_fedora-31_pg10-3000.0.0-alpha2_1.x86_64.rpm(2.85 MB)
zombodb_fedora-31_pg11-3000.0.0-alpha2_1.x86_64.rpm(2.85 MB)
zombodb_fedora-31_pg12-3000.0.0-alpha2_1.x86_64.rpm(2.86 MB)
zombodb_fedora-31_pg13-3000.0.0-alpha2_1.x86_64.rpm(2.85 MB)
zombodb_fedora-32_pg10-3000.0.0-alpha2_1.x86_64.rpm(2.85 MB)
zombodb_fedora-32_pg11-3000.0.0-alpha2_1.x86_64.rpm(2.85 MB)
zombodb_fedora-32_pg12-3000.0.0-alpha2_1.x86_64.rpm(2.86 MB)
zombodb_fedora-32_pg13-3000.0.0-alpha2_1.x86_64.rpm(2.85 MB)
zombodb_fedora-33_pg10-3000.0.0-alpha2_1.x86_64.rpm(2.85 MB)
zombodb_fedora-33_pg11-3000.0.0-alpha2_1.x86_64.rpm(2.85 MB)
zombodb_fedora-33_pg12-3000.0.0-alpha2_1.x86_64.rpm(2.86 MB)
zombodb_fedora-33_pg13-3000.0.0-alpha2_1.x86_64.rpm(2.85 MB)
zombodb_ubuntu-bionic_pg10-3000.0.0-alpha2_amd64.deb(2.87 MB)
zombodb_ubuntu-bionic_pg11-3000.0.0-alpha2_amd64.deb(2.87 MB)
zombodb_ubuntu-bionic_pg12-3000.0.0-alpha2_amd64.deb(2.88 MB)
zombodb_ubuntu-bionic_pg13-3000.0.0-alpha2_amd64.deb(2.87 MB)
zombodb_ubuntu-focal_pg10-3000.0.0-alpha2_amd64.deb(2.87 MB)
zombodb_ubuntu-focal_pg11-3000.0.0-alpha2_amd64.deb(2.87 MB)
zombodb_ubuntu-focal_pg12-3000.0.0-alpha2_amd64.deb(2.88 MB)
zombodb_ubuntu-focal_pg13-3000.0.0-alpha2_amd64.deb(2.87 MB)
zombodb_ubuntu-xenial_pg10-3000.0.0-alpha2_amd64.deb(2.88 MB)
zombodb_ubuntu-xenial_pg11-3000.0.0-alpha2_amd64.deb(2.87 MB)
zombodb_ubuntu-xenial_pg12-3000.0.0-alpha2_amd64.deb(2.88 MB)
zombodb_ubuntu-xenial_pg13-3000.0.0-alpha2_amd64.deb(2.87 MB)
v3000.0.0-alpha1(Nov 5, 2020)
ZomboDB 3000 (alpha1)!

This is ZomboDB 3000. It's a mostly faithful rewrite from C to Rust (using our Rust framework for Postgres extensions called pgx).

In general, it is intended to be backwards compatible with ZomboDB 4.0, but there are some user-facing changes and new features.

Versioning

The full version is v3000.0.0.

We decided to align ZomboDB's versioning scheme with semver, but we changed the major number to something absurd (3000!) to indicate that this is definitely a new version of ZomboDB based on new technology.

Once we're done with this alpha/beta testing phase, patches and bugfixes will increase the PATCH value, and new (backwards-compatible) features will increase the MINOR value. Changes that require user-facing changes will increase the MAJOR value.
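To make the bump rules concrete, here is a minimal Python sketch of how a MAJOR.MINOR.PATCH version would move under that scheme. The bump function and its change labels ("bugfix", "feature", "breaking") are hypothetical, and pre-release suffixes such as -alpha1 are deliberately not handled.

```python
def bump(version, change):
    """Return the next MAJOR.MINOR.PATCH version for a kind of change.

    The change labels are illustrative names for the three cases the
    versioning section describes.
    """
    major, minor, patch = (int(part) for part in version.split("."))
    if change == "breaking":   # requires user-facing changes
        return f"{major + 1}.0.0"
    if change == "feature":    # new, backwards-compatible feature
        return f"{major}.{minor + 1}.0"
    if change == "bugfix":     # patch or bugfix
        return f"{major}.{minor}.{patch + 1}"
    raise ValueError(f"unknown change type: {change!r}")


print(bump("3000.0.0", "bugfix"))
print(bump("3000.0.1", "feature"))
print(bump("3000.1.0", "breaking"))
```

Resetting the lower components to zero on a MINOR or MAJOR bump follows the usual semver convention.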

Postgres Version Support

ZomboDB 3000 now supports Postgres 10, 11, 12. Postgres 13 support will be forthcoming, once pgx is upgraded to support Postgres 13.

Elasticsearch Version Support

ZomboDB 3000 now (only) supports Elasticsearch's 7.x series.

New Features

ZomboDB 3000 brings two major new features over v4.0. The first is a new query language, and the second is an additional approach to hit highlighting.

The query language is documented in QUERY-SYNTAX.md. This query language is actually something that one of the first versions of ZomboDB used to have, and it provides a number of sophisticated constructs for text search, including advanced proximity searching and cross-index joins (the cross-index join capabilities require a commercial license to our otherwise-optional Elasticsearch plugin. Please contact [email protected] for details).

Old Features Removed

Compared to ZomboDB 4.0, two features have been removed. ZomboDB no longer supports the TABLESAMPLE clause, and it no longer has a "low-level API". These features weren't widely used and their value doesn't outweigh the complexity of future maintenance.

Upgrading from Previous Versions

There is no upgrade-path from previous versions. You'll need to drop the existing ZomboDB extension, install ZomboDB 3000, and then re-create the extension and all your USING zombodb indices.

Going forward, ZomboDB 3000 will have proper upgrade paths such that ALTER EXTENSION zombodb UPDATE will work as expected.

Downloading/Building ZomboDB 3000

The build-from-source process for ZomboDB 3000 is quite a bit different from past versions of ZomboDB as it now uses the Rust toolchain. This has been documented in SOURCE-INSTALLATION.md (https://github.com/zombodb/zombodb/blob/master/SOURCE-INSTALLATION.md).

During the alpha/beta phase, we will be releasing pre-built binaries (as .deb/.rpm packages) for all supported Postgres versions across 13 different Linux distros. They'll be available for direct download here on GitHub.

Once the final version is released, binary releases will only be available to sponsors at our $75/mo tier or higher. Building binary releases is incredibly expensive and time consuming. We appreciate your support!

Known Issues

As of this alpha-1 release, there are no known issues, but we have intentions to finish up a few features:

Documentation updates

improve the SQL API for ZomboDB 3000's new hit highlighting support

support Elasticsearch's "term suggester" API

support week offsets for zdb.tally() when used on a date field

Postgres 13 support (contingent on PG13 support for pgx)

Other Notes

Why Rust?

Primarily, we wanted better compile-time guarantees that ZomboDB isn't going to crash your Postgres server. We haven't necessarily had problems with that with the C version, but it's a great feature to have those guarantees. Note that ZomboDB does use some unsafe Rust (as does pgx) -- nothing is perfect.

We also wanted it to be easier to support multiple versions of Postgres from the same code-base. Rust's feature gating capabilities are perfect for this kind of thing, and don't obfuscate the code.

Finally, the Rust crate ecosystem is quite extensive, and allows ZomboDB to take advantage of other open-source libraries.

Performance Improvements?

During our development, profiling, and testing of ZomboDB 3000 we've seen indexing improvements of 2x, and search result retrieval improvements of nearly 1.5x. Some of these improvements are due to Elasticsearch 7.x, and some are due to better code implementation/design facilitated by Rust.

As always, there's going to be room for future performance improvements as well.
Source code(tar.gz)
Source code(zip)
zombodb_centos-8_pg10-3000.0.0-alpha1_1.x86_64.rpm(3.29 MB)
zombodb_centos-8_pg11-3000.0.0-alpha1_1.x86_64.rpm(3.29 MB)
zombodb_centos-8_pg12-3000.0.0-alpha1_1.x86_64.rpm(3.29 MB)
zombodb_debian-bullseye_pg10-3000.0.0-alpha1_amd64.deb(3.30 MB)
zombodb_debian-bullseye_pg11-3000.0.0-alpha1_amd64.deb(3.31 MB)
zombodb_debian-bullseye_pg12-3000.0.0-alpha1_amd64.deb(3.31 MB)
zombodb_debian-buster_pg10-3000.0.0-alpha1_amd64.deb(3.31 MB)
zombodb_debian-buster_pg11-3000.0.0-alpha1_amd64.deb(3.31 MB)
zombodb_debian-buster_pg12-3000.0.0-alpha1_amd64.deb(3.31 MB)
zombodb_debian-jessie_pg10-3000.0.0-alpha1_amd64.deb(3.31 MB)
zombodb_debian-jessie_pg11-3000.0.0-alpha1_amd64.deb(3.32 MB)
zombodb_debian-jessie_pg12-3000.0.0-alpha1_amd64.deb(3.32 MB)
zombodb_debian-sid_pg10-3000.0.0-alpha1_amd64.deb(3.31 MB)
zombodb_debian-sid_pg11-3000.0.0-alpha1_amd64.deb(3.31 MB)
zombodb_debian-sid_pg12-3000.0.0-alpha1_amd64.deb(3.31 MB)
zombodb_debian-stretch_pg10-3000.0.0-alpha1_amd64.deb(3.31 MB)
zombodb_debian-stretch_pg11-3000.0.0-alpha1_amd64.deb(3.31 MB)
zombodb_debian-stretch_pg12-3000.0.0-alpha1_amd64.deb(3.31 MB)
zombodb_fedora-31_pg10-3000.0.0-alpha1_1.x86_64.rpm(3.29 MB)
zombodb_fedora-31_pg11-3000.0.0-alpha1_1.x86_64.rpm(3.29 MB)
zombodb_fedora-31_pg12-3000.0.0-alpha1_1.x86_64.rpm(3.29 MB)
zombodb_fedora-32_pg10-3000.0.0-alpha1_1.x86_64.rpm(3.28 MB)
zombodb_fedora-32_pg11-3000.0.0-alpha1_1.x86_64.rpm(3.29 MB)
zombodb_fedora-32_pg12-3000.0.0-alpha1_1.x86_64.rpm(3.29 MB)
zombodb_fedora-33_pg10-3000.0.0-alpha1_1.x86_64.rpm(3.28 MB)
zombodb_fedora-33_pg11-3000.0.0-alpha1_1.x86_64.rpm(3.29 MB)
zombodb_fedora-33_pg12-3000.0.0-alpha1_1.x86_64.rpm(3.29 MB)
zombodb_ubuntu-bionic_pg10-3000.0.0-alpha1_amd64.deb(3.31 MB)
zombodb_ubuntu-bionic_pg11-3000.0.0-alpha1_amd64.deb(3.31 MB)
zombodb_ubuntu-bionic_pg12-3000.0.0-alpha1_amd64.deb(3.31 MB)
zombodb_ubuntu-eoan_pg10-3000.0.0-alpha1_amd64.deb(3.31 MB)
zombodb_ubuntu-eoan_pg11-3000.0.0-alpha1_amd64.deb(3.31 MB)
zombodb_ubuntu-eoan_pg12-3000.0.0-alpha1_amd64.deb(3.32 MB)
zombodb_ubuntu-focal_pg10-3000.0.0-alpha1_amd64.deb(3.31 MB)
zombodb_ubuntu-focal_pg11-3000.0.0-alpha1_amd64.deb(3.31 MB)
zombodb_ubuntu-focal_pg12-3000.0.0-alpha1_amd64.deb(3.32 MB)
zombodb_ubuntu-xenial_pg10-3000.0.0-alpha1_amd64.deb(3.31 MB)
zombodb_ubuntu-xenial_pg11-3000.0.0-alpha1_amd64.deb(3.31 MB)
zombodb_ubuntu-xenial_pg12-3000.0.0-alpha1_amd64.deb(3.31 MB)
v5.6.16-1.0.20(Jan 16, 2020)
This is ZomboDB v5.6.16-1.0.20. It is a minor bugfix release.

Bugs Fixed

Issue #396: Queries using empty arrays (... => 'field:[]';) now properly match no records for that clause instead of all records

Commit 6812a5d9757a237f0da84a9f3351be7cdebe7f09: Fix a hit highlighting issue with Proximity clauses where they sometimes wouldn't highlight

Downloading

Please download from https://www.zombodb.com/releases/
Source code(tar.gz)
Source code(zip)
v4.0(Nov 19, 2019)
This is ZomboDB 4.0. Due to the large set of changes and the fact it now supports multiple versions of Postgres (10 & 11), we've decided to change the versioning scheme.

Versioning Scheme

Version numbers will now simply be MajorN.MinorN. Increases in the minor number will represent bugfix/small feature releases, whereas increases in the major number will represent major new feature releases, likely requiring a REINDEX.

For this release, we chose v4.0 as this is essentially the 4th major release of ZomboDB since July 2015.

Major Changes

Issue #328: Postgres 11 Support

ZomboDB is now compatible with Postgres 11, along with still being compatible with Postgres 10. There's no difference in ZomboDB functionality between Postgres 10 and Postgres 11.

Issue #253: PostGIS Support

ZomboDB now supports indexing PostGIS geometry and geography types into their corresponding Elasticsearch types. The same is true for Postgres' point type. And they can be queried using dsl.geo_shape(), dsl.geo_polygon(), and dsl.geo_bounding_box() SQL functions.

ZomboDB transparently transforms geometry and geography data to CRS 4326 during indexing. Note that queries must be in CRS 4326 too, either directly or via ST_Transform(). See POSTGIS-SUPPORT.md for more details and examples.

Other Fixes and Features

Issue #331: DROP EXTENSION zombodb; properly cleans up session_preload_libraries setting

Issue #341: Debian 9 (stretch) binary releases

Issue #343: More consistent scoring results due to improved VACUUM support

Issue #345: CREATE INDEX in aborted transaction deletes any Elasticsearch indices that may have been created.

Issue #353: ZomboDB does a better job closing Elasticsearch scroll cursors

Issue #359: INSERTs using CTEs now index the correct data

Issue #360: dsl.offset() now functions correctly

Issue #363: A dsl.datetime_range() function has been added

Issue #373: ZomboDB no longer segfaults when rolling back to a SAVEPOINT

Issue #390: ZomboDB no longer emits a "ZomboDB Loaded" log message

Add an Elasticsearch analyzer for indexing emojis. Very useful as the analyzer for a subfield of a top-level text column to index just the emoji characters.

Can now specify, via zdb.define_type_conversion(), custom datatype conversions to JSON for indexing in ES (used by the PostGIS support)

Transparently generate "nested" aggregations when using a ZomboDB aggregation function (such as zdb.terms()) against a property of a json/jsonb column

bytea types are now correctly base64 encoded and stored in Elasticsearch as type:binary.
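For the bytea item above, the encoding step can be illustrated with Python's standard library. This is a sketch of the general idea (raw bytes become a Base64 string, which is what an Elasticsearch "binary" field stores), not ZomboDB's code; bytea_to_es_value and the payload field name are hypothetical.

```python
import base64
import json


def bytea_to_es_value(raw):
    """Encode raw bytes as the Base64 string a binary field expects."""
    return base64.b64encode(raw).decode("ascii")


raw = b"\x00\x01zdb"
doc = {"payload": bytea_to_es_value(raw)}

# The document is plain JSON once the bytes are Base64 text.
print(json.dumps(doc))

# Decoding recovers the original bytes exactly.
assert base64.b64decode(doc["payload"]) == raw
```

Base64 keeps arbitrary bytes, including NUL and non-UTF-8 sequences, safe inside a JSON document.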

Upgrading

If you're upgrading to this version on Postgres 10 from a previously installed version of ZomboDB, all you need to do is install the extension and run ALTER EXTENSION zombodb UPDATE; in every database where it's used.

If, now that ZomboDB supports Postgres 11, you want to upgrade from Postgres 10 to 11, you'll need to re-create all your ZomboDB indices after you upgrade Postgres. Essentially, treat this "upgrade" path as a fresh ZomboDB installation.

The Future

Sponsorship

ZomboDB (via https://github.com/sponsors/eeeebbbbrrrr) has joined GitHub's Sponsor program. Obviously, we're very excited about this program!

ZomboDB is a thing that runs inside your production Postgres database and we hope potential sponsors will take this fact into consideration. ZomboDB development requires significant time, planning, and testing to ensure your data is correctly and safely managed.

We hope there's a sponsorship tier for everyone that's interested. Depending on your sponsorship, you'll gain access to ZomboDB's private Discord server, pre-built binaries (see below), and professional consulting services.

Even if you never become a sponsor, we sincerely appreciate you using ZomboDB!

Binary Distribution

Starting with this release (v4.0), ZomboDB will no longer be released in binary form. Sponsorship, starting at $75/mo, is how you can access binaries.

The source code, of course, remains open source under the Apache 2 license, and you're free, as always, to clone the repo and build it yourself. We'll work on improving the SOURCE-INSTALLATION.md documentation over time.

Postgres & Elasticsearch Support

Postgres 12 support will be forthcoming. It looks like there have been enough internal Postgres C API changes that this effort will take a few months of development.

Elasticsearch 7.x support will be forthcoming too. Most of the work, in a draft form, has already been done. Now it boils down to figuring out a clean way to support each of ES 5.6, 6.x, and 7.x from the same code base.
Source code(tar.gz)
Source code(zip)
v5.6.16-1.0.19(Nov 12, 2019)
This is ZomboDB v5.6.16-1.0.19. It is a small bugfix and feature release.

Bug Fixes

Issue #393: Fix highlighting of long proximity clauses

New Feature

Issue #392: Add ability to change the Elasticsearch search_type per query

Downloading

Please download from https://www.zombodb.com/releases
Source code(tar.gz)
Source code(zip)
v5.6.16-1.0.18(Jun 13, 2019)

This is a new release for ZomboDB's Postgres 9.3/4/5 support that requires Elasticsearch 5.6. This release changes the required version of Elasticsearch from 5.6.4 to the last in the 5.6 series: 5.6.16.

ZomboDB no longer supports ES 5.6.4. The decision was made to move to the last version in the ES 5.6 series to avoid https://github.com/elastic/elasticsearch/pull/36770.

To upgrade, first follow Elasticsearch's standard cluster upgrading process, then install this version of ZomboDB's ES plugin, then follow ZomboDB's standard upgrade process for the Postgres side of things. There should be no need to reindex any data.

This release also fixes a minor bug in ZomboDB, issue #355.

Downloading

Please download from https://www.zombodb.com/releases/
Source code(tar.gz)
Source code(zip)