Comments (5)
Hi,
I didn't realize the incrementing.field.name should be unique, thanks for mentioning that.
I am using it with a a timestamp but multiple records could have the same time stamp if they are processed at the same time.
I think the secondary sort field makes sense.
from kafka-connect-elasticsearch-source.
Hi @albertwgchu, I've published a pre-release (1.4) with the secondary sort feature.
You can use the incrementing.secondary.field.name
settings to specify the secondary sort field. Note that text fields cannot be sorted, so if you need to sort string fields you will need to use keyword fields.
If everything is fine also for your side, I will mark version 1.4 as stable.
from kafka-connect-elasticsearch-source.
Hi @albertwgchu,
the incrementing incrementing.field.name
should be a strictly increasing field (no duplicates), to avoid the described issue.
So, in the described situation you may loose some records.
Maybe in a new version I can add a secondary field for pagination, to avoid this issue. The _id field would be great, but it is not possible to sort it. However I can add a custom secondary sort field, so let say when you have duplicates n the primary increasing field, the secondary will help in pagination avoiding data losses.
Could it be useful to you?
from kafka-connect-elasticsearch-source.
Cool, I will add configurable secondary sort field in the next version.
from kafka-connect-elasticsearch-source.
I will take a look in the next few days.
from kafka-connect-elasticsearch-source.
Related Issues (20)
- Authentication Methods
- Question: Filtering fields of elastic document.
- question: keystore and truststore file formats
- Avro parsing error StringIndexOutOfBoundsException HOT 1
- Index Prefix is not working as intended, it is copying all the indices
- org.apache.kafka.connect.errors.DataException: xxx is not a valid field name
- Connector failed to run
- The connector starts but fails to connect to OpenSearch nodes
- Update dependencies
- How to disable schema Avro? HOT 1
- Caused by: org.apache.http.ContentTooLongException: entity content is too long [107962506] for the configured buffer limit [104857600] HOT 5
- How to use queries to poll only certain documents into the Kafka Topic? HOT 3
- Now that ElasticSourceConnector is working, how to consume messages using Java from Kafka Topic? HOT 5
- converting list: type not supported HOT 3
- Can I limit the bandwidth of data obtained from ES? HOT 2
- Replay data which already exist on elastic HOT 1
- Connector Vulnerabilities HOT 3
- Connector Vulnerabilities HOT 6
- org.apache.kafka.connect.errors.DataException: Invalid type for INT64: class java.lang.Double HOT 3
- number of tasks, their state and relation to the connector state HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from kafka-connect-elasticsearch-source.