docs/modules/ROOT/pages/backfill-cli.adoc (6 additions, 12 deletions)
@@ -3,7 +3,7 @@
 When CDC is enabled on a table, the data topic doesn't contain any data from before CDC was enabled.
 The backfill CLI solves this problem by exporting the table's primary key to a Comma Separated Values (CSV) file, storing the CSV file on disk, and sending the primary key from the CSV file to the event topic.
 The Cassandra Source Connector reads the primary key from the event topic and populates the data topic with historical data.
-The backfill CLI is powered by the https://docs.datastax.com/en/dsbulk/docs/reference/dsbulkCmd.html[DataStax Bulk Loader], a battle-tested data loader tool. This means the CLI takes full advantage of optimizations done in DSBulk when exporting data from table to disk.
+The backfill CLI is powered by the xref:dsbulk:overview:dsbulk-about.adoc[DataStax Bulk Loader], a battle-tested data loader tool. This means the CLI takes full advantage of optimizations done in DSBulk when exporting data from table to disk.

 Developers can also use the backfill CLI to trigger change events for downstream applications without having to insert new data.
@@ -64,7 +64,7 @@ The Pulsar-admin extension is packaged with the IBM Elite Support for Apache Pul

 . Move the generated NAR archive to the /cliextensions folder of your Pulsar installation (e.g. /pulsar/cliextensions).
 . Modify the client.conf file of your Pulsar installation to include: `customCommandFactories=cassandra-cdc`.
-. Run the following command (this assumes the https://docs.datastax.com/en/installing/docs/installTARdse.html[default installation] of DSE Cassandra):
+. Run the following command (this assumes the https://docs.datastax.com/en/dse/6.8/installing/tarball-dse.html[default tarball installation of DSE]):
 +
 [source,shell]
 ----
@@ -80,11 +80,11 @@ This test quickly confirms your CDC backfill is working correctly.

 *Prerequisites:*

-* A running https://docs.datastax.com/en/installing/docs/installTARdse.html[DSE Cassandra cluster]
+* A running DSE cluster
 * A running Pulsar cluster (https://pulsar.apache.org/docs/getting-started-standalone/[standalone] is fine)
 * Backfill CLI built with Gradle (see <<install>>)

-. Start DSE Cassandra from the https://docs.datastax.com/en/installing/docs/installTARdse.html[installation directory].
+. Start DSE:
 +
 [source,bash]
 ----
@@ -299,7 +299,7 @@ value.
 |An extra DSBulk option to use when exporting. Any valid DSBulk option
 can be specified here, and it will be passed as-is to the DSBulk
 process. DSBulk options, including driver options, must be passed as
-'--long.option.name=<value>'. Short options are not supported. For more DSBulk options, see https://docs.datastax.com/en/dsbulk/docs/reference/commonOptions.html[here].
+'--long.option.name=<value>'. Short options are not supported.

 |--export-host=HOST[:PORT]
 |The host name or IP and, optionally, the port of a node from the
@@ -380,10 +380,4 @@ These parameters should be passed as command line arguments in the standalone Ja
 |The path to the trusted TLS certificate file.
 |--pulsar-ssl-use-key-store-tls
 |If TLS is enabled, specifies whether to use KeyStore type as TLS configuration parameter.
* For more on using CDC with Apache Pulsar, including schema management and consumption patterns, see our https://docs.datastax.com/en/streaming/streaming-learning/use-cases-architectures/change-data-capture/index.html[Streaming learning page].
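The option table in this file states that extra DSBulk options must be passed as `'--long.option.name=<value>'` and that short options are not supported. As a rough illustration of that rule, here is a minimal Python sketch of a pre-flight check. The helper name `is_long_dsbulk_option` is hypothetical and is not part of the backfill CLI; the CLI's own validation may differ.

```python
def is_long_dsbulk_option(arg: str) -> bool:
    """Return True if arg has the shape '--long.option.name=<value>'.

    Hypothetical helper for illustration only; it checks the shape of the
    argument, not whether DSBulk actually recognizes the option name.
    """
    if not arg.startswith("--"):
        return False  # short options such as '-k' are rejected
    name, sep, value = arg[2:].partition("=")
    # Require a name that starts with a letter, an '=' separator, and a value.
    return bool(name) and name[0].isalpha() and sep == "=" and bool(value)


if __name__ == "__main__":
    for candidate in ("--schema.keyspace=ks1", "-k=ks1", "--schema.keyspace"):
        print(candidate, is_long_dsbulk_option(candidate))
```

A check like this could run before the arguments are handed to the export step, so that a short option fails fast instead of being passed through as-is.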
docs/modules/ROOT/pages/cdcExample.adoc (6 additions, 7 deletions)
@@ -4,9 +4,8 @@ Capture schema changes in your C* tables and pass them to Apache Pulsar(R) with

 This installation requires the following. Latest version artifacts are available https://github.com/datastax/cdc-apache-cassandra/releases/latest[here]. Use image:https://img.shields.io/github/v/release/datastax/cdc-apache-cassandra?color=green&display_name=tag[link="https://github.com/datastax/cdc-apache-cassandra/releases/latest"] for the latest version.
docs/modules/ROOT/pages/index.adoc (1 addition, 6 deletions)
@@ -159,9 +159,4 @@ To ensure the data sent to all datacenters are delivered to the data topic, make
 For example, given a Cassandra cluster with three datacenters (DC1, DC2, and DC3), you would enable CDC and install the change agent in only DC1.
 To ensure all updates in DC2 and DC3 are propagated to the data topic, configure the table's keyspace to replicate data from DC2 and DC3 to DC1.
 For example, `replication = {'class': 'NetworkTopologyStrategy', 'dc1': 3, 'dc2': 3, 'dc3': 3})`.
-The data replicated to DC1 will be processed by the change agent and eventually end up in the data topic.
-
-== What's next?
-
-* For more on using CDC with Apache Pulsar, including schema management and consumption patterns, see our https://docs.datastax.com/en/streaming/streaming-learning/use-cases-architectures/change-data-capture/index.html[Streaming learning page].
-* If you've got more questions about {cdc_cass_first}, see xref::faqs.adoc[].
+The data replicated to DC1 will be processed by the change agent and eventually end up in the data topic.
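The replication example in this hunk can be spelled out as a full CQL statement. The Python sketch below builds an `ALTER KEYSPACE` statement with `NetworkTopologyStrategy` for a set of datacenters. The helper name `replication_cql` and the keyspace name `ks1` are assumptions made for illustration, and the datacenter names must match the names your cluster actually reports.

```python
def replication_cql(keyspace: str, dc_replicas: dict) -> str:
    """Build an ALTER KEYSPACE statement that replicates to each datacenter.

    Hypothetical helper: it only formats the statement and does not
    connect to a cluster or execute anything.
    """
    if not dc_replicas:
        raise ValueError("at least one datacenter is required")
    parts = ["'class': 'NetworkTopologyStrategy'"]
    # One entry per datacenter: name -> replication factor.
    parts += [f"'{dc}': {int(rf)}" for dc, rf in dc_replicas.items()]
    body = ", ".join(parts)
    return f"ALTER KEYSPACE {keyspace} WITH replication = {{{body}}};"


if __name__ == "__main__":
    print(replication_cql("ks1", {"dc1": 3, "dc2": 3, "dc3": 3}))
```

With all three datacenters listed, writes accepted in DC2 or DC3 are also replicated to DC1, where the change agent runs.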
docs/modules/ROOT/pages/install.adoc (4 additions, 18 deletions)
@@ -1,15 +1,6 @@
 = Installing {cdc_cass} for VM deployment

-== Download the DataStax Change Data Capture (CDC) Agent for Apache Cassandra(R)
-
-[IMPORTANT]
-====
-By downloading this DataStax product, you agree to the terms of the open-source https://www.apache.org/licenses/LICENSE-2.0[Apache-2.0 license agreement].
-====
-
-Perform the following steps:
-
-. Download the change agent tar file from the https://downloads.datastax.com/#cassandra-change-agent[DataStax downloads page]. +
+. Download the `cassandra-source-agents` tar file from the https://github.com/datastax/cdc-apache-cassandra/releases[{cdc_cass} GitHub repository].
 The following files are available in the tar file:
By downloading this DataStax product, you agree to the terms of the open-source https://www.apache.org/licenses/LICENSE-2.0[Apache-2.0 license agreement].
-====
-
-Download the `cassandra-source-connectors-<version>.tar` file from the https://downloads.datastax.com/#cassandra-source-connector[DataStax downloads page].
+Download the `cassandra-source-connectors` tar file from the https://github.com/datastax/cdc-apache-cassandra/releases[{cdc_cass} GitHub repository].

 For Apache Pulsar and IBM Elite Support for Apache Pulsar (formerly DataStax Luna Streaming) 2.8, the `pulsar-cassandra-source-<version>.nar` file is available.
@@ -198,7 +184,7 @@ The following table identifies functionally equivalent {cdc_pulsar} and DataStax
 NOTE: If you define both in your configuration, the {cdc_pulsar} setting take precedence over the `datastax-java-driver.property-name`.
 If you do not provide either in your configuration, {cdc_pulsar} defaults are in effect.

-For information about the Java properties, refer to the link:https://docs.datastax.com/en/developer/java-driver-dse/2.3/manual/core/configuration/[DataStax Java driver documentation].
+For information about the Java properties, refer to the https://docs.datastax.com/en/developer/java-driver/4.3/manual/core/configuration/reference/index.html[DataStax Java driver documentation].

 |===
 | {csc_pulsar_first} | Using datastax-java-driver prefix
For more information, refer to the link:https://docs.datastax.com/en/developer/java-driver/4.3/manual/core/configuration/reference/[Java driver reference configuration] topic.
+For more information, refer to the https://docs.datastax.com/en/developer/java-driver/4.3/manual/core/configuration/reference/index.html[Java driver reference configuration] topic.
* The change agent exposes metrics with https://docs.datastax.com/en/landing_page/doc/landing_page/metricsandalerts.html[JMX], a technology within Java that provides tools for managing and monitoring applications.
-* https://docs.datastax.com/en/opscenter/6.8/[DSE Ops Center] can collect these exposed metrics for visualization and alerts, and pass them on to https://docs.datastax.com/en/monitoring/doc/monitoring/opsUseMetricsCollector.html[DSE Metrics Collector] for additional integration with Prometheus and Grafana.
+* The change agent exposes metrics with https://docs.datastax.com/en/planning/dse/metrics-alerts.html[JMX], a technology within Java that provides tools for managing and monitoring applications.
+* https://docs.datastax.com/en/opscenter/6.8/overview/opscenter-about.html[DSE Ops Center] can collect these exposed metrics for visualization and alerts, and pass them on to https://docs.datastax.com/en/monitoring/ops-use-metrics-collector.html[DSE Metrics Collector] for additional integration with Prometheus and Grafana.
 * The https://github.com/datastax/metric-collector-for-apache-cassandra[Metrics Collector for Apache Cassandra] with Prometheus and Grafana dashboards provides the same functionality as DSE Metrics Collector, built on the well-supported collectd agent.
 * Other monitoring tools like https://github.com/prometheus/jmx_exporter[JMX Exporter] by Prometheus are available, but may require additional tuning.