Elasticsearch

Elasticsearch is a distributed, RESTful search and analytics engine designed for fast and scalable data retrieval and storage.

Bruin supports Elasticsearch both as a source and as a destination for ingestr assets. You can use it to ingest data from Elasticsearch into your data warehouse, or load data from other sources into Elasticsearch.

To set up an Elasticsearch connection, you need to add a configuration item to the .bruin.yml file and to your asset file.

Follow the steps below to correctly set up Elasticsearch as a data source and run ingestion.

Step 1: Add a connection to .bruin.yml file

To connect to Elasticsearch, you need to add a configuration item to the connections section of the .bruin.yml file. This configuration must comply with the following schema:

yaml
  elasticsearch:
    - name: "elasticsearch"
      username: "username_123"
      password: "pass_123"
      host: "localhost"
      port: 5000
      secure: "false"
      verify_certs: "false"

  • username: The username used to authenticate with Elasticsearch.
  • password: The password associated with the specified username.
  • host: The host address of the Elasticsearch server.
  • port: The port number used by the Elasticsearch server.
  • secure: Enables HTTPS when set to true. By default, it is true.
  • verify_certs: Verifies TLS certificates when set to true. By default, it is true.
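
For a hosted Elasticsearch cluster reached over HTTPS, the same schema applies. The sketch below uses placeholder credentials and host; secure and verify_certs are omitted because they default to true:

yaml
  elasticsearch:
    - name: "elasticsearch"
      username: "elastic"
      password: "your_password"
      host: "my-cluster.es.example.com"
      port: 443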

Step 2: Create an asset file for data ingestion

To ingest data from Elasticsearch, you need to create an asset configuration file. This file defines the data flow from the source to the destination. Create a YAML file (e.g., elasticsearch_ingestion.yml) inside the assets folder and add the following content:

yaml
name: public.elasticsearch
type: ingestr
connection: postgres

parameters:
  source_connection: elasticsearch
  source_table: 'index-name'

  destination: postgres

  • name: The name of the asset.
  • type: Specifies the type of the asset. Set this to ingestr to use the ingestr data pipeline.
  • connection: This is the destination connection, which defines where the data should be stored. For example: postgres indicates that the ingested data will be stored in a Postgres database.
  • source_connection: The name of the Elasticsearch connection defined in .bruin.yml.
  • source_table: The name of the Elasticsearch index from which you want to ingest data.
  • destination: The destination platform the data will be loaded into, e.g. postgres. Other ingestr-supported destinations follow the same pattern (see the sketch below).
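
Postgres is only one possible destination. As a hypothetical sketch, the asset below would load the same index into DuckDB instead, assuming a DuckDB connection named duckdb is defined in .bruin.yml:

yaml
name: raw.elasticsearch_data
type: ingestr
connection: duckdb

parameters:
  source_connection: elasticsearch
  source_table: 'index-name'

  destination: duckdb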

Step 3: Run asset to ingest data

bruin run assets/elasticsearch_ingestion.yml

As a result of this command, Bruin will ingest data from the given Elasticsearch index into your Postgres database.

Using Elasticsearch as a Destination

Elasticsearch can also be used as a destination to load data from other sources. This is useful for building search indexes or consolidating data for analytics.

Example: Loading data into Elasticsearch

To use Elasticsearch as a destination, create an asset file that specifies Elasticsearch as the destination:

yaml
name: elasticsearch.my_index
type: ingestr
connection: elasticsearch

parameters:
  source_connection: postgres
  source_table: 'public.my_table'

  destination: elasticsearch
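
Both connections referenced by this asset must be defined in .bruin.yml. A minimal sketch is shown below; the postgres fields are illustrative and must match your own database:

yaml
  postgres:
    - name: "postgres"
      username: "postgres_user"
      password: "pass_123"
      host: "localhost"
      port: 5432
      database: "dev"
  elasticsearch:
    - name: "elasticsearch"
      username: "username_123"
      password: "pass_123"
      host: "localhost"
      port: 5000
      secure: "false"
      verify_certs: "false"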

When you run this asset, Bruin will:

  • Load data from the source into the specified Elasticsearch index
  • Create the index automatically if it doesn't exist
  • Use a 'replace' strategy, which deletes the existing index before loading new data

Important Notes:

  • For cloud Elasticsearch instances, HTTPS (port 443) is typically used and the secure parameter defaults to true
  • For local Elasticsearch instances without HTTPS, set secure: "false" in the connection configuration
  • The target index will be created with the full name specified in the asset (e.g., name: elasticsearch.my_index creates an index called elasticsearch.my_index)