LogoLogo
HOMEBLOG
  • Getting Started
  • Connect to Data
    • Projects
    • Data Connectors
      • Google BigQuery
      • Google Cloud Storage
      • Iceberg
      • Snowflake
      • AWS S3
      • AWS Athena
      • AWS Redshift
      • Databricks Delta
      • Azure Blob
      • Salesforce
      • SAP Hana
      • File Path Options
      • SQL Server
      • Trino
    • Connection Modes
    • Triggering Scans
    • Configuring a Data Source
  • Profiling Data
    • Data Health Metrics
    • Data Health Overview Page
    • Interactive Profiling Tool: Investigator
    • Data Diff
    • Compound Attributes
      • List of Supported Functions
  • Monitoring Data
    • Data Quality Metrics
    • Alert Policies
    • Data Trends and Alerts
    • Metrics Inspector
  • Data Quality Rules
    • Rules Expression Examples
  • PII Data Detection
  • Remediation
    • Data Binning
    • Circuit Breaker
  • Integrations
    • Jira Integration
    • Slack
    • Jobs Status Notification
  • User Management
    • Okta SSO Setup
    • SSO Configuration
  • API Reference
    • Authentication API
    • API Keys
    • Telmai IP List
    • Get Google Service Account API
  • Source APIs
    • Source APIs
  • Upload Data APIs
    • Upload data from Cloud
      • RedShift Request data
      • GCS Request data
      • Azure Request data
      • GBQ Request data
      • Snowflake Request data
      • Amazon S3 Request data
      • Delta Lake Request
      • Trino Request data
    • Track upload job
    • Check for alerts
  • Admin APIs
    • User Management
  • Telmai Releases
    • Release Notes
      • 25.2.0
      • 25.1.3
      • 25.1.2
      • 25.1.0
Powered by GitBook
On this page
  1. Connect to Data

Configuring a Data Source

PreviousTriggering ScansNextProfiling Data

Last updated 1 month ago

After creating a Data Source, you can optionally configure advanced settings to tailor its behavior.

  1. Configure advanced parameter attributes such as ID, Filters, Segmentation, and Scan Type.

  2. Set the scan schedule according to your requirements.

To access these options, click the 3-dot icon next to the Data Source name:

Advanced Parameters

  1. Click Edit Connection in the context menu

  2. Navigate to the Advanced tab to begin configuration.

Filter Attribute

Use this option to monitor only a subset of data that meets specific criteria. Provide the Filter Attribute Name and a list of values. Telmai will analyze only the records whose attribute values match those in the list.

Segmentation Attribute

Monitor data from different segments independently by specifying a Segmentation Attribute. You can list values to define segments. For example, setting segmentation based on Address.State might define segments for CA, TX, and others. In case of null values you can define it as __null__. Telmai will profile data, calculate metrics, and detect anomalies for each segment as well as for the entire dataset.

Id Attribute

The ID Attribute is optional but provides additional insights, including:

  1. Tracking the uniqueness of records based on IDs.

  2. Displaying record IDs for every anomalous value.

  3. Splitting good records from bad ones using the Data Binning feature.

Timestamp Attribute

The Timestamp Attribute is also optional but can be valuable for:

  • Determining record-level freshness (e.g., percentage of outdated records).

  • Identifying newly added records, enabling Delta-only scans.

When the Timestamp Attribute is provided and the Delta Only checkbox is checked, Telmai will fetch and scan only new records. Note that for some sources, like files or Databricks Delta, Delta detection occurs natively and does not require this attribute.