Getting Started with Fivetran

This guide will walk you through setting up Fivetran and creating your first data connector.

Prerequisites

Before you start, make sure you have:

A Fivetran account (sign up for free trial)
Access to a data warehouse (Snowflake, BigQuery, Redshift, Databricks, etc.)
Warehouse credentials with permissions to create schemas and tables
Access to a source system you want to sync data from (database, SaaS app, etc.)
Source system credentials with read permissions

Step 1: Create Your Fivetran Account

Sign Up

Visit fivetran.com/signup
Choose your plan:
- Free Trial: 14-day trial with full features
- Starter: For small teams (500k MAR free)
- Enterprise: For larger organizations

Set Up Your Account

Verify your email address
Complete the onboarding questionnaire
Access the Fivetran dashboard

Step 2: Connect Your Data Warehouse (Destination)

Fivetran needs to know where to load your data. Set up your destination first.

Snowflake Example

In Fivetran Dashboard:
- Click Destinations → + Add destination
- Select Snowflake
Create Dedicated Fivetran User in Snowflake:

Enter Connection Details in Fivetran:
- Host: account.region.snowflakecomputing.com
- Port: 443
- User: FIVETRAN_USER
- Password: Your password
- Database: FIVETRAN_DB
- Connection Method: Choose authentication type
Test Connection: Click "Test" to verify connectivity

BigQuery Example

In Google Cloud Console:
- Create a service account for Fivetran
- Grant roles: BigQuery Data Editor, BigQuery Job User
- Download JSON key file
In Fivetran Dashboard:
- Click Destinations → + Add destination
- Select BigQuery
- Upload service account JSON key
- Select project and dataset
- Test connection

Other Supported Warehouses

Amazon Redshift
Azure Synapse Analytics
Databricks
PostgreSQL
MySQL
Snowflake
Google BigQuery
And more...

Step 3: Create Your First Connector (Source)

Now let's connect a data source. We'll use PostgreSQL as an example.

Example 1: PostgreSQL Database Connector

1. Prepare Your PostgreSQL Database

Enable logical replication for CDC (optional but recommended):

Create Fivetran user with read permissions:

2. Configure Connector in Fivetran

In Fivetran Dashboard:
- Click Connectors → + Add connector
- Search and select PostgreSQL
Enter Connection Details:
- Host: Your database hostname
- Port: 5432 (default)
- User: fivetran_user
- Password: Your password
- Database: your_database
Choose Connection Method:
- Direct Connection: If Fivetran can reach your database directly
- SSH Tunnel: For databases behind firewalls
- Reverse SSH Tunnel: For maximum security
Configure Sync Settings:
- Sync Frequency: Every 6 hours (default) - adjust based on needs
- Replication Method:
  - Logical Replication (CDC) - real-time, most efficient
  - XMIN - good for smaller databases
  - Batch - full table scans
Select Tables to Sync:
- Choose schemas and tables to include/exclude
- Review column selection
- Set primary keys if not detected
Test & Save: Test connection and save connector

Example 2: SaaS Application (Salesforce)

1. In Fivetran Dashboard:

Click Connectors → + Add connector
Search and select Salesforce

2. Authenticate:

Click Authorize to connect via OAuth
Log in to Salesforce
Grant permissions to Fivetran

3. Configure Sync:

Sync Frequency: Every 6 hours (or custom)
API Type:
- Bulk API - for historical loads (recommended)
- REST API - for smaller data volumes
Objects: Select Salesforce objects (tables) to sync
- Standard objects: Account, Contact, Opportunity, Lead, etc.
- Custom objects: Your custom Salesforce objects

4. Historical Sync:

Choose historical sync range (all time, last year, etc.)
Initial sync may take hours depending on data volume

5. Test & Save

Step 4: Monitor Your First Sync

Understanding Sync Status

Go to Connector Dashboard:
- View sync history
- Monitor current sync progress
- Check for errors or warnings
Sync Phases:
- Historical Sync: Initial load of all data (first sync only)
- Incremental Sync: Ongoing syncs of new/changed data
- Schema Update: When source schema changes are detected
Key Metrics to Watch:
- Rows synced: Number of rows updated
- Duration: Time taken for sync
- MAR (Monthly Active Rows): For billing purposes
- Warnings/Errors: Any issues encountered

Check Data in Your Warehouse

After the sync completes, verify data has arrived:

Snowflake:

BigQuery:

Step 5: Understand Fivetran Schema Conventions

Schema Naming

Fivetran creates schemas in your warehouse based on the connector:

salesforce - Salesforce data
google_analytics - Google Analytics data
postgres_public - PostgreSQL public schema
mysql_db_name - MySQL database

Table Naming

Tables retain their source names (lowercase)
Nested/semi-structured data is flattened into relational tables
Arrays become separate tables with foreign keys

System Columns

Fivetran adds metadata columns to every table:

_fivetran_synced - Timestamp when row was last synced
_fivetran_deleted - Boolean indicating if row was deleted (soft deletes)
_fivetran_id - Unique identifier for deduplication (when no primary key exists)

Example Schema Structure

Step 6: Set Up Transformations (Optional)

Fivetran integrates natively with dbt for transformations.

Enable dbt Transformations

In Fivetran Dashboard:
- Navigate to Transformations
- Click + Create Transformation
- Choose dbt Core or dbt Cloud
Connect Your Git Repo:
- Link GitHub, GitLab, or Bitbucket
- Select repository with dbt project
- Choose branch to run
Configure Run Schedule:
- After each connector sync - Most common
- On a schedule - Independent of syncs
- Manual - Run on-demand only
Environment Variables:
- Set dbt profiles and environment variables
- Configure warehouse connection for transformations

Use Fivetran dbt Packages

Fivetran provides pre-built dbt packages for common sources:

These packages include:

Pre-built staging and mart models
Standard transformations and enrichments
Tests and documentation
Best practice SQL patterns

Step 7: Configure Alerts and Notifications

Stay informed about your data pipelines:

In Fivetran Dashboard:
- Go to Settings → Notifications
Configure Alert Channels:
- Email: Send to team members
- Slack: Post to channels
- PagerDuty: For critical alerts
- Webhooks: Custom integrations
Alert Types:
- Sync failures: When a connector fails
- Schema changes: When source schema changes
- Data volume anomalies: Unusual row counts
- Sync delays: When syncs take longer than expected
- Connector warnings: Non-fatal issues

Common Setup Patterns

Pattern 1: Multi-Environment Setup

Separate dev/staging/prod environments:

Tips:

Use separate Fivetran connectors for each environment
Sync production more frequently (hourly)
Sync staging less frequently (daily) to save costs
Use Fivetran's duplicate connector feature

Pattern 2: Multi-Region Setup

For global companies with regional data:

Pattern 3: Hybrid Cloud Setup

Mix of cloud warehouses:

Snowflake: For operational analytics
BigQuery: For machine learning workloads
Fivetran: Sync same sources to both warehouses

Troubleshooting Common Issues

Issue: "Connection Timeout"

Causes:

Firewall blocking Fivetran IPs
Incorrect host/port
Network connectivity issues

Solutions:

Whitelist Fivetran IP addresses
Verify host and port are correct
Use SSH tunnel or Reverse SSH for databases behind firewalls
Test connectivity with telnet or nc from allowed network

Issue: "Permission Denied"

Causes:

Insufficient database permissions
User doesn't have read access to tables
Warehouse permissions not granted

Solutions:

Review and grant necessary permissions (see Step 3)
Verify user can query tables manually
Check warehouse-specific requirements in Fivetran docs

Issue: "High MAR Usage"

Causes:

Syncing unnecessary tables
Full table replication instead of incremental
High update frequency on large tables

Solutions:

Exclude unnecessary tables and columns
Switch to incremental or CDC sync modes
Reduce sync frequency for less critical data
Review MAR breakdown in Fivetran dashboard

Issue: "Schema Drift Warnings"

Causes:

Source schema changed (column added/removed)
Data type changes in source

Solutions:

Review schema change notifications
Approve schema changes in Fivetran dashboard
Update downstream dbt models if needed
Enable auto-approval for trusted sources

Next Steps

Congratulations! You've set up your first Fivetran connector. Here's what to do next:

Add More Connectors:
- Identify other critical data sources
- Set up additional connectors following same process
Optimize Sync Schedules:
- Align with business needs
- Balance freshness vs. cost
Set Up Transformations:
- Install dbt if not already done
- Use Fivetran dbt packages as starting point
- Build custom transformations (dbt guide)
Implement Monitoring:
- Set up alerts for failures
- Monitor MAR usage
- Track sync performance
Learn Best Practices:
- Review Fivetran best practices
- Explore use cases
- Try hands-on tutorials

Getting Help

Fivetran Support

Documentation: fivetran.com/docs
Support Portal: Available to paid customers
Status Page: status.fivetran.com

Community Resources

Fivetran Community: community.fivetran.com
Knowledge Base: Searchable help articles
Slack: Join data engineering communities

Need Expert Help?

Setting up a robust data integration platform involves many considerations: security, cost optimization, connector selection, and architecture design. Contact me for consulting on Fivetran implementation and ELT architecture.

Getting Started with Fivetran

Getting Started with Fivetran

Prerequisites

Step 1: Create Your Fivetran Account

Sign Up

Set Up Your Account

Step 2: Connect Your Data Warehouse (Destination)

Snowflake Example

BigQuery Example

Other Supported Warehouses

Step 3: Create Your First Connector (Source)

Example 1: PostgreSQL Database Connector

1. Prepare Your PostgreSQL Database

2. Configure Connector in Fivetran

Example 2: SaaS Application (Salesforce)

1. In Fivetran Dashboard:

2. Authenticate:

3. Configure Sync:

4. Historical Sync:

5. Test & Save

Step 4: Monitor Your First Sync

Understanding Sync Status

Check Data in Your Warehouse

Step 5: Understand Fivetran Schema Conventions

Schema Naming

Table Naming

System Columns

Example Schema Structure

Step 6: Set Up Transformations (Optional)

Enable dbt Transformations

Use Fivetran dbt Packages

Step 7: Configure Alerts and Notifications

Common Setup Patterns

Pattern 1: Multi-Environment Setup

Pattern 2: Multi-Region Setup

Pattern 3: Hybrid Cloud Setup

Troubleshooting Common Issues

Issue: "Connection Timeout"

Issue: "Permission Denied"

Issue: "High MAR Usage"

Issue: "Schema Drift Warnings"

Next Steps

Getting Help

Fivetran Support

Community Resources

Need Expert Help?

Stay in the loop

Fivetran

Fivetran Best Practices