Fivetran

Last Updated: 17th February 2025
Analyst Coverage: Daniel Howard and Philip Howard

Fivetran was founded in 2012 and originally targeted the business intelligence market. However, it has since re-positioned itself, first towards data integration and more recently towards automated data movement in general. As a provider of data integration (and now data movement) as a managed service, it has seen very significant year-on-year growth. The company has more than 6,000 customers across North America, Europe and Asia-Pacific, moving over 2,000TB of data monthly with 99.9% uptime. In 2021 it acquired HVR, a high-volume, real-time data replication solution. It has since rebranded HVR as Local Data Processing and used it to deploy its data movement platform more readily at enterprise scale.

The company is backed by venture capitalists and has offices in the United States (including its headquarters), Ireland, Australia, the UK, Germany, Serbia, the Netherlands, and India. It has more than 1,000 employees worldwide and an extensive partner network. The company pursues a product-led go-to-market strategy built on a land-and-expand model. Its eponymous solution uses scalable, consumption-based pricing across Free, Starter, Standard and Enterprise plans, alongside Business Critical and Private Deployment (for on-prem workloads) options.

Company Info

Headquarters: 405 14th St, Floor 11, Oakland, CA 94612
Telephone: +1 (415) 805 2799

Fivetran

Last Updated: 18th July 2024
Mutable Award: Gold 2024

What is it?

Fig 01 - Fivetran enterprise data platform

Fivetran is a fully managed, cloud-native solution for automated data movement (most notably data integration) that leverages ELT technology alongside ongoing, micro-batch-based CDC (Change Data Capture) synchronisation. It is designed to sit between your data sources (including legacy data sources) and your downstream applications, effectively acting as a centralised layer for data movement (see Figure 1).

The platform offers self-hosted (specifically, HVR), SaaS, and hybrid deployment models. The hybrid model is an integrated option (currently in beta) that involves hosting the platform in the cloud but processing your data in a secure, on-prem environment. In this way, it aims to give you the best of both worlds. Fivetran offers scalable, consumption-based pricing via its Free, Starter, Standard and Enterprise plans, with premium support available as an add-on. The add-on is intended to ensure that participating clients receive priority access to customer support, and we suspect it will be particularly attractive to enterprise customers.

Customer Quotes

“We couldn’t even come close to enabling self-service with any of our previous tools and processes.”
Autodesk

“We used to spend 80% of the time moving data over to build campaigns; that’s fallen to 20%.”
Nando’s

“Databricks and Fivetran are the best tools on the market.”
Paul Hewitt

What does it do?

Data integration in Fivetran has three key facets: prebuilt, fully managed connectors; normalisation of the data you are moving; and provision of analysis-ready schemas for target connectors. Between them, these make the process of data integration (and data movement) almost completely automatic. Moreover, Fivetran operations are idempotent, meaning that its pipelines are essentially self-correcting: when a data sync fails, it can be re-run without creating duplicate data, so data integrity is maintained. Idempotency is achieved, in part, because Fivetran will automatically add or remove columns whenever there is a schema change. This is enabled by built-in CDC, which can be utilised with or without log-based updates, and in the former case with or without agent-based connectors, as you prefer.
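To illustrate why idempotent loading prevents duplicates, the following is a minimal sketch in Python. It is not Fivetran's implementation: the `apply_batch` function, the in-memory `target` dictionary, and the `id` primary key are all hypothetical names chosen for the example. It assumes only that each row is written by primary key, so that replaying a batch after a failed sync overwrites rows rather than appending them.

```python
from typing import Dict, Iterable

Row = Dict[str, object]


def apply_batch(target: Dict[object, Row], batch: Iterable[Row], key: str = "id") -> None:
    """Upsert each row by primary key (hypothetical helper, for illustration only).

    Because a row is keyed on its primary key, applying the same batch twice
    leaves the target in exactly the same state as applying it once.
    """
    for row in batch:
        target[row[key]] = dict(row)  # insert or overwrite, never append


# A stand-in for a destination table, keyed by primary key.
target: Dict[object, Row] = {}
batch = [{"id": 1, "name": "Ada"}, {"id": 2, "name": "Grace"}]

apply_batch(target, batch)
snapshot = {k: dict(v) for k, v in target.items()}

# Simulate a retry after a failed sync: the same batch is delivered again.
apply_batch(target, batch)

# The retry produced no duplicates and changed nothing.
assert target == snapshot
assert len(target) == 2
```

An append-only load would have produced four rows after the retry. Keying writes on the primary key is what makes the retry safe in this sketch.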

Fivetran currently offers more than 500 fully managed connectors that have been purpose-built to support a wide variety of data sources, destinations, and use cases. These include SaaS applications, on-premises and cloud-hosted databases, file systems, cloud data warehouses, event services, legacy data sources, and more. In particular, the product supports various cloud platforms across the ‘big 3’ cloud service providers (AWS, Azure, and Google Cloud), including Snowflake, Databricks, Google BigQuery, Amazon Redshift, and Azure Synapse. Multi-cloud is also supported, as are data lakes hosted on Amazon S3, Azure Data Lake Storage, or OneLake. Facilities to prevent data lakes from becoming data swamps are available as well. For example, you can use Fivetran to convert your big data into an open table format (such as Apache Iceberg or Delta Lake), making curation and compliance much easier by providing some degree of structure (and the enhanced functionality that comes with it). For otherwise unsupported data sources, you can have Fivetran create custom “Lite” connectors via the company’s By Request program. You can also create your own connectors via the provided Custom Connector Framework, and Fivetran partners have access to an SDK to do the same.

The platform is highly extensible and can be integrated with various third-party tools. Notably, this includes data catalogue integration, which may be especially useful for providing additional visibility into your data. Data visibility is further supported by the platform’s metadata sharing and column-level lineage functionality. In addition, the product particularly targets integration with the cloud, and offers various features to support this, such as compatibility with several cloud platforms and minimised compute usage. Past that, the company’s robust partner network provides access to extensive data management capabilities, including data governance, data cataloguing, data masking, and so on. This is enhanced by Fivetran’s ability to automatically propagate associated metadata during data movement, which is particularly useful if you are, say, using it to feed a data catalogue.

The product provides a robust library of pre-built, SQL-based dbt data models that can transform, join, and calculate connector-loaded data to meet common reporting requirements. Some of these models can be downloaded and orchestrated within Fivetran directly using Quickstart transformations. You can also integrate your own dbt project into the platform to orchestrate and manage any custom data models you might have. With both methods, you can synchronise model-run orchestration with connector loads, reducing data latency and computational costs. This is visualised in a data lineage graph, providing observability. Integration with dbt offers version control, logging, alerting, and various other features. Perhaps most notably, this includes data quality functionality that can be built into your data movement pipelines.

Additional capabilities are available, such as support for stream processing (including integration with Apache Kafka), automatic data updates, and automated schema migrations, management, and drift handling. On the latter point, Fivetran will also standardise your schemas for easy querying and API access (for example, by applying deduplication processes). These revamped schemas are fully documented by the product. It also features integrated scheduling for your data movement jobs, and can set transformations to run automatically whenever data is loaded into your system.
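As a rough illustration of what schema drift handling involves, the sketch below compares the columns seen at the source with those present in the target, then reshapes rows to match. It is an assumption-laden example rather than a description of how Fivetran works internally: `diff_schema` and `migrate_rows` are hypothetical helpers, and the choice to fill new columns with `None` and to drop removed columns is made purely for the example.

```python
from typing import Dict, List, Set, Tuple

Row = Dict[str, object]


def diff_schema(target_cols: Set[str], source_cols: Set[str]) -> Tuple[Set[str], Set[str]]:
    """Return (added, removed) columns relative to the target (hypothetical helper)."""
    added = source_cols - target_cols
    removed = target_cols - source_cols
    return added, removed


def migrate_rows(rows: List[Row], target_cols: Set[str], source_cols: Set[str]) -> List[Row]:
    """Reshape existing target rows so that they match the new source schema."""
    added, removed = diff_schema(target_cols, source_cols)
    migrated: List[Row] = []
    for row in rows:
        # Drop columns that no longer exist at the source (an assumption of this sketch).
        new_row = {col: val for col, val in row.items() if col not in removed}
        # Add newly appeared columns, with no value yet for historical rows.
        for col in added:
            new_row.setdefault(col, None)
        migrated.append(new_row)
    return migrated


existing = [{"id": 1, "legacy_code": "x"}]
result = migrate_rows(existing, {"id", "legacy_code"}, {"id", "email"})
assert result == [{"id": 1, "email": None}]
```

In a real warehouse the same diff would typically be translated into column-level DDL statements. The point here is only that drift can be detected and applied mechanically, without manual intervention.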

What’s more, security and governance are clear priorities for Fivetran. The product is certified against industry standards and regulations, including GDPR, SOC 2, ISO 27001, PCI and HIPAA. It encrypts all data, both in transit and at rest, and data moved into the Fivetran environment is deleted as soon as its data movement workflow is verified. It also provides role-based access control and authorisation, including automated user provisioning (secured via programmatic controls accessed through a REST API), support for Azure AD (Active Directory), single sign-on integration, and various other features. Moreover, the product’s Connect Card functionality lets your users access Fivetran through third-party interfaces and applications without compromising data security or requiring them to interact with the back-end system. By the same token, you can also use it to effectively white label the Fivetran platform.

Governance and regulatory compliance are further supported by the product’s automatic detection of PII within source data. More specifically, Fivetran identifies types of sensitive data within your connector schema, then proactively protects (via column-level masking or blocking) associated data before it lands in the target environment. Although it is currently in private preview, and right now it only supports North American PII definitions, this is still a very promising feature.
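The difference between column-level masking and blocking can be sketched as follows. This is an illustrative example only, not Fivetran's mechanism: the `protect_row` function and the `policy` mapping are hypothetical, the sensitive columns are assumed to have been identified already, and SHA-256 hashing is used here simply as one plausible way of masking a value.

```python
import hashlib
from typing import Dict

Row = Dict[str, object]


def protect_row(row: Row, policy: Dict[str, str]) -> Row:
    """Apply a per-column policy before a row is written to the target.

    policy maps column name -> "mask" or "block" (hypothetical convention);
    columns not listed in the policy pass through unchanged.
    """
    protected: Row = {}
    for col, value in row.items():
        action = policy.get(col, "allow")
        if action == "block":
            continue  # the column never reaches the target
        if action == "mask":
            # Replace the value with a one-way hash of its string form.
            protected[col] = hashlib.sha256(str(value).encode("utf-8")).hexdigest()
        else:
            protected[col] = value
    return protected


policy = {"email": "mask", "ssn": "block"}
row = {"id": 1, "email": "ada@example.com", "ssn": "000-00-0000"}
safe = protect_row(row, policy)

assert "ssn" not in safe                    # blocked column is dropped
assert safe["email"] != "ada@example.com"   # masked column is hashed
assert safe["id"] == 1                      # other columns are untouched
```

Masking by hashing keeps the column usable for joins and distinct counts, since equal inputs produce equal hashes. Blocking removes the column from the target entirely.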

Why should you care?

There are a number of reasons to care about Fivetran as a solution for data movement: it is highly automated and easy to use; it is scalable up to hundreds of thousands of tables; it is very reliable, due to both idempotency (effectively guaranteed results) and an extremely high delivery uptime (purportedly 99.9%); and its optimisation of compute usage and attractive pricing options help to minimise the cost of living in the cloud.

The platform readily slots into existing DataOps ecosystems via its wide range of integrations, and data pipeline orchestration is available to further facilitate this. It supports an impressive range of data sources and application environments, with data catalogue integration a particular highlight. It also provides a high level of governance and security, evidenced by its numerous features pertaining to those topics. Its PII detection capability is especially notable, though it is clearly in its infancy.

Finally, Fivetran’s connectors and pre-built transformation models are robust, numerous, and effective. They provide excellent time to value, because everything is already there for you, at a high quality and degree of automation. With the company’s program for creating connectors on request, as well as its partner SDK and Custom Connector Framework, its range of data source support is also quite extensible.

The bottom line

Fivetran is a reliable, automated, easy-to-use, and overall high-value solution for data integration and movement that places particular emphasis on compliance and security. If you are in the market for data movement, it is more than worth your time to check it out.



