Striim
Last Updated:
Analyst Coverage: Philip Howard and Daniel Howard
Striim was founded in 2012. Originally known as WebAction, it was rebranded following a significant funding round in 2015. At time of writing, it has recently benefitted from another round of funding that has contributed to its rapid growth throughout 2021. Its offices are in Palo Alto, but it has a global presence, in part due to its robust (and growing) network of partners that includes both systems integrators and technology companies.
Striim
Last Updated: 2nd October 2024
Mutable Award: Gold 2024
Striim is an enterprise-level platform that offers a unified combination of continuous data integration, real-time CDC (Change Data Capture), stream processing, and streaming analytics (see Figure 1). It particularly emphasises the use of integration and streaming activities to continuously deliver large volumes of data to various data platforms, such as cloud data warehouses, in order to support use cases that demand real-time access to data. It is also heavily invested in supporting data architectures that rely on continual data movement. This includes data fabrics and data meshes, as well as many environments that utilise (or plan to utilise) generative AI. In the latter case, Striim is used to continually supply data to your AI model, ensuring that it remains up-to-date. The company utilises generative AI directly to add additional value to its stream processing capabilities, as discussed in more detail below.
Striim boasts strong partnerships with Microsoft and Google, including a strategic migration partnership with the latter and a strategic data partnership with the former. It is a priority partner of Microsoft Fabric, and provides integration with the same. Similarly, the platform integrates with various cloud providers (such as AWS, Google Cloud, and Azure) and cloud data warehouses (such as Snowflake – including Snowpipe Streaming, offering real-time data ingestion into Snowflake – Databricks, and BigQuery), among hundreds of other targets.
Four different licenses are available from Striim: Striim Cloud Enterprise offers the full platform as a fully-managed cloud service; Striim Data Products provides a more specialised deployment, purpose-built for specific data warehouse targets; Streamshift by Striim leverages the platform’s continuous pipelines to drive fully managed and automated, zero downtime database migrations; and there is also a license for self-managed solutions, which is compatible with both on-prem and in-cloud usage.
Customer Quotes
“Thanks to the Striim team, Macy’s was able to move to the GCP Cloud way faster than what was originally thought was possible.”
Macy’s
“Striim and Google Cloud have jointly enabled us to enhance the customer experience with AI and ML.”
UPS
“When we want to build new applications on Striim’s streaming data, it only takes a couple of days — as opposed to months — to deploy.”
Ciena
Striim provides a web-based, graphical, no-code development environment (shown in Figures 2 and 3) with built-in automation and APIs. You can capture data from many sources – including databases (via CDC, which is both non-intrusive and compatible with past and future versions of any given data source), log files, message queues (including Kafka, among others), events, and so on – then build transformation and analytics pipelines that act on that data continually and in real time, enabling a variety of use cases including streaming analytics, data integration, and, more generally, real-time data delivery.
Pipelines are built using Striim’s proprietary SQL-like language, TQL. TQL runs in-memory and, in addition to the filtering, transformation, and aggregation of data, can be used to enrich streaming data with reference data stored in the platform’s distributed, in-memory data grid. There is a facility for users to add extensions written in Java, and the platform supports the import and export of Java analytic models, although not models using R or Python. Similarly, the product does not support PMML (Predictive Modelling Mark-up Language). Tumbling, sliding, and session windows are supported based on record count, time, or data attributes, with additional support for filtering and aggregation, joining streams with historical data, multi-stream correlation, pattern matching, and anomaly detection. There is a library of predictive analytics functions, and streamed data can be persisted on the fly. A wide range of connectors are provided, as is an open processor for extending your pipelines to include third-party code.
This can be used to leverage additional capabilities, such as data quality or governance, within a streaming context. An upcoming version of Striim will also enable partner companies to write their own, custom connectors.
Striim exposes the real-time insights generated by its analytics in various forms, such as alerting (and other notifications) and dashboards (and other visualisations). The latter offers a number of noteworthy features, like the ability to filter real-time data, either by time or by field, and the ability to rewind time-based queries to look at past data. Page- and chart-level searching and filtering are provided, and Striim charts can be embedded into HTML pages.
Striim is highly scalable, performant, and has an elastic architecture, with its distributed execution model combining a continuous query engine, an in-memory data grid, in-memory stream processing, a high-speed, distributed messaging/queuing system, and a results cache built on Elasticsearch. Incoming data streams are shared over the cluster for horizontal scalability, with checkpointing for recovery and restart from the last known good state, providing exactly once processing (E1P) guarantees. It also utilises various integrations with cloud providers, such as multi-threading, that can drive performance.
Much of this serves to reduce latency and support real-time use cases, not the least of which is (generative) AI. In this context, Striim provides real-time embeddings that offer context to each user prompt, allowing for your streams to be enriched with AI-derived insights in real time. Striim intends to further capitalise on this with a series of “AI Insights” features, the first of which (currently in beta) uses AI to discover sensitive/PII data in your streams, then applies existing Striim functionality to obfuscate, encrypt, or tag (so that, for instance, it can be picked up by a third-party tool) that data before it lands in its destination. Matching is based on either regular expressions or Microsoft Presidio, and leverages an OpenAI model of your choice. This dovetails with other Striim capabilities that support regulatory compliance, such as filtering and masking data, maintaining a single customer view that can be used to confirm compliance or detect data breaches, and more general data monitoring, all of which is provided in real time.
Striim, as its name might imply, is more than capable as a streaming analytics and stream processing platform. In addition to more standard streaming analytics use cases (anomaly detection, correlations, pattern matching, and so forth) there are a host of applications that rely on stream processing – such as zero-downtime migrations – that Striim is able to address even though many of its competitors in the streaming space have not, historically, been able to. This is in large part because Striim also offers a fully-fledged data integration capability.
Indeed, despite its name, Striim is a unified streaming and data integration platform, a value proposition that – while not unique – is certainly rare and valuable. Compared to other data integration products, it offers the significant advantage of real-time operation. While not every use case calls for that level of immediacy, there are many that benefit from it, and a fair few that absolutely demand it. Again, this allows Striim to address use cases that other products simply cannot, at least not by themselves.
It is also worth noting the platform’s longstanding support for machine learning and AI, which stretches back to considerably before the recent generative AI boom, and enables you to both deploy AI models as part of your pipelines and train them using streaming data. In particular, the discovery features provided by AI Insights show substantial promise as a way to find sensitive data as it moves about your system.
The bottom line
Striim distinguishes itself with its dual focus on stream processing (including streaming analytics) and data integration, which allows it to address use cases that would be inaccessible to either functionality if deployed alone. This core differentiator is supported by robust integration capabilities, performance, AI support, and more.
Mutable Award: Gold 2024