Big Data - Further Information
This page shows up to 100 pieces of content (newest at the top):
Cray Systems and the Cray Graph Engine
The Cray Graph Engine is an RDF database that runs on a variety of Cray hardware platforms.
ArangoDB
ArangoDB is a multi-model database that supports document (JSON), key-value and property graph capabilities with one database core and one declarative query language.
Managing data lakes: building a business case
This is a companion paper to one we published in 2017. We outline a methodology for building a business case in support of implementing suitable data lake management software.
Trendalyze (June 2018)
Trendalyze describes its core capability as the discovery of motifs (and anomalies) within time series data. You can think of a motif as a micro-pattern but it is more accurately a shape. Once a motif of interest is discovered, or…
SQL Engines on Hadoop
There are many SQL on Hadoop engines, but they are suited to different use cases: this report considers which engines are best for which sets of requirements.
Data Lake Management
There are various factors needed to prevent a data lake becoming a swamp.
Big data and the mainframe - issues and opportunities
The purpose of this paper is to examine those issues, which arise when big data implementations transition beyond skunk works and into general-purpose use.
The Chief Data Officer: getting the basics right
Before a CDO can think sensibly about what data the business might want to leverage they must get a handle on the data assets that the company already possesses.
Managing Data Lakes
This paper discusses why data lakes need to be managed and the sorts of capabilities that are required to manage them.
All about graphs: a primer
Over the last few years graph databases have been the fastest growing sector within the database market ...
Graph and RDF databases 2016
This Market Report discusses the latest trends in this market, along with a detailed assessment of the leading vendors in the market
Graph and RDF databases Market Update 2016
This Market Update discusses the latest trends in this market, along with our assessment of the leading vendors in the market.
IBM Informix and the Internet of Things
This paper discusses the IBM Informix database and its suitability for deployment within Internet of Things (IoT) environments.
Total cost of ownership
TCO should be more important in decision making than either license fees or subscription costs.
DATUM - a value-driven approach to building the digital enterprise
In this paper we will discuss why we believe that understanding the business value of data is fundamental to a successful digital transformation.
All things Hadoop
Discussing the Open Data Platform and Apache Spark
The Internet of Things Reference Model
The World Forum Architecture Committee has published an IoT reference model
Product Information Management (PIM)
I often get emails from vendors talking about a whitepaper or other sales document. Sometimes these are very useful simple guides to a subject.
IBM: enhanced 360° view
IBM is in the vanguard for what it calls an enhanced 360° view and it is clearly well positioned to capitalise on the future growth of this market.
Extending a 360° view
In this paper we will discuss why we believe that extending the traditional 360° view makes sense and we will give some uses that demonstrate why the extended it represents an opportunity.
Kdb+ and the Internet of Things/Big Data
Kdb+ is a column-based relational database with extensive in-memory capabilities, developed and marketed by Kx Systems.
Creating confidence in Big Data analytics
There has been some significant criticism of the concept of big data recently, notably in the Harvard Business Review criticising the Google Flu Trends...
Considering the small in big data
Not all of the issues addressed by big data need big data solutions
Kognitio: clarifying misunderstandings
There aspects of Kognitio and its offering that are sometimes misunderstood, so I thought I should clear some things up.
Big data security
The third issue for big data is ensuring that the data is secure and compliant. There are also ethical issues.
Big data context
The second issue for big data is understanding the context of the data
Big data trust
The first issue for big data is how much you trust the data
TIBCO transforms big data into big opportunity
TIBCO came to London for their user conference (transFORM2013). This year's theme was all about big data and TIBCO's senior executives outlined their strategy for their platform.
Calling a spade a spade
Preventative maintenance and asset optimisation are not the same thing
IBM JSON
There's been some confusion about how exactly DB2 is supporting JSON: here's the lowdown
Big Data governance and EU data law – Part 2 - I talk to the experts
Further thoughts on Data Protection issues and Big Data.
Big Data governance and EU data law – Part 1 - I raise some questions – and highlight some resources
Individuals' consent is 'almost always' required by firms when using personal data in big data projects centred on profiling and that's a governance issue which perhaps needs legal, as much as IT, advice
Harnessing big data for security - what are the key considerations and capabilities?
This report discusses some of the challenges of harnessing big data security and outlines some of the key considerations and capabilities that organisations should consider.
Key considerations for security intelligence in big data - what a CISO needs to know
This document discusses the need for an intelligence-driven security approach and aims to provide pointers for security executives.
Big Data bias - An analysis of recent research from Varonis
Varonis has published some research into big data - but just looking at the press release is misleading
CEP and Big Data 2
Should it be called CEP? Is CEP only about real-time BI? These were questions we answered 6 years ago. Also, a mention of some Hadoop-based CEP engines.
Breakthrough and instrumented applications
The sort of data that is common in big data scenarios can be exploited in other ways too
Another look at big data
The second in the series on big data
What is big data?
The first of a series of articles on big data (What is Hadoop? was a preface).
Informatica Data Replication for real-time (big) data warehousing
While real-time analytics is becoming more and more urgent for many organisations the ability to accomplish this can easily be constrained by the volume of data that needs to be analysed.
Challenging Cloudera
Cloudera has been the default standard for enterprise Hadoop implementation but perhaps not any longer.
Big data
There is too much confusion around the whole idea of big data