Big Data Case Study: How a FTSE100 FinTech adopted Data Science

Big Data Case Study: How a FTSE100 FinTech adopted Data Science
To stay competitive and innovate large organisations need to adopt Big Data, Advanced Analytics and Data Science capabilities and technologies. Ideally, that happens fast and safely with manageable costs and early business outcomes. The requirements and necessary capabilities are uncertain, though, and the appropriate technologies and implementation plan unresolvable. In that situation, many organisations jump ... read more →

Faster Big Data on Hadoop with Hive and RCFile 5

Faster Big Data on Hadoop with Hive and RCFile
SQL on Hadoop with Hive makes Big Data accessible. Yet performance can lack. RCFile (Record Columnar File) are great optimisation for Big Data with Hive. The previous two posts in this four parts series explained the reasons why to use text on the periphery of an ETL process and optimisations for text. The inside of a Hive ... read more →

Optimising Hadoop and Big Data with Text and Hive

Optimising Hadoop and Big Data with Text and Hive
Hadoop’s Hive SQL interface reduces costs and to gets results fast with Big Data from Text. Simple optimisations improve the performance significantly. The previous post Getting Started with Big Data with Text and Apache Hive described the case for using text format to import and export data for a Hive ETL and reporting process. These ... read more →

Getting Started with Big Data with Text and Apache Hive 2

Getting Started with Big Data with Text and Apache Hive
Big Data more often than expected is stored and exchanged as text. Apache Hadoop’s Hive SQL interface helps to reduce costs and to get results fast. Often, things have to get done fast rather than perfectly. However, with big data even a small decision like a file format could have a great impact. What are ... read more →

9+ Free Online Data Science Resources You Should Know 1

Data Science is a hot topic and there are plenty of courses and resources available for anyone interested. Try out these 9 free resources to get started if you are new to the topic or want to refresh on one of the subjects. read more →

Voronoi Tessellation

The Voronoi Tesselation (or Voronoy Tessellation) by Georgy Feodosevich Voronoy/Вороной Георгий Феодосьевич (1908) is a technique that enables the division of a such multi-dimensional spaces into subspaces. Its application defines geometric areas equivalent to subspaces by defining several vectors as centres of subspaces. Any other vector in space can then be attributed to the closest centre ... read more →