Data analytics and democracy

Today, privacy campaigners have been circulating tweets about Brittany Kaiser’s document dump proving that Cambridge Analytics and AIG were working as a single entity. This would seem somewhat contrary to what the police and the ICO have been able to find. There’s more to find here and when I find it, I’ll post the stories here. Continue reading “Data analytics and democracy”

Big Data

I am once again trying to write my blog on solutions architecture and the GDPR. I looked up “Data Lake” again and came across some very good resources in a You Tube channel from Intricity.

This summarises the design bifurcation between distributed data sources and unified query logic. It’s five years old. He or his teachers got there first i.e. before me.

I also had a quick look at “Born in the Cloud”, and “Why Hadoop is dying!”

All very insightful. Continue reading “Big Data”

Apache Flume

Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. It has a simple and flexible architecture based on streaming data flows. It is robust and fault tolerant with tunable reliability mechanisms and many failover and recovery mechanisms. It uses a simple extensible data model that allows for online analytic application. I need to check this out. Continue reading “Apache Flume”