Data analytics and democracy

Today, privacy campaigners have been circulating tweets about Brittany Kaiser’s document dump, which they say proves that Cambridge Analytica and AIQ were working as a single entity. This would seem somewhat contrary to what the police and the ICO have been able to find. There’s more to find here and when I find it, I’ll post the stories…

Big Data

I am once again trying to write my blog on solutions architecture and the GDPR. I looked up “Data Lake” again and came across some very good resources on Intricity’s YouTube channel. This summarises the design bifurcation between distributed data sources and unified query logic. It’s five years old. He or his…

Apache Flume

Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. It has a simple and flexible architecture based on streaming data flows. It is robust and fault tolerant with tunable reliability mechanisms and many failover and recovery mechanisms. It uses a simple extensible data model that…
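
To make the "streaming data flows" idea a little more concrete, here is a minimal sketch of a single-agent Flume configuration, assuming the usual source, channel and sink arrangement. The agent and component names (a1, r1, c1, k1), the port and the capacities are illustrative choices of mine, not anything taken from a real deployment.

```properties
# Illustrative single-node agent; a1, r1, c1 and k1 are made-up names
a1.sources = r1
a1.channels = c1
a1.sinks = k1

# Source: listen for lines of text on a local TCP port (port chosen arbitrarily)
a1.sources.r1.type = netcat
a1.sources.r1.bind = localhost
a1.sources.r1.port = 44444

# Channel: buffer events in memory between source and sink (sizes are assumptions)
a1.channels.c1.type = memory
a1.channels.c1.capacity = 1000
a1.channels.c1.transactionCapacity = 100

# Sink: write each event to the agent's log, which is handy for testing
a1.sinks.k1.type = logger

# Wiring: a source can feed several channels, a sink drains exactly one
a1.sources.r1.channels = c1
a1.sinks.k1.channel = c1
```

If this were saved as, say, example.conf (a hypothetical file name), an agent could be started with something like `bin/flume-ng agent --conf conf --conf-file example.conf --name a1`. A memory channel is fast but loses buffered events if the agent dies, so swapping it for a file channel is one way of trading speed for reliability.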