Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. It has a simple and flexible architecture based on streaming data flows. It is robust and fault tolerant with tunable reliability mechanisms and many failover and recovery mechanisms. It uses a simple extensible data model that allows for online analytic application. I need to check this out.
Here are the two links I’d start with.
- Welcome to Flume at apache.org?
- The Horton works link has gone away.
I need a logo, and need to build some stuff.