Log Island

LogIsland is an event mining platform based on Spark and Kafka to handle a huge amount of log files.

You can start right now to play with LogIsland through the Docker image, by following the getting started guide

The documentation also explains how to build the source code in order to implement your own plugins.

Once you know how to run and build your own parsers and processors, you'll want to deploy and scale them.

Basic Workflow

Raw log files are sent to Kafka topics by a NIFI / Logstash / Flume / Collectd (or whatever) agent
Logs in Kafka topic are translated into Events and pushed back to another Kafka topic by a Spark streaming job
Events in Kafka topic are sent to Elasticsearch (or Solr or whatever backend) for online analytics (Kibana or Banana) by a Spark streaming job
Log topics can also dumped to HDFS (master dataset) for offline analytics
Event processor do some time window based analytics on events to build new events

Start a log parser

A Log parser takes a log line as a String and computes an Event as a sequence of fields. Let's start a LogParser streaming job with a custom ApacheLogParser. This stream will process log entries as soon as they will be queued into li-apache-logs Kafka topics, each log will be parsed as an event which will be pushed back to Kafka in the li-apache-event topic.

$LOGISLAND_HOME/bin/log-parser \
    --kafka-brokers sandbox:9092 \
    --input-topics li-apache-logs \
    --output-topics li-apache-event \
    --max-rate-per-partition 10000 \
    --log-parser com.hurence.logisland.plugin.apache.ApacheLogParser

Start an event mapper

An event mapper takes an event and serialize it as an Elasticsearch document. Let's start an EventIndexer with a custom mapper. This stream will process event entries as soon as they will be queued into li-apache-event Kafka topics. Each event will be sent to Elasticsearch by bulk.

$LOGISLAND_HOME/bin/event-indexer \
    --kafka-brokers sandbox:9092 \
    --es-host sandbox \
    --index-name li-apache \
    --input-topics li-apache-event \
    --max-rate-per-partition 10000 \
    --event-mapper com.hurence.logisland.plugin.apache.ApacheEventMapper

Start an event processor

//TODO

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
bin		bin
conf		conf
docker		docker
project		project
src		src
.gitignore		.gitignore
.travis.yml		.travis.yml
CHANGES.md		CHANGES.md
LICENSE.md		LICENSE.md
README.md		README.md
assembly.sbt		assembly.sbt
build.sbt		build.sbt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Log Island

Basic Workflow

Start a log parser

Start an event mapper

Start an event processor

About

Releases

Packages

Languages

License

lhubert/log-island

Folders and files

Latest commit

History

Repository files navigation

Log Island

Basic Workflow

Start a log parser

Start an event mapper

Start an event processor

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages