Big Data Tools
-
Apache Spark
Apache Spark – An Open Source Big Data Tool
Apache Spark is an open source system for fast and flexible large-scale data analysis, including interactive exploration of very large datasets, near real-time stream processing, and ad-hoc SQL analytics. It is a fast cluster computing system that can process data in memory. Its main advertised advantage is speed: the project has claimed that in-memory workloads can run up to 100 times faster than Hadoop MapReduce.
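The benefit of keeping data in memory can be illustrated with a small sketch. This is plain single-process Python, not Spark's API; the sample records and the names `RAW_LINES`, `parse`, and `cached` are hypothetical. It only shows the idea of parsing a dataset once, holding it in memory, and reusing it for several ad-hoc queries.

```python
from collections import Counter

# Hypothetical sample records standing in for a much larger dataset.
RAW_LINES = [
    "2024-01-01,click,alice",
    "2024-01-01,view,bob",
    "2024-01-02,click,alice",
    "2024-01-02,click,carol",
]


def parse(line):
    """Split one CSV-style line into a (date, event, user) tuple."""
    date, event, user = line.split(",")
    return date, event, user


# Parse once and keep the result in memory, instead of re-reading and
# re-parsing the raw input for every query.
cached = [parse(line) for line in RAW_LINES]

# Two ad-hoc queries reuse the same in-memory working set.
clicks_per_user = Counter(user for _, event, user in cached if event == "click")
events_per_day = Counter(date for date, _, _ in cached)

print(clicks_per_user)
print(events_per_day)
```

In a real cluster the working set would be partitioned across many machines; the sketch leaves that out and shows only the reuse of parsed data.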
-
Apache Drill
Apache Drill – An Open Source Big Data Tool
Apache Drill is an open source software framework that supports data-intensive distributed applications for interactive analysis of large-scale datasets. Drill is designed to scale to 10,000 servers or more and to process petabytes of data and trillions of records in seconds.
-
D3.js
D3.js – An Open Source Big Data Tool
D3.js is an open source JavaScript library for manipulating documents based on data. D3 stands for Data-Driven Documents. It is designed to be fast, supports large datasets, and runs across hardware platforms. D3.js is used to create dynamic graphics using Web standards such as HTML5, SVG, and CSS.
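The "data-driven document" idea can be sketched outside the browser as well. The example below is plain Python using the standard library, not D3.js or its API; the data values, the constants `BAR_HEIGHT` and `SCALE`, and the fill color are hypothetical. It creates one SVG rectangle per data value, so the document's structure follows directly from the data.

```python
import xml.etree.ElementTree as ET

values = [4, 8, 15, 16, 23, 42]  # hypothetical data to visualize
BAR_HEIGHT = 20                  # pixels per bar row (assumed)
SCALE = 10                       # pixels per data unit (assumed)

# Root SVG element sized to fit the longest bar and all rows.
svg = ET.Element(
    "svg",
    xmlns="http://www.w3.org/2000/svg",
    width=str(max(values) * SCALE),
    height=str(len(values) * BAR_HEIGHT),
)

# One <rect> per datum: its width encodes the value, its y position the index.
for index, value in enumerate(values):
    ET.SubElement(
        svg,
        "rect",
        x="0",
        y=str(index * BAR_HEIGHT),
        width=str(value * SCALE),
        height=str(BAR_HEIGHT - 2),
        fill="steelblue",
    )

markup = ET.tostring(svg, encoding="unicode")
print(markup)
```

Saving `markup` to a `.svg` file should give a simple horizontal bar chart. D3.js applies the same mapping from data to elements inside a live web page and can update the elements when the data changes.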
-
HCatalog
HCatalog – An Open Source Big Data Tool
HCatalog is an open source metadata and table management framework that works with Hadoop HDFS data. HCatalog lets different tools share data: Hadoop users working with a tool such as Pig, MapReduce, or Hive have immediate access to data created with another tool, without any loading or transfer steps.
-
Apache Storm
Apache Storm – An Open Source Big Data Tool
Apache Storm is an open source distributed real-time computation system. Storm makes it easy to reliably process unbounded streams of data, doing for real-time processing what Hadoop did for batch processing. Storm is simple and can be used with any programming language.
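Processing an unbounded stream means handling each item as it arrives rather than waiting for the input to end. The sketch below uses plain Python generators to show that pattern. It is not Storm's API, it runs in a single process, and it has none of Storm's distribution or reliability guarantees; the function names and sample sentences are hypothetical.

```python
import itertools
from collections import Counter


def sentence_source():
    """Yield sentences forever, standing in for an unbounded stream."""
    samples = ["the cow jumped", "the moon rose", "the cow slept"]
    yield from itertools.cycle(samples)


def split_words(sentences):
    """Emit each word of each incoming sentence as soon as it arrives."""
    for sentence in sentences:
        for word in sentence.split():
            yield word


def running_counts(words):
    """Emit (word, count so far) for every incoming word."""
    counts = Counter()
    for word in words:
        counts[word] += 1
        yield word, counts[word]


# Chain the stages; nothing runs until results are pulled from the pipeline.
pipeline = running_counts(split_words(sentence_source()))

# The source never ends, so take only the first ten results.
first_ten = list(itertools.islice(pipeline, 10))
for word, count in first_ten:
    print(word, count)
```

Each stage consumes and emits one item at a time, so results are available continuously even though the source never finishes.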