Technological partnership

Hadoop : massive data storage and processing

Discover Hadoop, the brainchild of Facebook and Yahoo, for massive data storage and processing. At the heart of Open Source innovation and Big Data.

Smile & Hadoop

Smile has been working for several years on the development of a Big Data center of expertise : training, certification of consultants and developers in Hadoop technologies (Hortonworks Data Platform, Hortonworks Data Flow, Elastic, etc.).

The objective? Transmit all our expertise and know-how around the themes of development, advice and operation of Big Data platforms.

Smile is today recognized for its expertise in setting up and operating platforms, mainly serving major accounts.

The technical subject is outdated to get closer to the professions and work around use cases!

 

Hadoop, power and ease in everyday life

A free and open source framework, Hadoop's main mission is to facilitate distributed data processing . There are several Hadoop distributions, including Hortonworks, Cloudera and MapR.

The Big Data ecosystem is constantly evolving. New products/projects appear on the market every month.

How can businesses maintain stability and reliability in this context?

It is precisely Hadoop distributions that provide this necessary guarantee to secure deployments and ensure the compatibility of solutions with each other.

The dozens of solutions in the Hadoop ecosystem open up the field of possibilities:

  • Operational data warehousing / ODS (HDFS or Hbase) or data warehouse (Hbase and Hive)
  • Parallelized data integration and processing (YARN/Map-Reduce, Pig)
  • Querying and analyzing masses of data (Hive+YARN/Map-Reduce, Pig)
  • Datamining (Mahout)

As a bonus, software can be connected to it such as Apache Pig, Apache Hive, Apache HBase, Apache Phoenix, Apache Spark, Apache ZooKeeper, Cloudera Impala, Apache Flume, Apache Sqoop, Apache oozie or Apache Storm.

Do you want to know more? Dig into the Hadoop topic with articles from the Smile blog!

FEATURES

Version studied

  • 3.0.0

Licence

  • Apache

Language

  • Java

year of creation

  • 2006

Hadoop is a set of Open Source projects and tools from the Apache Foundation for massively storing and processing data.

It was originally developed by Facebook and Yahoo, and is now at the heart of the Big Data innovation and ecosystem.