Mayaaa Singh

Greenhorn
+ Follow
since Sep 09, 2017
Merit badge: grant badges
For More
Cows and Likes
Cows
Total received
0
In last 30 days
0
Total given
0
Likes
Total received
0
Received in last 30 days
0
Total given
0
Given in last 30 days
0
Forums and Threads
Scavenger Hunt
expand Ranch Hand Scavenger Hunt
expand Greenhorn Scavenger Hunt

Recent posts by Mayaaa Singh

HBase is a non-relational column oriented distributed database that runs on top of HDFS. It is a NoSQL open source database in which data is stored in rows and columns. Cell is the intersection of rows and columns.

To track changes in the cell, versioning makes it possible to retrieve any version of contents. Versioning makes difference between HBase tables and RDBMS.

Each cell value includes a “version” attribute, which is nothing more than a timestamp uniquely identifying the cell. Each value in the map is an uninterrupted array of bytes.

The map is indexed by a row key, column key, and a timestamp. Implementations of HBase are highly scalable, sparse, distributed, persistent, and multidimensional-sorted maps.

HBase real world use cases:

  • Medical
    Sports
    Web




  • 6 years ago
    It's really good that you want to learn Hadoop. It's a really good time to start your career in Big Data Hadoop field. Hadoop framework is written in Java. So, you have an added advantage because you know Java.
    For learning Hadoop, start with the basics and then move ahead with the complex part. First learn the challenges of Big Data, why we should learn big data etc. then move ahead with Hadoop. Start with the Hadoop introduction then learn its 3 main parts i.e HDFS, MapReduce and Yarn. After that learn PIG, HIVE, HBase and try to write its queries. If you know SQL then you can easily write HIVE queries
    HIVE + SQL = HiveQL
    Hive use language called HiveQL (HQL), which is similar to SQL. HiveQL automatically translates SQL-like queries into MapReduce jobs.
    Hope this helps
    6 years ago