Integrating the LVM with Hadoop and provide the elasticity to DataNode storage

4 min readMar 16, 2021

Hello guys, I am again back with my new post on integrating the LVM with the Hadoop. In this article I gonna discuss about the what is LVM? what is Hadoop? How it helps in integrating the LVM with Hadoop?

What is LVM?

LVM means Logical Volume Management. LVM is a tool for logical volume management which includes allocating disks, striping, mirroring and resizing logical volumes.

The physical volumes are combined into logical volumes, with the exception of the /boot partition. The /boot partition cannot be on a logical volume group because the boot loader cannot read it. If the root (/) partition is on a logical volume, create a separate /boot partition which is not a part of a volume group.

What is Hadoop?

Hadoop is an open source tool from Apache. Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage.

Rather than rely on hardware to deliver high-availability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-available service on top of a cluster of computers, each of which may be prone to failures.

Why to integrate LVM with Hadoop?

On integrating the LVM with hadoop it provides the elasticity to the data node we can increase or decrease the storage at any point of time dynamically.