In the generation of cloud computing, virtual machines (VMs) have become very common, while the usage of physical machines has started to gradually decrease. In this post, we will introduce the tools, ‘vnStat’ and ‘iftop,’ which we can use to monitor data consumption in a VM. First, I am creating a machine with the following resources: 20 GHz CPU 16 …
Introduction to Hadoop
Data can be referred to as a collection of useful information in a meaningful manner which can be used for various purposes. Аn IT company can use ит for analyzing the productivity of employees over certain set of projects or in a consulting firm and also for predicting the best investment options based on the past. On a more personal …
Installing Hadoop on a Single Node in Five Simple Steps
Welcome to our guide on installing Hadoop in five simple steps. To start with, the Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather …
Extracting Twitter Data Using Python
Twitter is not just an extended source of news, it is by far one of the best samples of the world’s thoughts. With more than 330 million active users, it is one of the top platforms where people like to share their thoughts. Twitter data can be used for a variety of purposes such as research, consumer insights, demographic insights …
NASDAQ Per Minute Data Using Python
If analysis is the body, data is the soul. In our project of stock market analysis based on Twitter sentiments, we selected a few sample companies. We decided what we needed but we still had to cast some spells to get to the core data. As much as we dream about attending Hogwarts, we can’t simply say ‘Accio’ to make …