Posts

Showing posts from August, 2025

A Primer on Web Scraping in R

  If you are a data scientist who wants to capture data from such web pages then you wouldn’t want to be the one to open all these pages manually and scrape the web pages one by one. To push away the boundaries limiting data scientists from accessing such data from web pages, there are packages available in R. The more data you collect, the better your models, but what if the data you want resides on a website? This is the problem of social media analysis when the data comes from users posting content online and can be very unstructured. While there are some websites who support data collection from their web pages and have even exposed packages and APIs (such as Twitter), most of the web pages lack the capability and infrastructure for this. If you are a data scientist who wants to capture data from such web pages then you wouldn’t want to be the one to open all these pages manually and scrape the web pages one by one. To push away the boundaries limiting data scientists from acce...

How to Create Histograms in R

Image
  Histogram in R How to create histograms in R To start off with analysis on any data set,   we plot histograms . Knowing the data set involves details about the distribution of the data and histogram is the most obvious way to understand it. Besides being a visual representation in an intuitive manner. It gives an overview of how the values are spread. We come across many depictions of data using histograms in our day to day life. For example, the distribution of marks in a class can be best represented using a histogram and so does the age distribution in an organization. The good thing about histograms is that it can visualize a large amount of data in a single figure and convey lots of information. It is quite easy to spot the median and mode by looking at histograms. A histogram can also indicate possible outliers and gaps in data. Thus a single figure can help know a lot about data. So in this article, we are going implement different kinds of histograms. Starting w...