Posts

Showing posts from September, 2025

Implementing Principal Component Analysis (PCA) in R

Image
“Give me six hours to chop down a tree and I will spend the first four sharpening the axe.” – Abraham Lincoln This timeless quote applies beautifully to machine learning and data science. The success of any predictive model often depends less on the choice of algorithm and more on the effort spent in data preprocessing, cleaning, and feature engineering. One key part of feature engineering is deciding which features actually add value. This is where dimensionality reduction becomes essential, and among the various techniques available, Principal Component Analysis (PCA) stands out as one of the most widely used. In this article, we’ll explore PCA step by step—starting from the curse of dimensionality to conceptual foundations, and finally its application in R using the well-known Iris dataset. Table of Contents Lifting the Curse with PCA The Curse of Dimensionality Explained Insights from Shlens’ PCA Paper Conceptual Background of PCA Step-by-Step Implementation in R Interpreting PCA...

CASE vs. Native Groups: Tableau Speed Test Results

Image
  A Great Dashboard Balances Power and Simplicity Every business dashboard should do two things really well: Perform quickly , even with large datasets. Stay simple enough so that users can explore without frustration. But anyone who has worked with big data inside Tableau knows that performance can sometimes become a bottleneck. A dashboard might look beautiful, but if it takes minutes to load, users quickly lose patience. One small but powerful way to speed things up is in how you create groups in Tableau . In this article, I’ll walk you through two approaches: Using Tableau’s native group feature Using a CASE statement in a calculated field We’ll see why the second option often performs faster and is a better fit for large datasets. Why Do We Need Groups in Tableau? Groups are a simple yet powerful feature in Tableau. They let you bundle related items together, so you can analyze them as a single unit. For example, suppose you have a dataset of movie reviews . You might want to...

Animated Visualizations in R with gganimate: A Beginner-Friendly Guide

Image
Data visualization has always been about storytelling. Static charts are powerful, but sometimes they just don’t capture the “motion” or progression behind the numbers. Imagine trying to explain how a country’s GDP has changed over 50 years. A single line chart works, but what if you could watch the growth unfold year by year, almost like a short movie? That’s where gganimate in R comes in. This article will walk you through the basics of animated data visualizations in R using gganimate . We’ll keep it simple, beginner-friendly, and practical—no heavy jargon, just a clear guide with examples. Why Animated Visualizations? Animated plots aren’t just eye candy. They help in situations like: Showing change over time (e.g., sales, population, stock prices). Revealing patterns that are hidden in static charts. Engaging audiences in presentations or dashboards. Making learning fun for stakeholders who aren’t data-savvy. Think of gganimate as a bridge between ggplot2 (R’s go-to visualiz...

A Beginner’s Guide to Correlation in Tableau

Image
  Introduction Data in the right hands can be extremely powerful, shaping strategy, operations, and growth. As American statistician W. Edwards Deming once said, “In God we trust. Everyone else, bring data.” Tableau, one of the leading data visualization platforms, is not just about creating pretty charts—it’s also a statistical powerhouse. With built-in functions and calculations, Tableau allows analysts to explore relationships, uncover patterns, and test hypotheses directly in their dashboards. Among the most misunderstood yet essential statistical concepts in analytics is correlation . Too often, business leaders confuse correlation with causation —a mistake that can lead to costly misinterpretations and flawed decisions. In this article, we’ll take a deep dive into correlation in Tableau—what it means, how to calculate it, and how to visualize it effectively. Along the way, we’ll explore examples, use cases, and best practices to ensure you’re extracting insights responsibly. ...

Integrating FactSet Data into Tableau Using a WDC

Image
  Introduction In today’s world, financial data doesn’t just live inside databases or spreadsheets—it streams continuously from APIs, data providers, and cloud platforms. Financial analysts, portfolio managers, and decision-makers need tools that can pull this information together quickly and visualize it in ways that support better investment decisions. One of the most powerful ways to achieve this is by using Tableau Web Data Connectors (WDCs) . A WDC acts as a bridge between Tableau and web-based data sources, such as FactSet , Bloomberg, or any API that provides financial metrics. By using a WDC, you can build dashboards that fetch real-time stock and ticker data dynamically, enabling analysts to interact with live information rather than static reports. This article revisits the original project where we built a Tableau WDC to connect with FactSet’s API for financial analysis. It also updates the approach with modern trends (as of 2025), exploring how Tableau’s WDC 3.0 framewo...