top of page

Create Your First Project

Start adding your projects to your portfolio. Click on "Manage Projects" to get started

Unemployment Rates in California

Project Overview: This project involved developing an empirical model to analyze California's unemployment rates over the period from 1976 to 2020. The objective was to understand the trends, seasonal patterns, and economic crises that influenced unemployment in the state, including the 2008 recession and the COVID-19 pandemic. The analysis utilized data from the Federal Reserve Economic Data (FRED), and R was used for data compilation, statistical analysis, and visualization. The final analysis provided insights into cyclical labor market behavior, contributing to understanding the economic dynamics in California.

Key Tasks and Activities:
Data Collection and Compilation:
I accessed and collected county-level unemployment data from FRED (Federal Reserve Economic Data) for all counties in California. This dataset included historical unemployment rates and other relevant economic indicators.
Using R, I compiled the data, cleaning and preparing it for analysis. This involved removing missing values, handling discrepancies, and merging datasets from multiple counties to form a comprehensive dataset.

Unemployment Trend Analysis:
In this step, I used R to perform exploratory data analysis (EDA) on the unemployment data. I identified significant economic events, such as the 2008 financial crisis and the COVID-19 pandemic, that caused spikes in unemployment.
I applied statistical tools like mean, median, and standard deviation to summarize the data. The mean unemployment rate during this period was calculated to be 7.14%, and the highest unemployment rate recorded was 16.40% during the COVID-19 peak.

Time Series Decomposition:
I used time series decomposition to break down the unemployment rate data into its trend, seasonal, and residual components. This helped in understanding long-term trends (e.g., steady rise or fall), seasonal fluctuations (e.g., patterns related to economic cycles), and irregular components (e.g., unexpected crises).
This decomposition provided insights into cyclical labor market patterns, including periods of economic expansion and contraction.

Visualization:
I used dynamic plots in R, including line charts, seasonal plots, and heat maps, to visualize the unemployment trends over the years. The plots helped to capture fluctuations in unemployment, especially during key events like the 2008 recession and the COVID-19 peak.
Interactive visualizations were created to explore the seasonal variations and the impact of specific crises on unemployment rates.

Empirical Modeling in Excel:
In addition to the R-based analysis, I developed an empirical model in Excel to estimate and forecast unemployment rates in California. The model used statistical techniques such as linear regression and time series forecasting to predict future unemployment trends based on historical data.

Tools and Techniques Used:
R Programming:
R was used for data cleaning, statistical analysis, and visualization. Techniques included time series analysis, decomposition, and statistical modeling. Key R packages like ggplot2, forecast, and dplyr were used for plotting, data wrangling, and time series analysis.

Excel:
In Excel, I developed an empirical model using built-in functions like linear regression and forecasting. I also used pivot tables to summarize unemployment data and perform quick analysis.

Statistical Analysis:
I used various statistical measures such as mean, standard deviation, and time series decomposition to identify patterns and trends.
The decomposition technique helped break down the unemployment data into trend, seasonality, and residuals, allowing for better understanding of underlying patterns.

Data Visualization:
I created dynamic plots to visualize the unemployment trends over time, using line charts for trend visualization, seasonal plots for cyclic behavior, and heat maps for identifying hotspots during economic crises.

Data Set Used:
The dataset used for this project was sourced from FRED (Federal Reserve Economic Data). The data includes county-level unemployment rates for California from 1976 to 2020, covering both state-level and county-specific figures. This data is rich in historical context, providing insights into the labor market across different periods of economic turmoil.
Recommendations:
Key Findings:
The project highlighted several key trends in California's unemployment rates over the analyzed period, including:
A cyclical labor market, with clear expansions and contractions during economic booms and busts.
A notable spike in unemployment during the 2008 recession, with unemployment rates reaching a high point of around 12%.
The COVID-19 pandemic caused an unprecedented surge in unemployment, with a 16.40% peak, which represented a major labor market shock.

Insights and Implications:
The unemployment data revealed that California's unemployment rates follow a cyclical pattern, with periods of high unemployment following economic crises, followed by gradual recovery.
The decomposition analysis helped to highlight seasonal trends in unemployment, driven by various economic cycles (e.g., fiscal year budget cuts, holidays).

Recommendations:
Economic Planning: Businesses and local governments can use these insights to better prepare for potential economic downturns by focusing on sectors that are more resilient to economic shocks.

In conclusion, this project provided valuable insights into California's unemployment dynamics and laid the groundwork for more advanced economic modeling and forecasting. The empirical model developed in Excel and the R-based analysis have practical implications for policymakers and businesses looking to understand labor market trends and plan for future crises.

bottom of page