This portfolio highlights hands-on data projects using tools like R, SQL, and Excel. Each section includes a real-world problem, the tools used, and key insights visualized for impact.

Student Performance Analysis: A Google Data Analytics Capstone Project.
Introduction
The purpose of this capstone project is to analyze student performance data in order to identify patterns and factors that influence academic success. By uncovering key insights related to variables such as study time, absences, parental education, and other personal or school-related factors, this analysis aims to help educators and policymakers make informed decisions to improve student outcomes.
Business Task
The goal is to uncover insights that can help educators understand key drivers of academic success and identify opportunities for targeted support.
About the Author
Francis Pekpe is a certified data analyst and published author of Power Within and Unbreakable. With a strong foundation in storytelling and communication, they combine their analytical skills and creative background to deliver insights that are both meaningful and easy to understand. After completing the Google Data Analytics Certificate, they now apply their expertise in R, SQL, and data visualization to real-world problems, as shown in this capstone project on student performance.
Data Source
To ensure the accuracy and reliability of the analysis, the dataset was cleaned using R. This process included checking for and removing any missing or null values, standardizing variable formats, and selecting only the most relevant columns for analysis—such as age, gender, study time, absences, and final grades. Duplicate entries were also checked and none were found. The cleaned dataset provides a solid foundation for meaningful exploration and visualization.
Data Cleaning
The original dataset contained 649 student records and 33 variables. Before analysis, several data preparation steps were performed to ensure the quality and relevance of the data.
age sex |
Min. :15.00 Length:649 1st Qu.:16.00 Class :character Median :17.00 Mode :character Mean :16.74 3rd Qu.:18.00 Max. :22.00 |
studytime failures |
Min. :1.000 Min. :0.0000 1st Qu.:1.000 1st Qu.:0.0000 Median :2.000 Median :0.0000 Mean :1.931 Mean :0.2219 3rd Qu.:2.000 3rd Qu.:0.0000 Max. :4.000 Max. :3.0000 |
absences G1 |
Min. : 0.000 Min. : 0.0 1st Qu.: 0.000 1st Qu.:10.0 Median : 2.000 Median :11.0 Mean : 3.659 Mean :11.4 3rd Qu.: 6.000 3rd Qu.:13.0 Max. :32.000 Max. :19.0 |
G2 G3 |
Min. : 0.00 Min. : 0.00 1st Qu.:10.00 1st Qu.:10.00 Median :11.00 Median :12.00 Mean :11.57 Mean :11.91 3rd Qu.:13.00 3rd Qu.:14.00 Max. :19.00 Max. :19.00 |


Insights and Recommendations
– Students who report studying more hours (study time level 3 or 4) tend to achieve higher final grades.
– There is a noticeable difference in grade distributions between male and female students, with females slightly outperforming on average.
– High absenteeism and past class failures appear to correlate with lower grades.
Recommendations
– Provide additional support to students with high absenteeism and previous failures, as they are more at risk of underperformance.
– Encourage consistent study habits, especially for those in the lower study time categories.
– Consider gender-based engagement strategies to ensure equitable support across the student body.
These findings can inform school policies or interventions aimed at improving academic performance.
dashboard – tableau
Introduction:
This project demonstrates how structured data from a fictional retail dataset was analyzed using SQL to uncover business insights. The focus was on identifying top-performing products, analyzing profit margins across categories, and spotting regional trends in sales performance. Through SQL queries, I efficiently filtered, grouped, and joined data tables to produce actionable insights that can support strategic business decisions. This project showcases my ability to work with relational databases, extract meaningful metrics, and communicate findings effectively.

View Dashboard on Tableau Public
Summary of Data & Insights
This project visualizes sales and profit data across regions and product categories. The goal was to surface actionable insights for optimizing sales strategy, inventory decisions, and marketing focus.
Key Insights:
· Western region leads in sales, Southern region leads in profitability.
· Technology is the top-selling category, but Office Supplies offers higher margins.
· Sales spikes during specific quarters suggest opportunities for seasonal campaigns.
Challenges and Value Delivered
Challenges:
- Cleaning raw data for Tableau compatibility
- Designing a layout that communicates both sales trends and profit breakdowns clearly
- Creating filters that allow stakeholder-level interactivity
Value Delivered:
- Created a visually intuitive dashboard that allows sales and marketing teams to explore performance by region and category
- Delivered actionable insights that help stakeholders plan promotions and inventory strategy
Marketing Performance Analysis (Email, Social & Ads)
This analysis presents a clear, visual representation of a multi-channel marketing campaign spanning five months. The goal was to monitor user engagement and conversion across three primary digital channels: Email, Social Media, and Online Ads.
Objective:
To track the effectiveness of each marketing channel in driving user actions, specifically clicks (interest) and conversions (action), and evaluate trends that could guide future strategy.
Key Insights:
- Consistent Growth: All three channels saw a steady increase in both clicks and conversions from January to May.
- Highest Engagement: Ads consistently drove the highest number of clicks and conversions, making it the top-performing channel.
- Email Efficiency: While Email had fewer clicks, it maintained a healthy conversion rate—suggesting more qualified leads.
- Social Media’s Role: Strong mid-tier performer with balanced growth, indicating successful audience targeting and engagement.

Visual Interpretation:
- Solid Lines represent Clicks for each channel.
- Dashed Lines represent Conversions, tracking how effectively interest translated into action.
- The chart makes it easy to detect dips (like in March for Email) and surges (like Ads in April/May), which are useful for performance reviews and campaign timing.
Conclusion:
This dashboard illustrates how data visualization helps in evaluating the return on marketing investments. By aligning efforts with months of higher engagement, teams can optimize campaigns, allocate budgets wisely, and drive meaningful growth.