Part 1: Provided a dataset of volume sales of products from 2019 to 2022, run an extensive exploratory data analysis including the following:

1. Data Quality & Structure Checks
   - Missing values, duplicates, negative sales, outliers
   - Date consistency (no gaps, proper frequency, handling holidays/weekends)
2. Descriptive Statistics
   - Overall distribution of daily sales (mean, median, std, skewness, kurtosis)
   - By dimension: product, customer
   - Identify top products per customer by volume
3. Time Series Exploration
   - Trend: long-term upward or downward movement
   - Seasonality: daily/weekly patterns (weekdays vs weekends), monthly, quarterly, yearly cycles
   - Rolling averages (7-day, 30-day) to smooth patterns
4. Visualization Layer
   - Time series plots: raw daily sales, moving averages
   - Boxplots: distribution of sales by weekday or month
   - Histograms/density plots: sales distribution
5. Anomaly & Outlier Detection
   - Unusual spikes/drops
   - Use Z-scores or interquartile ranges to flag anomalies
6. Correlation & Drivers of Sales
   - Correlation if needed
7. Performance Metrics (Baseline)
   Set benchmarks to prepare for forecasting models:
   - Average daily sales per SKU/store
   - Volatility (Coefficient of Variation)
   - Baseline forecast error (e.g., naïve forecast MAPE)
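As a rough illustration of items 1, 5 and 7, the checks could be sketched in pandas as below. The function names, the daily frequency, and the default thresholds (|z| > 3, 1.5 × IQR) are assumptions made for this sketch and are not specified in the brief; each function expects the sales of a single product/customer series.

```python
import numpy as np
import pandas as pd


def missing_dates(dates: pd.DatetimeIndex) -> pd.DatetimeIndex:
    """Calendar days between the first and last date that have no record.

    Assumes a daily frequency; weekends or holidays with no trading
    will show up here and need a business decision, not a blind fill.
    """
    full = pd.date_range(dates.min(), dates.max(), freq="D")
    return full.difference(dates)


def flag_anomalies(sales: pd.Series, z_thresh: float = 3.0,
                   iqr_k: float = 1.5) -> pd.DataFrame:
    """Flag each point by Z-score and by the IQR rule (assumed thresholds)."""
    std = sales.std(ddof=0)
    z = (sales - sales.mean()) / std if std > 0 else sales * 0.0
    q1, q3 = sales.quantile(0.25), sales.quantile(0.75)
    iqr = q3 - q1
    iqr_flag = (sales < q1 - iqr_k * iqr) | (sales > q3 + iqr_k * iqr)
    return pd.DataFrame({"z": z, "z_flag": z.abs() > z_thresh,
                         "iqr_flag": iqr_flag})


def coefficient_of_variation(sales: pd.Series) -> float:
    """Volatility as population std divided by mean."""
    mean = sales.mean()
    return float(sales.std(ddof=0) / mean) if mean != 0 else float("nan")


def naive_mape(sales: pd.Series) -> float:
    """MAPE (in percent) of the naive forecast y_hat[t] = y[t-1].

    Periods with zero actual sales are skipped, since MAPE is undefined there.
    """
    actual = sales.iloc[1:].to_numpy(dtype=float)
    forecast = sales.iloc[:-1].to_numpy(dtype=float)
    mask = actual != 0
    ape = np.abs(actual[mask] - forecast[mask]) / np.abs(actual[mask])
    return float(ape.mean() * 100)
```

On a full dataset these would typically be applied per group, for example via `groupby` on the product and customer columns (whatever they are called in the actual file), so that each series gets its own anomaly flags and baseline numbers.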
EDA Deliverables: By the end of an extensive EDA, I should have:
   - Clear understanding of demand patterns, seasonality, and anomalies
   - Insights into drivers of sales (internal like price/promo, external like weather/events)
   - Segmentation of products into high/medium/low performers
   - A baseline performance snapshot to compare forecasting models against
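The brief does not say how the high/medium/low segmentation should be done. One possible approach, shown here only as a sketch, is an ABC-style split on cumulative share of total volume; the 80% and 95% cut-offs and the function name `segment_products` are assumptions, not requirements from the brief.

```python
import pandas as pd


def segment_products(volume_by_product: pd.Series,
                     high_cut: float = 0.80,
                     medium_cut: float = 0.95) -> pd.Series:
    """Label each product high/medium/low by cumulative volume share.

    Products are ranked by volume; a product is labelled by the share of
    total volume already covered by the products ranked above it, so the
    top seller is always "high".
    """
    ranked = volume_by_product.sort_values(ascending=False)
    share_before = (ranked.cumsum() - ranked) / ranked.sum()
    return pd.cut(
        share_before,
        bins=[-float("inf"), high_cut, medium_cut, float("inf")],
        labels=["high", "medium", "low"],
        right=False,  # intervals are [lower, upper)
    )
```

The input would be total volume per product, for example the result of summing sales over the full period. Quantile-based tiers are an equally valid alternative if volume is not heavily concentrated in a few products.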
Part 2: After cleaning the data based on the above analysis, build a linear regression-based model in Python and/or PySpark to produce a sales volume forecast at product and customer level for 2022. Measure the forecast accuracy using suitable quality measures, and explain why you chose these measures.
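A minimal sketch of what Part 2 could look like for a single product/customer series is given below. It assumes the data has been aggregated to monthly totals starting in January, that 2019 to 2021 is used for training with 2022 held out for evaluation, and that the regression uses a linear trend plus month dummies; none of these choices is stated in the brief, and the function names are illustrative.

```python
import numpy as np


def design_matrix(t: np.ndarray, month: np.ndarray) -> np.ndarray:
    """Intercept, linear trend, and 11 month dummies (January is the baseline)."""
    dummies = (month[:, None] == np.arange(2, 13)[None, :]).astype(float)
    return np.column_stack([np.ones(len(t)), t.astype(float), dummies])


def fit_and_forecast(y_train, horizon: int = 12) -> np.ndarray:
    """Fit ordinary least squares on monthly history and forecast ahead.

    Assumes y_train[0] is a January value and there are no missing months.
    """
    y = np.asarray(y_train, dtype=float)
    n = len(y)
    t_all = np.arange(n + horizon)
    month_all = t_all % 12 + 1
    X = design_matrix(t_all, month_all)
    coef, *_ = np.linalg.lstsq(X[:n], y, rcond=None)
    return X[n:] @ coef


def accuracy(actual, forecast) -> dict:
    """Quality measures for a forecast against held-out actuals."""
    actual = np.asarray(actual, dtype=float)
    forecast = np.asarray(forecast, dtype=float)
    err = forecast - actual
    total = np.abs(actual).sum()
    return {
        "mae": float(np.mean(np.abs(err))),            # average miss in volume units
        "rmse": float(np.sqrt(np.mean(err ** 2))),     # penalises large misses more
        "wape": float(np.abs(err).sum() / total) if total > 0 else float("nan"),
        "bias": float(err.sum() / total) if total > 0 else float("nan"),
    }
```

The measures are chosen to complement each other: MAE is easy to read in volume units, RMSE highlights occasional large errors, WAPE gives a scale-free percentage that stays defined when individual months have zero sales (where MAPE breaks down), and bias shows whether the model systematically over- or under-forecasts. At product and customer level the same two functions would be run once per series, and the resulting metrics compared against the naive baseline from the EDA.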