Welcome to EzEpoch BETA
Choose how you'd like to start your AI training project
Name Your Project
Give your training setup a descriptive name
๐ฏ Project Name
Choose a name that describes your training configuration
๐ก Suggestions:
Data Processing - New Project
Upload, clean, and prepare your training data
๐ค Upload Training Data
Select your training data files (.json, .jsonl, .txt, .csv)
Drag & drop files here
or
Supported formats: .json, .jsonl, .txt, .csv, .tsv โ
Skip Data Upload
Have a large dataset or prefer to add data later?
Create a package without uploading data now - you'll add your data files to the package after download.
How does this work?
1. Skip to Training Setup โ
2. Configure your model and settings โ
3. Download package with empty data/ folder โ
4. Add your training files to the data/ folder before training
โ
Perfect for large datasets (>1GB)
โ
Saves upload time
โ
Full instructions included in package
๐ Previously Uploaded Data
Select from your previously uploaded files or upload new ones above
Smart Data Optimization
After upload, use Data Analysis & Cleaning to optimize your files for training performance.
- Detects and fixes content issues
- Removes short, unhelpful entries
- Splits overly long content into manageable chunks
- Preserves question-answer structure for training
๐ค Uploading to Cloud Storage
Uploading your files securely to cloud storage
๐งน Data Cleaning
Automatic data cleaning and quality assessment
โUpload data to see cleaning results
๐ผ๏ธ Link Images (Optional)
For multimodal training, link images to your text data
โโ Data Review
Review your processed data before training setup
Process data to see summary
Training Setup - New Project
Configure your AI model and training parameters
All parameters are automatically optimized for your selected model and GPU. Simply choose your model and GPU - the system handles batch size, memory optimization, and compatibility automatically!
๐ฏ Project Name
Give your training project a descriptive name
๐ค Model Selection
Choose your AI model and training type
๐ฅ๏ธ GPU Configuration
Configure GPU settings and memory optimization
โ๏ธ Training Parameters
Configure core training settings
Requirements - New Project
PyTorch + Transformers Compatibility & Package Management
Just click "Generate Requirements" and we'll create 66+ optimized packages with perfect compatibility. All conflicts are automatically resolved and installation order is optimized!
๐ PyTorch + Transformers Compatibility
Select the best combination for your setup
๐ง Individual Version Selection
Select specific versions or enable Auto to let the system choose compatible versions
๐ฆ Dependencies & Libraries
Required software packages and versions
Click "Generate Requirements" to create your dependency list.
Advanced Options - New Project
Fine-tune your training with advanced parameters
All settings are pre-optimized for your chosen AI model and GPU. You can proceed through the entire process with these defaults for excellent results, or customize any parameter below for specific needs.
๐ AI Advanced Setup
Let AI optimize your training parameters automatically
API Keys - New Project
Add your HuggingFace token for model access
Add your HuggingFace token to access gated models. Tokens are encrypted and stored securely. Click the ๐๏ธ button to show/hide your token safely.
๐ค HuggingFace Token
Your HuggingFace token is used to download and upload models to your account.
Get your token โ๐ GitHub Token (Optional)
GitHub token enables discovery of models from research repositories and increases API rate limits.
Create token โMulti-Repository Support
EzEpoch searches multiple AI model repositories to give you access to the latest models:
- ๐ค HuggingFace Hub - 500K+ models (requires token for gated models)
- ๐ GitHub Repositories - Research models and implementations (optional token)
- ๐ Papers with Code - State-of-the-art research models (no token needed)
- ๐ฅ PyTorch Hub - Native PyTorch models (no token needed)
GPT Auto-Analysis: All discovered models get optimized settings generated automatically!
๐ค AI Model Library
Browse, search, and manage AI models from HuggingFace
Browse and search from the world's largest AI model library. Use filters to find models compatible with your GPU and training goals. Your favorites and recent models are automatically saved!
๐ Repository Access Status
Connect to repositories to discover and access their models
HuggingFace Hub
500K+ models, gated access available
GitHub Repositories
Research models and implementations
Papers with Code
State-of-the-art research models
PyTorch Hub
Native PyTorch models
๐ Browse Models
Select a repository and browse available models
Select a repository to view available models...
๐ฆ Saved Packages
Manage your completed training packages
Download, organize, and delete your training packages. Search by name, filter by date, or sort by size. All packages are stored securely and ready for cloud deployment.
๐ฆ Complete Packages
Ready-to-deploy training packages 0 packagesNo Saved Packages
Create your first training package to see it here!
๐ค User Profile
Manage your account information and preferences
Update your profile information, manage your subscription, and configure notification preferences. All changes are saved automatically.
๐ Personal Information
๐ Contact Information
๐ค AI Monitoring Preferences
๐ API Hash & Sessions
No active sessions
๐ฆ Build Package - New Project
Create a complete training package for cloud deployment
Everything is ready! Just click "Create Package" and we'll generate a complete training package with all dependencies, scripts, monitoring tools, and your data folders. Ready for any cloud GPU platform!
๐ฅ๏ธ Platform Support
Universal package for RunPod, Vast.ai, Lambda Labs, AWS, GCP
๐ Package Contents
| File | Status | Size | Description |
|---|---|---|---|
| requirements.txt | โณ Pending | ~20KB | Python dependencies (~22 selective packages, EzSetup validated) |
| main.py | โ Complete | ~5KB | Primary training script with repo-specific model loading |
| all.env | โ Complete | ~600B | API keys, model repos, and training configuration |
| setup.sh | โ Complete | ~750B | Environment setup script with EzSetup ordering + venv creation |
| README.md | โ Complete | ~1KB | Setup instructions and quick-start guide for RunPod/Vast.ai |
๐ Package Creation
๐ณ Subscription & Billing
Manage your plan, sessions, and billing
Pay only when training actually starts, not during package creation. Automatic refunds for failures under 15 minutes. Sessions rollover at 50% for paid plans.
๐ Current Plan
๐งฎ How Sessions Work - Model Size Matters
Larger models require more GPT monitoring calls, so they use more sessions
0-20B Models
Mistral 7B, Llama 3.2 8B, Qwen 14B, etc.
21-70B Models
Llama 3.1 70B, Falcon 40B, Mixtral 8x7B, etc.
71-200B Models
Llama 3.2 90B, BLOOM 176B, Falcon 180B, etc.
200B+ Models
Llama 3.1 405B, GPT-3 scale models, etc.
๐ก Why Session Multipliers?
Larger models generate more training logs and require more frequent GPT analysis (every 100 steps). A 7B model might train for 4 hours, while a 200B model can run for 40+ hours. More monitoring = more GPT API costs = more sessions used. This keeps pricing fair and aligned with actual resource usage.
๐ Example Calculations:
- Starter Plan (5 sessions): Train 5x 7B models OR 2x 70B models OR 1x 200B model
- Developer Plan (20 sessions): Train 20x 7B models OR 10x 70B models OR 5x 200B models
- Professional Plan (50 sessions): Train 50x 7B models OR 25x 70B models OR 12x 200B models
๐ Available Plans
Choose the plan that fits your training needs
๐ Training Session History
Your recent training sessions and usage
๐ก๏ธ AI Training Insurance - Zero-Loss Guarantee
We turn 40% industry failure rates into 99% success rates. Guaranteed.
โ Our 99% Success Guarantee
Use EzEpoch's recommended settings and let our AI Guardian monitor your training. If it fails, we refund you. Period.
โ Covered - Full Refund
AI Guardian Failed
Used recommended settings but training crashed and our AI didn't recover it? Full refund.
Setup Configuration Errors
Package won't create, requirements wrong, or settings miscalculated? Full refund.
Platform Errors
Bootstrap auth fails, dashboard doesn't connect, or monitoring stops working? Full refund.
First Training Extended Protection
Your first training gets extra protection (30 minutes) while you learn the platform.
โ Not Covered - No Automatic Refund
Custom Settings Override
Changed batch size, learning rate, or other settings manually? You declined our AI protection.
Data Quality Issues
Training failed due to corrupted data, wrong format, or insufficient data? That's your data, not our platform.
GPU Provider Issues
RunPod crashed? Vast.ai down? GPU out of memory with your custom data? That's between you and your GPU provider.
User Stopped Training
You clicked stop/pause and decided not to continue? That's not a platform failure.
๐ก We Want You To Succeed!
๐ Crash Recovery: Our AI Guardian automatically analyzes crashes and restarts training with corrected settings. Resumes from last checkpoint - no progress lost!
๐ Unlimited Restarts: Each session includes unlimited restart attempts. Training keeps going until YOU succeed.
๐ฏ Expert Support: Stuck? Our team will help debug your setup, review your settings, and guide you to success.
๐ Dashboard Control: Monitor live metrics, adjust settings on-the-fly, and let GPT optimize every 100 steps automatically.
๐ฆถ So Easy, Bigfoot Can Do It!
Industry Standard: 40% of AI training jobs fail due to configuration errors, memory issues, and instability.
With EzEpoch: 99% success rate because our AI calculates perfect settings, monitors every 100 steps, and auto-recovers from crashes.
Think of it like insurance: We insure your SETUP and MONITORING. You provide the GPU and data. Follow our recommended settings โ We guarantee success. Override our settings โ You're on your own (but we'll still help!).
๐ฐ Request Refund
Training failed using our recommended settings? Request your refund below.
โน๏ธ Refund Requirements
- Must have used EzEpoch's recommended settings (no custom overrides)
- Platform error must be verified (setup, monitoring, or dashboard failure)
- GPU provider issues are not covered (contact your GPU provider)
- Data quality issues are not covered (we'll help you fix your data)
- First training gets extended grace period automatically
๐ธ Visual Help Guide
Step-by-step screenshots showing how to use every feature
โ๏ธ Training Setup
Configure your AI model and training parameters
Key Features:
- โ Project naming and organization
- โ Model type selection (Text, Vision, Audio, Multimodal)
- โ GPU configuration and memory optimization
- โ Automatic batch size calculation with live preview
- โ Training parameter configuration (Batch Size, Learning Rate, Epochs)
๐ก Tip: For multimodal training, select combinations like Text + Vision. The system automatically calculates combined memory requirements and optimal batch size.
๐ง Advanced Options
Fine-tune precision, optimizers, and training modes
๐ฏ Key Features:
- โ ๐ค AI-Optimized Defaults: All settings pre-configured by AI for your model and GPU
- โ โก Precision Settings: FP16, BF16, FP32, INT8, INT4 for memory optimization
- โ ๐ง Smart Optimizers: AdamW, Lion, SGD, Adafactor with AI-recommended settings
- โ ๐๏ธ Training Methods: Full Fine-tuning, LoRA, QLoRA with automatic configuration
- โ ๐ AI Monitoring: GPT monitors every 100 steps, analyzes metrics, provides real-time insights
- โ ๐ Dynamic Adjustment: Change learning rate scheduler, save strategy, evaluation strategy on-the-fly
๐ What Sets EzEpoch Apart: Our AI Guardian continuously analyzes your training progress (loss, gradients, GPU usage) and provides intelligent recommendations to optimize performance, prevent failures, and maximize efficiency - something no other platform offers!
๐ Requirements
Generate dependencies and check for conflicts
Key Features:
- โ PyTorch + Transformers + CUDA compatibility presets
- โ Auto-Select preset chooses optimal versions for your GPU
- โ Automatic dependency generation (66+ optimized packages)
- โ Conflict-free installation with proper ordering
- โ Latest, Stable, Older, Legacy, and CPU-only presets available
๐ก Tip: Use "Auto-Select (Recommended)" preset for best compatibility. It automatically picks the right PyTorch, Transformers, and CUDA versions for your selected GPU.
๐ API Keys
Set up authentication for external services
Key Features:
- โ HuggingFace token for model access (required for gated models)
- โ GitHub token for research repositories (optional, increases rate limits)
- โ Eyeball button (๐๏ธ) to show/hide tokens securely
- โ Test & Save validates tokens before storing
- โ Multi-repository support: HuggingFace, GitHub, Papers with Code, PyTorch Hub
๐ก Tip: Click the ๐๏ธ button to toggle token visibility while entering. All tokens are encrypted and securely stored. Only HuggingFace token is required for most models.
๐ฆ Build Package
Create complete training packages for deployment
Key Features:
- โ Universal platform support (RunPod, Vast.ai, Lambda Labs, AWS, GCP)
- โ Package contents preview (requirements.txt, main.py, all.env, setup.sh, README.md)
- โ Custom package naming (defaults to project name)
- โ One-click package creation
- โ Core files streamed from server for updates and flexibility
๐ก Tip: After creating your package, download the ZIP and upload it to your preferred GPU provider. Most files (like main.py, requirements.txt) are streamed from our server during bootstrap.sh execution, allowing us to provide updates without re-downloading.
๐ค AI Models
Browse and select from thousands of AI models
Key Features:
- โ Browse 100,000+ models from HuggingFace Hub
- โ Text, Vision, and Audio models organized by category
- โ Repository access status (HuggingFace, GitHub, Papers with Code, PyTorch Hub)
- โ Model details: size, type, description
- โ "Add to Setup" button instantly configures the selected model
๐ก Tip: Filter models by type (Text, Vision, Audio) and use the search bar to find specific models. All models include AI-optimized settings automatically generated for your GPU.
๐ค AI-Powered Dashboard
Revolutionary AI monitoring that sets EzEpoch apart
GPT monitors your training and provides real-time insights
๐ Revolutionary AI Features:
- ๐ค GPT Training Analysis: AI continuously monitors training progress and identifies issues
- ๐ก Smart Recommendations: Real-time suggestions for parameter adjustments
- โ ๏ธ Predictive Alerts: AI predicts and prevents training failures before they happen
- ๐ Intelligent Optimization: Automatic parameter tuning based on training patterns
- ๐ฏ Performance Insights: AI explains why certain settings work better
- ๐ Adaptive Learning: System learns from your training patterns to improve future sessions
๐ EzEpoch's Competitive Edge: We're the ONLY platform that uses GPT to actively monitor and optimize your training in real-time. While others just show metrics, we provide intelligent insights that save you time, money, and prevent costly failures!
๐ค Profile
Manage your account and preferences
Key Features:
- โ Personal information and contact details management
- โ AI monitoring preferences (email notifications, frequency)
- โ Your unique API hash for authentication
- โ Active training sessions display
- โ Timezone configuration for accurate scheduling
๐ก Tip: Your API hash is used for secure authentication across all EzEpoch services. Copy it for use in your training packages or dashboard access.
๐พ Saved Packages
Manage your training packages and downloads
Key Features:
- โ View all your created training packages
- โ Download packages for cloud deployment
- โ Search and sort by date or size
- โ Package details (config, settings used)
- โ Delete old packages to free space
๐ก Tip: Each package is stored with a unique ID and includes all configuration, requirements, and scripts. Simply download and upload to your GPU provider to start training!
๐ Complete Workflow Guide
Step-by-step process from setup to deployment
Training Setup
Configure project, select model type, and set GPU parameters
Advanced Options
Fine-tune precision, optimizers, and training methods
Requirements
Generate dependencies and check for conflicts
API Keys
Set up HuggingFace and GitHub authentication
Build Package
Create complete training package for cloud deployment
โฑ๏ธ Total Time: 15-45 minutes depending on complexity
๐ฏ Result: Ready-to-deploy training package with all dependencies
๐ Complete Settings Dictionary
Comprehensive guide to every parameter and setting
๐ฏ Training Parameters
โก Precision & Optimization
๐ง Optimizers
๐๏ธ Training Methods
๐ฅ๏ธ GPU & Memory
๐ค AI Monitoring (EzEpoch Exclusive)
๐ง MyEzSetup - Dependency Checker
Intelligent dependency management and conflict resolution
๐ฏ Key Features:
- โ Dependency Compatibility Checker: Paste your requirements.txt and get instant compatibility analysis
- โ Auto-Generated Pinned Files: Creates conflict-free requirements.txt with proper version pinning
- โ Installation Scripts: Generates platform-specific setup scripts (install.sh / install.bat)
- โ CUDA Support: CPU-only, CUDA 11.8, and CUDA 12.1 configurations
- โ Platform Detection: Auto-detects Windows, Linux, macOS requirements
- โ API Access: RESTful API for integration with training pipelines
๐ Access MyEzSetup: Visit www.myezsetup.com or ezsetup.ezepoch.com
๐ก How It Works:
- Paste your requirements.txt file
- Run compatibility check to identify conflicts
- Select target platform and CUDA version
- Generate pinned requirements.txt and installation script
- Copy the script and run it on your system
๐ Free Tier: Sign up for free and get 5 dependency checks per month. Upgrade to MyEzSetup ($4.99/month) or MyEzSetup API ($9.99/month) for unlimited access.