Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

A Practical Guide to Running AI/ML Experiments on GCP

A hands-on tutorial for beginners who want to use Google Cloud Platform (GCP) for large-scale machine learning experiments or AI workloads.

🎯 What You’ll LearnΒΆ

πŸ“‹ PrerequisitesΒΆ

πŸ•₯ DurationΒΆ

4-6 hours (can be split into multiple sessions)

πŸ“š Tutorial StructureΒΆ

Core PathΒΆ

PartTitleTimeLearning Objective
1GCP Foundations30 minsUnderstand GCP and interact with it via the console and gcloud CLI
2Running Experiments on GCP VMs45 minsLearn how to run AI/ML experiments on GCP virtual machines with GPU support
3Vertex AI65 minsUnderstand how to use the managed Vertex AI service for training jobs and notebooks
4Cost Management30 minsUnderstand budget alerts and best practices for cost management

Appendix (Optional)ΒΆ

AppendixTitleTimeFocus
AGPU Memory Maths45 minsUnderstand how to estimate GPU memory requirements, avoid OOM errors, and choose the right GPU for your needs

πŸ’‘ What Makes This Tutorial DifferentΒΆ

This tutorial teaches you what you need to run AI/ML experiments on GCP, emphasizing:

The content is also structured to be used on-demand so you do not have to read the entire tutorial.

πŸŽ“ Who This is ForΒΆ

This tutorial was particularly designed for AIMS AI for Science students who do not have a background in cloud computing or software engineering, helping them use GCP more effectively for their research projects.

However, it should be generally helpful for:

🧠 How to Use This Tutorial¢

Start from Part 1 and work through sequentially.

Option 2: Jump to What You NeedΒΆ

πŸ«±πŸΏβ€πŸ«²πŸΎ ContributingΒΆ

This tutorial is fully open-source! We welcome:

See CONTRIBUTING.md for details.

πŸ“„ LicenseΒΆ

This tutorial is released under the CC-BY-4.0 License. See LICENSE for details.


Let’s get started! β†’ Part 1: GCP Foundations