COMP4901D: Embedded AI Systems (Spring 2026)
Computer Science and Engineering, Hong Kong University of Science and Technology
How to run AI models on mobile or embedded devices? What are the unique challenges in embedded AI systems?
Instructor: Xiaomin Ouyang
Class Time: Wednesday/Friday 3pm - 4:20pm (Room 2304, Lift 17/18)
Email: xmouyang@cse.ust.hk (For course-related queries, please use the subject starting from [COMP4901D])
Office Hour: Friday 4:30pm - 5:30pm, Room 3562, Lift 27/28
Teaching Assistants: Mr. Haoxian Liu (hliueu@connect.ust.hk) and Miss Liyu Zhang (lzhangcx@connect.ust.hk) (For enrolled students: Recorded videos for absent classes can be requested by emailing the TA directly.)
Communication and Announcement: Canvas is the main platform for communication and announcement. For students auditing without official enrollment, please email me your name, student ID, and email address to access the course resources not available here and to stay updated with any course changes.
Course Description: This course will introduce the techniques on deploying AI models on ubiquitous yet resource-constrained platforms such as embedded, mobile, and edge devices. Students will develop a deep understanding of embedded and on-device AI systems, gain hands-on experience in implementation, and learn the practical challenges of applying AI in real-world scenarios.
The major topics include 1) Fundamentals of machine learning and embedded systems; 2) Data and system challenges in embedded AI; 3) AI techniques and their efficient deployment on cutting-edge platforms; 4)Real-world applications, such as mobile agent and smart health. The course structure will primarily consist of instructor lectures, paper presentations and hands-on projects/labs.
Course Objectives: After completion of this course, the students will have an in-depth understanding of challenges in embedded AI systems, and be equipped to deploy AI models, including large language models (LLMs), on embedded, mobile and edge devices for applications such as mobile agents, smart health, and more.
Pre-requisite: (COMP 2012 OR COMP 2012H) AND COMP 2211. Basic understanding of machine learning is required. Familiarity or experience with the fundamentals of embedded or mobile systems are preferred.
Grading Scheme:
- Class attendance & discussion: 10%
- Paper presentation: 10%
- Team Project: 40%
- Proposal presentation (10%)
- Midterm presentation (10%)
- Final presentation (10%)
- Final report (10%)
- Final Exam: 40%
Reference Materials:
- Conferences: Proceedings of MobiCom/MobiSys/SenSys/UbiComp/IPSN/IoTDI/NeurIPS/ICML/ICLR/CVPR/ACL
- Book: Siam, S.I., et al. (2024). Artificial Intelligence of Things: A Survey. ACM Transactions on Sensor Networks. website
Course Syllabus
| Date | Topics | Materials | Note |
|---|---|---|---|
| Feb 4 (Wednesday) | Course Introduction and Overview | Lecture 0 | |
| Feb 6 (Friday) | Machine Learning Basics | Lecture 1 | |
| Feb 11 (Wednesday) | Course Projects & Challenges in Embedded AI Systems | Lecture 2 | |
| Feb 13 (Friday) | Challenges in Embedded AI Systems | Lecture 3 | |
| Feb 18 (Wednesday) | No class (Public Holiday) | ||
| Feb 20 (Friday) | Cancelled (Makeup on Feb 23) | ||
| Feb 23 (Monday) | AI Techniques for Distributed and Imperfect Data - Unsupervised Learning | ||
| Feb 25 (Wednesday) | AI Techniques for Distributed and Imperfect Data - Unsupervised Learning | Paper Presentation | |
| Feb 27 (Friday) | AI Techniques for Distributed and Imperfect Data - Multimodal and Federated Learning | ||
| Mar 4 (Wednesday) | AI Techniques for Distributed and Imperfect Data - Multimodal and Federated Learning | Paper Presentation | |
| Mar 6 (Friday) | **Project Proposal Presentation & Feedback** | 8min pre + 2min QA | |
| Mar 11 (Wednesday) | Physics-strengthen AI for Sensing Systems | ||
| Mar 13 (Friday) | Physics-strengthen AI for Sensing Systems | Paper Presentation | |
| Mar 18 (Wednesday) | Efficient Deep Learning on the Edge – Model Compression | ||
| Mar 20 (Friday) | Efficient Deep Learning on the Edge – Runtime Optimization | ||
| Mar 25 (Wednesday) | Efficient Deep Learning on the Edge | Paper Presentation | |
| Mar 27 (Friday) | **Midterm Project Presentation** | 15min pre + 5min QA | |
| April 1 (Wednesday) | **Midterm Project Presentation** | 15min pre + 5min QA | |
| April 3 (Friday) | No class (Midterm Break) | ||
| April 8 (Wednesday) | No class (Midterm Break) | ||
| April 10 (Friday) | LLMs and Foundation Models on the Edge - Applications | ||
| April 15 (Wednesday) | LLMs and Foundation Models on the Edge - Applications | Paper Presentation | |
| April 17 (Friday) | LLMs and Foundation Models on the Edge - Efficiency | ||
| April 22 (Wednesday) | LLMs and Foundation Models on the Edge - Efficiency | Paper Presentation | |
| April 24 (Friday) | Applications and End-to-end Systems | ||
| April 29 (Wednesday) | Applications and End-to-end Systems | Paper Presentation | |
| May 1 (Friday) | No class (Public Holiday) | ||
| May 6 (Wednesday) | **Final Project Presentation** | 15min pre + 5min QA | |
| May 8 (Friday) | **Final Project Presentation** | 15min pre + 5min QA | |
| May 9 (Saturday) | Report Deadline |