Resource management and real-time scheduling of embedded AI inferences


The thesis explores the potential of efficient resource management and real-time scheduling strategies for deploying advanced deep neural networks (DNNs), such as transformers, on low-power embedded systems. Possible areas of focus include creating real-time scheduling algorithms for timely and efficient AI inferences, optimizing the use of low-power AI accelerators, improving SRAM and memory bandwidth utilization, and implementing dynamic power and thermal management techniques. Practical applications like virtual reality and real-time text generation will be considered to validate the effectiveness of these methods, aiming to enhance the deployment of efficient and reliable AI solutions in resource-constrained environments.

Feel free to contact Binqi Sun (binqi.sun@tum.de) for latest available topics. You are also welcome to bring new ideas/challenges!

 

Requirements

Hands-on experience with DNN tranining and inference. 

Basic knowledge of combinatorial optimization and scheduling. 

Thesis Type

Semesterarbeit | Masterarbeit

Contact

Binqi Sun

Gebäude 5501 Raum 2.102a

+49 (89) 289 - 55183

binqi.sun@tum.de