MSc student @Mohamed bin Zayed University of Artificial Intelligence (MBZUAI) in Machine Learning. Previously research assistant @Tianjin University. Research interests: Machine Learning, Causality, Natural Language Processing and Optimization etc. Rightnow, I am supervised by Dr. Kun Zhang. I am also a member of Center of Integrated Artificial Intelligence (CIAI), lead by Prof. Eric Xing.
An implementation of OpenAI's GPT2 with Huawei's auto-differentiate framework MindSpore.
A collection of articles, presentations or talks, coming soon ;)
I was one of the main contributors of an open source repository developed mainly with MindSpore (Link). This repo reproduced OpenAI’s GPT-2 from scratch and the model achieved the same performance as the original model in translation, language modeling and summarization tasks. Specifically, I took in charge of text generation framework and deployment tools. I also fine tuned this GPT-2 model in summarization task on NVIDIA Docker and HUAWEI Ascend devices. Moreover, I devoted in model compression and other techniques which could help accelerate inferring on device like mobile phones and edge computing devices. What’s more, I engaged in applying algorithms which are originated from quantum physics and quantum information (e.g. tensor network) on machine learning applications on edge computing device (e.g. ARM based raspberry pi).
TA of Natural Language Understading. Gave a lecture about neural machine translation. Also gave a talk about pretrained language model and its application. Help design assignment and grading.
GPA: 4.0/4.0