VAST lab at UCLA

The VAST lab at UCLA investigates cutting-edge research topics at the intersection of VLSI technologies, design automation,  architecture and compiler optimization at multiple scales, from micro-architecture building blocks,  to heterogeneous compute nodes, and scalable data centers.  Current focuses include architecture and design automation for emerging technologies, customizable domain-specific computing with applications to multiple domains, such as imaging processing, bioinformatics, data mining and machine learning.

Latest News

May 2, 2018 | 0 comments

From Apr 29th to May 1st, Zhenyuan Ruan, Weikang Qiao, Jie Wang, Tianhe Yu, Prof. Cong from VAST lab and Dr. Zhenman Fang (postdoc alumni from VAST lab) attended 2018 International Symposium on Field-Programmable Custom Computing Machines (FCCM...

April 26, 2018 | 0 comments

We are pleased to announce that Professors Jason Cong (CS) and Song-Chun Zhu (CS and Statistics) are part of the University of Virginia’s new $27.5M Center on Research in Intelligent Storage and Processing in Memory (CRISP)—one of six Joint...

April 7, 2018 | 0 comments

Prof. Cong delivered a distinguished lecture at the ECE Department of Northeastern University in Boston, MA on February 21, 2018. The title of Prof. Cong's speech is "Computing Near the End of Moore's Law". 

The link of the video is :...

Latest Publications

[PDF]: Scaling for edge inference of deep neural networks
Journal publication
Xiaowei Xu , Yukun Ding, Sharon Xiaobo Hu, Michael Niemier, Jason Cong, Yu Hu, and Yiyu Shi
S2FA: An Accelerator Automation Framework for Heterogeneous Computing in Datacenters
Conference publication
Cody Hao Yu, Peng Wei, Max Grossman, Peng Zhang, Vivek Sarkar, Jason Cong
Automated Accelerator Generation and Optimization with Composable, Parallel and Pipeline Architecture
Conference publication
Jason Cong, Peng Wei, Cody Hao Yu, Peng Zhang
[PDF]: Latte: Locality Aware Transformation for High-Level Synthesis
Conference publication
Jason Cong, Peng Wei, Cody Hao Yu, Peipei Zhou
[PDF]: ST-Accel: A High-Level Programming Platform for Streaming Applications on FPGA
Conference publication
Zhenyuan Ruan, Tong He, Bojie Li, Peipei Zhou, and Jason Cong
[PDF]: High-Throughput Lossless Compression on Tightly Coupled CPU-FPGA Platforms
Conference publication
Weikang Qiao, Jieqiong Du, Zhenman Fang, Michael Lo, Mau-Chung Frank Chang, Jason Cong
[PDF]: Understanding Performance Differences of FPGAs and GPUs
Conference publication
Jason Cong, Zhenman Fang, Michael Lo, Hanrui Wang, Jingxian Xu and Shaochong Zhang
[PDF]: Doppio: I/O-Aware Performance Analysis, Modeling and Optimization for In-Memory Computing Framework
Conference publication
Peipei Zhou, Zhenyuan Ruan, Zhenman Fang, Megan Shand, David Roazen, Jason Cong
[PDF]: RC-NVM: Enabling Symmetric Row and Column Memory Accesses for In-Memory Databases
Conference publication
Peng Wang, Shuo Li, Guangyu Sun, Xiaoyang Wang, Yiran Chen, Hai (Helen) Li, Jason Cong, Nong Xiao, and Tao Zhang

Our Projects

Heterogeneous computing with extensive use of accelerators, such as FPGAs and GPUs, has shown great promise to bring in orders of magnitude improvement in computing efficiency for a wide range of applications. The latest advances in industry have led to highly integrated heterogeneous hardware...

Moore’s law has been driving the exponential growth of information technology for more than 50 years, during which the ever-increased computing power has had a huge impact on people’s lives. AI algorithms such as reinforcement learning can help super powerful computers beat humans in specific...

In the Big Data era, the volume of data is exploding, putting forward a new challenge to the existing computer system. Traditionally, the computer system is designed to be computing-centric, in which the data from IO devices are transferred and then processed by the CPU. However, the data...

With the recent advancement of multilayer convolutional neural networks (CNN), deep learning has achieved amazing success in many areas, especially in visual content understanding and classification. To improve the performance and energy-efficiency of the computation-demanding CNN, the FPGA-...

Many applications in precision medicine present significant computational challenges.  For example, the computation demand for personalized cancer treatment is prohibitively high for the general-purpose computing technologies, as tumor heterogeneity requires great sequencing depths,...

With the increasing of the system complexity, the needs of system level design automation becomes more and more urgent. The maturity of high-level synthesis pushes the desgin abstraction from register-transfer level (RTL) to software programming language like C/C++. However, the state-of-art...

To meet ever-increasing computing needs and overcome power density limitations, the computing industry has entered theera of parallelization, with tens to hundreds of computing cores integrated into a single...

Software Releases

Cloud-scale BWAMEM (CS-BWAMEM) is an ultrafast and highly scalable aligner built on top of cloud infrastructures, including Spark and Hadoop distributed file system (HDFS). It leverages the abundant computing resources in a public or private cloud to fully exploit the parallelism obtained from...

With the rapid evolution of CPU-FPGA heterogeneous acceleration platforms, it is critical for both platform developers and users to quantify the fundamental microarchitectural features of the platforms. We developed a set of microbenchmarks to evaluate mainstream CPU-FPGA platforms.


PARADE is a cycle-accurate full-system simulation platform that enables the design and exploration of the emerging accelerator-rich architectures (ARA). It extends the widely used gem5 simulator with high-level synthesis (HLS) support.