RESULTS & CONCLUSIONS

Results

The system demonstrates significant improvements in performance and efficiency when leveraging the FPGA and custom-designed NPU compared to standalone software or previous designs. The results are summarized below:

CNN structure
Figure 1. Results

Compared to the Raspberry Pi software implementation, the FPGA + NPU setup is 218x faster, demonstrating the significant impact of hardware acceleration in reducing computation time and resource usage.

Comparison with 2023

Compared to the previous year's design, the 2024 system introduces significant improvements in both processing speed and architectural efficiency. Key comparisons are summarized below:

CNN structure
Figure 2. Comparison with 2023

These improvements reflect advancements in both hardware and system integration. By adopting CNNs, increasing operational frequency, and optimizing the architecture, the 2024 system sets a new benchmark in performance compared to last year's implementation.

Conclusions

This project showcases the power of custom hardware design in accelerating CNN inference for handwritten digit recognition. By transitioning to a CNN-based architecture, increasing operational frequency, and enhancing the system architecture, the current implementation surpasses previous designs in both speed and efficiency.

Key improvements this year include:

The combined improvements in architecture and hardware integration provide a strong foundation for further optimization and expansion, solidifying the project's potential for real-world AI applications.



Back to top