https://github.com/cornell-zhang/facedetect-fpga

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
○
codemeta.json file
○
.zenodo.json file
✓
DOI references
Found 4 DOI reference(s) in README
○
Academic publication links
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (13.7%) to scientific vocabulary

Last synced: 6 months ago · JSON representation

Repository

Basic Info

Host: GitHub
Owner: cornell-zhang
License: bsd-3-clause
Language: C++
Default Branch: master
Size: 646 KB

Statistics

Stars: 41
Watchers: 10
Forks: 35
Open Issues: 3
Releases: 0

Created about 9 years ago · Last pushed almost 6 years ago

https://github.com/cornell-zhang/facedetect-fpga/blob/master/

Publications
------------------------------------------------------------------------

If you use this benchmark in your research, please cite our [FPGA'17
paper][4]:

```
  @inproceedings{srivastava-facedetect-fpga2017,
    title     = {Accelerating Face Detection on Programmable SoC Using
                 C-Based Synthesis},
    author    = {Nitish Srivastava and Steve Dai and Rajit Manohar and
	         Zhiru Zhang},
    booktitle = {25\textsuperscript{th} ACM/SIGDA International
		 Symposium on Field-Programmable Gate Arrays},
    month     = {Feb},
    year      = {2017},
    doi       = {10.1145/3020078.3021753},
  }
```

 [4]: http://dx.doi.org/10.1145/3020078.3021753

**NOTE: If you face any issues while running anything in this repo, 
shoot me an email at nks45@cornell.edu**

facedetect-fpga is an open-source implementation of Viola-Jones face
detection algorithm suitable for C-based synthesis. It was introduced 
at FPGA-25 in February, 2017. 

This design explores the flow from software based implementation to 
an optimized C/C++ design suitable for High Level Synthesis (HLS) flow.
The face detection system is optimized for performance at the
C/C++ level and is synthesizable with a full-system compiler 
SDSoC from Xilinx. The design is suitable for real-time face detection 
applications and achieves a frame rate of 30 fps. The design has been 
tested using Vivado HLS 2016 and SDSoC 2016 on ZC-706 board with
Xilinx Zynq-7000 XC7Z045 FPGA and ARM Cortex-A9 CPU. The generated 
RTL code (SystemC, VHDL and Verilog) from the HLS compilers can 
potentially be used for other purposes as well. 

Tutorial
------------------------------------------------------------------------
### Pre-requisites

You will need to install Vivado-HLS and SDSoC to compile and run this
project.

### C-simulation

The face detection benchmark has been written in C. The current version
also supports C simulation so that functionality of the algorithm can be
tested without going through the painful process of bitstream generation.

To do the C-simulation of the design enter the following command in the
main directory:

    % make csim

This will create a hls.prj directory containing all the logs for the C
simulation. The output of the face-detection can be seen in the directory
hls.prj/solution/csim/build/Output.pgm Make sure that all the faces have
been detected and marked using rectangles.

One should always do C-simulation before synthesizing the design as it
is fast and detects the trivial syntactic/algorithmic errors.

### Bitstream generation

To synthesize the design for FPGA type the following command in the main
directory:

    % make

This will produce two folders:

    ./_sds
    ./sd_card

The \_sds folder will have all the reports generated by SDSoC and
Vivado-HLS.  The sd\_card folder will have the bitstream and the
associated files which will be used to port face-detection accelerator
onto the FPGA.

The timing and utilization reports generated by Vivado-HLS can be found
in the following directory:

    _sds/vhls/haar/solution/syn/report

The log generated by Vivado-HLS can be found as:

    _sds/vhls/vivado_hls.log

The area, utilzation and timing reports generated by SDSoC can be found
in the following directory:

    _sds/reports

For reference I have uploaded these two folders from my own compilation 
here: http://people.ece.cornell.edu/nks45/face-detect-compiled/

### Porting Face-detection accelerator onto the FPGA

To port the face-detection accelerator onto the FPGA copy all the files
in the sd\_card folder into the SD-card of the FPGA. After that you can
reboot the FPGA board and connect it to your computer using UART. Go to
the mnt folder and run face.elf binary

    $ cd /mnt
    $ ./face.elf

This will print the co-ordinates of all the rectangles detected on the
screen and will also give the time taken to detect the faces. Wait for a
while ( 4-5 seconds ) and remove the SD-card and insert it in your
computer. You will see an Output.pgm file with rectangles drawn around
all the faces. The wait time of 4-5 seconds is to make sure that this
image is written properly.

### Adding a new image

The face-detection algorithm can only process the images when each pixel
in the image is represented as a 8-bit number. The gen\_dataset folder
has the files to generate the hex images using the images in pgm format.
To add a new 320 X 240 image (pgm format) copy it to the folder
gen\_dataset. Edit the input and the output file names in the main
function in gen\_dataset/gen\_image.cpp and compile and run it:

    % cd gen_dataset
    % cp /path/to/image/test-image.pgm  .

    Compile the code
    % g++ gen_image.cpp -o gen_image
    % ./gen_image test-image.pgm test-image.h


This will produce the .h file for the image with the image pixels in the
hex format. Note that the original pgm image must be 320 X 240 pixels. 
Copy this .h file to the main directory:

    % cp test-image.h ../.

And then edit the main.cpp file to have this line instead:

    #include "image0_320_240.h"

Now you have added a new image. Do the C-simulation as described above.
Make sure the C-simulation is working and producing the right results.
Generate the bitstream and run it on the FPGA.


### Stages of Optimizations

The source code for different stages of optimizations mentioned in our
paper is also added to the repository so that users can get a feel of
the effects of each and every optimzation. There are 3 extra folders
which are added to the repo:

Baseline : This folder has the baseline implementation which replaces
all the non synthesizable constructs.

Pipelined : This folder has the code when all the classifiers are
pipelined.

Parallel\_and\_Pipeline : This folder has the code when the classifiers
in the first 3 stages are in parallel and the ones in  the other 22
stages are in pipeline.

The main folder has the code which combines all these optimizations.

### Resource Usage

The resource usage at each stage of optimization is given below:


License
------------------------------------------------------------------------

face-detection benchmark is offered under the terms of the Open Source
Initiative BSD 3-Clause License. More information about this license can
be found here:

 - http://choosealicense.com/licenses/bsd-3-clause
 - http://opensource.org/licenses/BSD-3-Clause

Owner

Name: Cornell Zhang Research Group
Login: cornell-zhang
Kind: organization

Website: https://zhang.ece.cornell.edu/
Repositories: 12
Profile: https://github.com/cornell-zhang

GitHub Events

Total

Watch event: 3
Fork event: 2

Last Year

Watch event: 3
Fork event: 2

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science