SIMD Vectorization

Demonstration of speed-up using SIMD vectorization in C compiled using GCC compiler flags and Intel SIMD Intrinsics. The simd.c program performs two types of matrix multiplications - 1. using No optimization, 2. using Intel SIMD SSE 2 Intrinsics. Similarly, the simd_add.c program performs vector addition.

How to use?

Works best on Unix-like systems. Have a GCC compiler installed and an Intel Processor.

To test the speed-up:

Compile any of the two programs with O3 flag or above (max O4) and msse2 flag to compile with optimize with SIMD vectorization options turned on.

$ gcc -O3 -msse2 -DCLS=$(getconf LEVEL1_DCACHE_LINESIZE) -o <outfile> <filname>.c

Run the binary and timings will be shown.

$ ./outfile

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
README.md		README.md
simd.c		simd.c
simd_add.c		simd_add.c
timer.h		timer.h

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SIMD Vectorization

How to use?

About

Uh oh!

Languages

Diamagnetic/simd_vectorization

Folders and files

Latest commit

History

Repository files navigation

SIMD Vectorization

How to use?

About

Resources

Uh oh!

Stars

Watchers

Forks

Languages