Sitemap

Home

Eric Bainville

FPGA Simple UART
 • Introduction
 • Serial I/O
 • Reception
 • Transmission
 • Testing

OpenCL Sorting
 • Introduction
 • Parallel selection
 • Parallel selection, local
 • Parallel bitonic, local
 • Parallel bitonic I
 • Parallel bitonic II
 • Parallel merge, local

OpenCL FFT
 • Introduction
 • DFT
 • FFT
 • Reference implementations
 • Single and double precision
 • Radix-2 kernel
 • Higher radix kernels
 • Benchmarks

OpenCL FFT (OLD)
 • Introduction
 • DFT
 • FFT
 • Reference implementations
 • Radix-2 kernel
 • Radix 4,8,16,32 kernels
 • Radix-r kernels benchmarks
 • One work-group per DFT (1)
 • One work-group per DFT (2)

OpenCL GEMV
 • Introduction
 • One thread per dot product
 • P threads per dot product
 • Two kernels
 • Conclusions

OpenCL Multiprecision
 • Introduction
 • Multiplication Algorithms
 • Schönhage-Strassen
 • OpenCL SSA I
 • OpenCL SSA II

GPU Mandelbrot Set
 • Introduction
 • Simple C Code
 • Simple OpenCL Code
 • Fixed Point Reals
 • fp128 for OpenCL
 • Benchmarks

GPU Benchmarks
 • Introduction
 • Memory operations
 • Addition
 • Available Flops
 • Product by one digit

SSE gradient
 • Introduction
 • C implementation
 • SSE instructions
 • SSE right shift

SSE dot product
 • Introduction
 • C implementation
 • SSE code

AMD64 Multiprecision
 • Introduction
 • Memory Zero
 • Unary OP
 • Memory Copy
 • Binary OP
 • Binary OP NOT
 • Scaling
 • Multiply and accumulate
 • Left and right shifts
 • Division by a scalar

Intel64 Multiprecision
 • Introduction
 • Memory Zero
 • Unary OP
 • Memory Copy
 • Binary OP

POV-Ray Buttons/Logos
 • Introduction
 • Shapes
 • Content
 • Colors
 • Sky reflections
 • Glowing effect

Projective Geometry
 • Introduction
 • Points and Planes
 • Duality
 • Lines I
 • Tools for Plücker coordinates
 • Lines II
 • Transformations
 • Polarities
 • Quadrics I

Quaternions
 • Definitions
 • Quaternions and Rotations
 • Conversions