MMX |
| title |
order # |
date |
on-line? (FILE or HTTP) |
| overview |
243081-002 |
06/97 |
PDF |
PDF |
| programmer's reference |
243007-002 |
03/96 |
PDF |
PDF |
| developer's guide |
243006-001 |
03/96 |
PDF |
PDF |
| at a glance |
243100-003 |
06/97 |
PDF |
PDF |
| in business applications |
243367-002 |
06/97 |
PDF |
PDF |
| optimized string search with MMX technology |
??? |
??? |
PDF |
PDF |
| new EMMS usage guidelines |
??? |
03/96 |
PDF |
PDF |
| how to use FP or MMX in ring 0 or a VxD under Windows 95 |
??? |
03/96 |
PDF |
PDF |
| survey of Pentium PeMo capabilities and tools |
??? |
03/96 |
PDF |
PDF |
| implement data alignment |
??? |
03/96 |
PDF |
PDF |
| get bits from a data stream |
??? |
03/96 |
PDF |
PDF |
| complex 16-bit FFT |
??? |
03/96 |
PDF |
PDF |
| transpose a matrix |
??? |
03/96 |
PDF |
PDF |
| compute a 16-bit vector |
??? |
03/96 |
PDF |
PDF |
| compute the L1 norm between two 16-bit vectors |
??? |
03/96 |
PDF |
PDF |
| compute the L2 norm between two 16-bit vectors |
??? |
03/96 |
PDF |
PDF |
| simple vector operations |
??? |
03/96 |
PDF |
PDF |
| efficient vector/matrix multiply routine |
??? |
03/96 |
PDF |
PDF |
| 16-bit x 31-bit multiplication |
??? |
03/96 |
PDF |
PDF |
| fractals |
??? |
03/96 |
PDF |
PDF |
| column filter |
??? |
03/96 |
PDF |
PDF |
| median filter |
??? |
03/96 |
PDF |
PDF |
| row filter |
??? |
03/96 |
PDF |
PDF |
| 16-bit FIR filter |
??? |
03/96 |
PDF |
PDF |
| SchurWeiner filter |
??? |
03/96 |
PDF |
PDF |
| video loop filter |
??? |
03/96 |
PDF |
PDF |
| convert 24-bit true color to 16-bit high color |
??? |
03/96 |
PDF |
PDF |
| convert YUV12 to RGB |
??? |
03/96 |
PDF |
PDF |
| convert RGB to YUV |
??? |
03/96 |
PDF |
PDF |
| bilinear interpolation of video RGB values |
??? |
03/96 |
PDF |
PDF |
| 2x 8-bit image scaling |
??? |
03/96 |
PDF |
PDF |
| alpha blending |
??? |
03/96 |
PDF |
PDF |
| Gouraud shading |
??? |
03/96 |
PDF |
PDF |
| 2x2 Haar transform |
??? |
03/96 |
PDF |
PDF |
| procedural texture mapping |
??? |
03/96 |
PDF |
PDF |
| advanced procedural texturing |
??? |
03/96 |
PDF |
PDF |
| 2D sprite overlay |
??? |
03/96 |
PDF |
PDF |
| AGP and 3D graphics software |
??? |
03/96 |
PDF |
PDF |
| 3D bilinear texture mapping |
??? |
03/96 |
PDF |
PDF |
| 3D geometry transformations |
??? |
03/96 |
PDF |
PDF |
| 3D rendering |
??? |
03/96 |
PDF |
PDF |
| absolute difference in motion estimation |
??? |
03/96 |
PDF |
PDF |
| motion compensation for MPEG1 video playback |
??? |
03/96 |
PDF |
PDF |
| fast iDCT algorithm for MPEG decoding |
??? |
03/96 |
PDF |
PDF |
| 1/3T equalizer |
??? |
03/96 |
PDF |
PDF |
| 2/3T equalizer |
??? |
03/96 |
PDF |
PDF |
| echo sound effects |
??? |
03/96 |
PDF |
PDF |
| synthesis sub-band filter for MPEG audio decoding |
??? |
03/96 |
PDF |
PDF |
| G.728 codebook search |
??? |
03/96 |
PDF |
PDF |
| Levinson Durbin algorithm |
??? |
03/96 |
PDF |
PDF |
| Viterbi decoding |
??? |
03/96 |
PDF |
PDF |
| modem baseband canceler |
??? |
03/96 |
PDF |
PDF |
| modem passband canceler |
??? |
03/96 |
PDF |
PDF |
| MMX code macro file for use with Microsoft MASM assembler |
IAMMX.INC |
03/97 |
TXT |
TXT |
| MMX code macro file for use with Borland TASM assembler (v5.0) |
IBMMX.INC |
03/97 |
TXT |
TXT |
| MMX code macro file for use with Borland TASM assembler (older than v5.0) |
ICMMX.INC |
03/97 |
TXT |
TXT |
SSE |
| title |
order # |
date |
on-line? (FILE or HTTP) |
| software conventions 2.1 (AP-589) |
243873-002 |
01/99 |
PDF |
PDF |
| diffuse directional lighting 1.3 (AP-596) |
243630-004 |
01/99 |
PDF |
PDF |
| 3D transformation 1.3 (AP-597) |
243631-004 |
01/99 |
PDF |
PDF |
| FIR and IIR filtering 1.1 (AP-598) |
243547-002 |
01/99 |
PDF |
PDF |
| course-grain multithreading 1.1 (AP-802) |
243636-002 |
01/99 |
PDF |
PDF |
increasing the accuracy of the results from the reciprocal and reciprocal square root
instructions using the Newton-Raphson method 2.1 (AP-803)
|
243637-002 |
01/99 |
PDF |
PDF |
| integer minimum or maximum element search 2.1 (AP-804) |
243638-002 |
01/99 |
PDF |
PDF |
find the maximum/minimum element of single precision FP vector and its corresponding
index 1.2 (AP-805)
|
243639-002 |
01/99 |
PDF |
PDF |
| antialiasing 2.1 (AP-806) |
243640-002 |
01/99 |
PDF |
PDF |
| Wiener filtering 2.1 (AP-807) |
243641-001 |
01/99 |
PDF |
PDF |
| split-radix fast Fourier transformation 2.1 (AP-808) |
243642-002 |
01/99 |
PDF |
PDF |
| 32-bit FP real & complex 16-tap FIR filter 2.1 (AP-809) |
243643-002 |
01/99 |
PDF |
PDF |
| evaluate a hidden Markov model with Viterbi decoding 2.1 (AP-811) |
243645-002 |
01/99 |
PDF |
PDF |
| collision detection 1.1 (AP-812) |
243646-002 |
01/99 |
PDF |
PDF |
| software development strategies 2.1 (AP-814) |
243648-002 |
01/99 |
PDF |
PDF |
| fast DCT algorithm for MPEG encoding 1.2 (AP-817) |
243651-002 |
01/99 |
PDF |
PDF |
| motion estimation algorithm for MPEG encoding 1.2 (AP-818) |
243652-002 |
01/99 |
PDF |
PDF |
| phong equation 2.1 (AP-819) |
243653-002 |
01/99 |
PDF |
PDF |
| RGB alpha saturation 2.1 (AP-820) |
243654-002 |
01/99 |
PDF |
PDF |
| 1/3 T software equalizer 1.1 (AP-821) |
243655-002 |
01/99 |
PDF |
PDF |
| a smoothed upsample algorithm 2.1 (AP-822) |
243656-002 |
01/99 |
PDF |
PDF |
| deformable surfaces 1.1 (AP-832) |
243631-004 |
01/99 |
PDF |
PDF |
| data alignment and programming issues with the Intel C/C++ compiler 1.1 (AP-833) |
243872-002 |
01/99 |
PDF |
PDF |
| identifying support for SSE in the processor and OS 1.1 (AP-900) |
244413-002 |
01/99 |
PDF |
PDF |
fast precise implementation of 8x8 discrete cosine transformation using SSE and MMX
instructions 1.0 (AP-922)
|
742474-001 |
04/99 |
PDF |
PDF |
| inverse of 4x4 matrix (AP-928) |
245043-001 |
03/99 |
PDF |
PDF |
| inverse of 6x6 matrix (AP-929) |
245044-001 |
03/99 |
PDF |
PDF |
| matrix multiplication (AP-930) |
245045-001 |
06/99 |
PDF |
PDF |
| LU decomposition (AP-931) |
245046-001 |
06/99 |
PDF |
PDF |
| block copy using streaming SIMD extensions |
??? |
01/99 |
PDF |
PDF |
| streaming SIMD extensions and general vector operations |
??? |
02/99 |
PDF |
PDF |
| optimizing vector magnitude |
??? |
10/00 |
PDF |
PDF |
| SSE code macro file for use with Microsoft MASM assembler |
IAXMM.INC |
02/99 |
TXT |
TXT |
| streaming SIMD instructions missing on initial Pentium III stepping (A0) |
AMATHS.C |
08/99 |
TXT |
TXT |
SSE2 |
| title |
order # |
date |
on-line? (FILE or HTTP) |
| using SSE2 for SAXPY/DAXPY 2.0 (AP-935) |
248600-001 |
07/00 |
PDF |
PDF |
| using SSE2 for single-point crossovers in binary genetic algorithms 2.0 (AP-936) |
248601-001 |
07/00 |
PDF |
PDF |
| using SSE2 to find the max/min el. of a DP FP vector and its corr. index 2.0 (AP-937) |
248602-001 |
07/00 |
PDF |
PDF |
| using SSE2 in a DP 3D transform 2.0 (AP-939) |
248604-001 |
07/00 |
PDF |
PDF |
| using SSE2 for block-matching in motion estimation algorithms 2.0 (AP-940) |
248605-001 |
07/00 |
PDF |
PDF |
| using SSE2 to perform big multiplications 2.0 (AP-941) |
248606-001 |
07/00 |
PDF |
PDF |
| using SSE2 in motion compensation for video decoding and encoding 2.0 (AP-942) |
248607-001 |
07/00 |
PDF |
PDF |
| FP arithmetic using the FPU, SSE, and SSE2 2.0 (AP-943) |
248608-001 |
07/00 |
PDF |
PDF |
| perf. comp. of uniform NURBS surface tessellation on the P3 and P4 2.0 (AP-944) |
248609-001 |
07/00 |
PDF |
PDF |
| using SSE2 to implement an IDC transform 2.0 (AP-945) |
248670-001 |
07/00 |
PDF |
PDF |
| using SSE2 to evaluate a hidden Markov model with Viterbi decoding 2.0 (AP-946) |
248671-001 |
07/00 |
PDF |
PDF |
| using SSE and SSE2: misconceptions and reality |
??? |
03/01 |
PDF |
PDF |
| using spin-loops on P4 and P4 Xeon processors 2.1 (AP-949) |
248674-002 |
05/01 |
PDF |
PDF |