DirectX Video Acceleration
DirectX Video Acceleration (DXVA) is a Microsoft API specification for the Microsoft Windows and Xbox 360 platforms that allows video decoding to be hardware accelerated. The pipeline allows certain CPU-intensive operations such as iDCT, motion compensation and deinterlacing to be offloaded to the GPU. DXVA 2.0 allows more operations, including video capturing and processing operations, to be hardware accelerated as well.
DXVA works in conjunction with the video rendering model used by the video card. DXVA 1.0, which was introduced as a standardized API with Windows 2000 and is currently available on Windows 98 or later, can use either the overlay rendering mode or VMR 7/9. DXVA 2.0, available only on Windows Vista, Windows 7, Windows 8 and later OSs, integrates with Media Foundation (MF) and uses the Enhanced Video Renderer (EVR) present in MF.
The DXVA is used by software video decoders to define a codec-specific pipeline for hardware-accelerated decoding and rendering of the codec. The pipeline starts at the CPU which is used for parsing the media stream and conversion to DXVA-compatible structures. DXVA specifies a set of operations that can be hardware accelerated and device driver interfaces (DDIs) that the graphic driver can implement to accelerate the operations. If the codec needs to do any of the defined operations, it can use these interfaces to access the hardware-accelerated implementation of these operations. If the graphic driver does not implement one or more of the interfaces, it is up to the codec to provide a software fallback for it. The decoded video is handed over to the hardware video renderer where further video post-processing might be applied to it before being rendered to the device. The resulting pipeline is usable in a DirectShow compatible application.
DXVA specifies the Motion Compensation DDI, which specifies the interfaces for iDCT operations, Huffman coding, motion compensation, alpha blending, inverse quantization, color space conversion and frame-rate conversion operations, among others. It also includes three sub-specifications: Deinterlacing DDI, COPP DDI and ProcAmp DDI. The Deinterlacing DDI specifies the callbacks for deinterlacing operations. The COPP (Certified Output Protection Protocol) DDI functions allow the pipeline to be secured for DRM-protected media, by specifying encryption functions. The ProcAmp DDI is used to accelerate post-processing video. The ProcAmp driver module sits between the hardware video renderer and the display driver and provides functions for applying post-processing filters on the decompressed video.
The functions exposed by DXVA DDIs are not accessible directly by a DirectShow client, but are supplied as callback functions to the video renderer. As such, the renderer plays a very important role in anchoring the pipeline.
DXVA on Windows Vista and later 
DXVA 2.0 enhances the implementation of the video pipeline and adds a host of other DDIs, including a Capture DDI for video capture. The DDIs it shares with DXVA 1.0 are also enhanced with the ability to use hardware acceleration of more operations. Also, the DDI functions are directly available to callers and need not be mediated by the video renderer. As such, a program can also create a pipeline for simply decoding the media (without rendering) or post-processing and rendering (without decoding). These features require the Windows Display Driver Model drivers, which limits DXVA 2.0 to Windows Vista, Windows Server 2008, Windows 7, Windows Server 2008 R2 and Windows 8. On Windows XP and Windows 2000, programs can use DVA 1.0. DXVA 2.0 allows Enhanced Video Renderer as the video renderer only on Vista, Windows 7, and Windows 8. (With Windows XP, DXVA-Rendering is possible with VMR9 and the well-known Overlay Mixer.) DXVA integrates with Media Foundation and allows DXVA pipelines to be exposed as Media Foundation Transforms (MFTs). Even decoder pipelines or post-processing pipelines can be exposed as MFTs, which can be used by the Media Foundation topology loader to create a full media playback pipeline. DXVA 1.0 is emulated using DXVA 2.0. DXVA 2.0 does not include the COPP DDI, rather it uses PVP for protected content. Windows 7 implements DXVA-HD if the driver complies with WDDM 1.1.
DXVA2 implementations: native and copy-back 
DXVA2 implementations come in two variants: native and copy-back.
With the native implementation, the decoded video stays in GPU memory until it has been displayed. The video decoder must be connected to the video renderer. There are to be no processing filter in between. The video renderer must also support DXVA, which give you less freedom of in the choice of the renderers.
With the copy-back implementation, the decoded video is copied from the GPU memory back to the CPU memory. This implementation doesn't have the limitations mentioned above, acts similar to a normal software decoder, but does require a GPU fast enough in copying data from the GPU memory back to the CPU memory; otherwise the result will be stuttering video playback.
Obviously, if a user has no need for customized processing, then it makes no sense to use copy-back mode as the copy-back operations increases GPU memory load.
GPUs that should be fast enough are:
- AMD: Radeon HD 6xxx and newer
- Nvidia: Nvidia GeForce 500 Series and newer
- Intel: Intel HD Graphics 2000 and newer
- Media Player Classic Home Cinema
- ffdshow-tryouts (Since Revision 3185)
- XBMC Media Center
- Microsoft Windows Vista/Windows 7/Windows 8 internal MPEG-2 decoder
- Nero Showtime / Nero MediaHub / Nero Kwik Media
- Windows Media Player 11 (WMV3 only)
- Windows Media Player 12
- Adobe Flash version 10.3
- DivX H.264 Decoder (version 1.2, part of the DivX Plus software)
- VLC media player (DXVA 2.0 only) (since version 1.1)
- CoreAVC (since version 2.5.0)
- Elecard MPEG-2 PlugIn for WMP and MPEG Player
- Freemake Video Converter (Since version 2.2 beta)
- Mirillis Splash 
- Daum PotPlayer 
See also 
- Nvidia PureVideo - the bit-stream technology from NVIDIA used in their graphics chips to accelerate video decoding on hardware GPU with DXVA.
- UVD (Unified Video Decoder) - is the video decoding bit-stream technology from ATI Technologies to use hardware (GPU) decode with DXVA.
- Intel Clear Video - Intel's hardware video decoding technology
- Media Foundation (and its Enhanced Video Renderer) which DXVA 2.0 uses.
- VDPAU (Video Decode and Presentation API for Unix)
- X-Video Bitstream Acceleration (XvBA), the X11 equivalent of DXVA for MPEG-2, H.264, and VC-1
- X-Video Motion Compensation, the X11 equivalent for MPEG-2 video codec only
- Video Acceleration API (VA API)
- Distributed Codec Engine (libcde) is a Texas Instruments API for the video codec engine in OMAP based embedded systems
- Video Decode Acceleration Framework is Apple Inc.s API for hardware-accelerated decoding of H.264 on Mac OS X
- VideoToolBox is an undocumented API from Apple Inc. for hardware-accelerated decoding on Apple TV and Mac OS X 10.5 or later.
- OpenVideo Decode (OVD) – a new open cross-platform video acceleration API from AMD.
- OpenMAX IL (Open Media Acceleration Integration Layer) - a royalty-free cross-platform media abstraction API from the Khronos Group
- "DirectX Video Acceleration 2.0". Retrieved 2007-10-24.
- "Introduction to DirectX VA". Retrieved 2007-10-24.
- "Microsoft DirectX Video Acceleration (DirectX VA) support". Retrieved 2007-10-24.
- "DirectX Video Acceleration". Retrieved 2007-10-24.
- "What's New in DirectShow". Retrieved 2007-10-24.
- ASRock's High-End Vision 3D 252B HTPC Review: HTPC Decoding & Rendering Benchmarks: madVR
- http://www.tuaw.com/2011/01/20/xbmc-for-ios-and-atv2-now-available/ XBMC for iOS and Apple TV now available
- http://developer.amd.com/gpu/AMDAPPSDK/assets/OpenVideo_Decode_API.PDF OpenVideo Decode (OVD) API
- DirectX Video Acceleration
- DXVAChecker, utility listing DXVA modes of which the given computer is capable