G
geno_cyber
I'm pretty sure I read somewhere that the (new & improved) Prescotty
processor has been given a special hard wired instruction set which is
dedicated to encoding video, so that should speed things up some what.
I remember reading an article over a year ago which had Intel giving a
demo of a future release CPU which apparently was running 3 full screen
HD videos simultaneously rotating in a 3d cube. The processor prototype
was not specified, but it may have been a Tejas as it was rated at 5GHz.
SSE3 won't make Intel CPUs as fast as dedicated DSPs for video encoding. It can be an improvement
over SSE and SSE2 but it's still not fast enough.
They should have embedded a full DSP (or more than one) inside CPUs to achieve the same performance.
SSE subsets are still too much tied to the general purpose x86 architecture and their efficiency
it's poor compared to dedicated DSPs.
A $40-50 floating point DSP can be 3x times faster than any SSE3 capable CPU at MPEG2/MPEG4
encoding.
If it's true that Nvidia has designed the NV40 as a full DSP then it's just a matter of time and SDK
availability to let programmers access the NV40 DSP thru DirectX or other dedicated APIs before
known Codecs such as DiVX would be able to take advantage of GPU power.
The only problem is that Nvidia needs a mainstream set of GPUs derived from this one with MPEG
encoding/decoding on the market ASAP to set a standard, before ATI releases its own DSP GPUs with
MPEG encoding/decoding capability.
If the MPEG encoding/decoding in NV40 were fixed in hardware, hardwired then it would be a pretty
low quality implementation and I really hope that the claims that the GPU it's a full DSP are true
so that programmers with DSP experience could upload their own filters codes onto the GPU DSP to
perform their own MPEG video encoding.
I also hope that the SDK to access DSP features and reprogram MPEG video encoding would be free so
that even non-commercial, freeware encoders could be available in the future to further exploit GPU
capabilities.