- Multifunkciós adattároló érkezett a Patriot színeiben
- Egyedi megjelenésű Radeon VGA a Yeston és a GravaStar keze nyomán
- 5:4 képarányú SXGA monitor jön ősszel az EIZO berkeiből
- Úgy tér vissza a Commodore 64, ahogy titkon mindenki várja
- A hírek szerint a jövő év végére érkezhet a sokat pletykált NVIDIA N1X SoC
- Milyen belső merevlemezt vegyek?
- TCL LCD és LED TV-k
- nVidia GeForce GTX 1070 / 1070 Ti / 1080 (GP104)
- Elképesztően drága az új Ryzen Threadripper PRO generáció
- Iszonyatos mennyiségű hulladékkal járhat a Windows 10 terméktámogatásának vége
- AMD Ryzen 9 / 7 / 5 9***(X) "Zen 5" (AM5)
- 3D nyomtatás
- Vezetékes FEJhallgatók
- Házi barkács, gányolás, tákolás, megdöbbentő gépek!
- Fujifilm X
Új hozzászólás Aktív témák
-
S_x96x_S
addikt
néha érdemes újraolvasgani ...
The microarchitecture of Intel, AMD and VIA CPUs
By Agner Fog. Technical University of Denmark.
Copyright © 1996 - 2020. Last updated 2020-03-08.https://www.agner.org/optimize/microarchitecture.pdf
22.3 The AMD Zen kernel
The Ryzen processor put AMD back in the game after they had lagged behind Intel for
several years. The Zen kernel has a throughput of five instructions per clock cycle, which is
the record so far. The throughput is particularly high for 128-bit vector code. The Ryzen can
calculate four 128-bit floating point vectors per clock cycle, or two 256-bit vectors.
The high throughput places a higher burden on programmers and compilers to utilize the
increased instruction level parallelism in single threaded applications. The core throughput
is so high that it makes good sense to run two threads per core, unlike some other
processors with less core throughput which are likely to see a serious performance drop
due to the two threads competing for the limited resources.
The new μop cache is an important improvement which removes the bottleneck of
instruction fetching and decoding in most of the critical loops.
The large caches at all levels is a particularly important improvement. But the cache
bandwidth is limited to 32 bytes per clock which is less than the best competing Intel
processors.-------------
For many years, the RISC philosophy has been considered the best way to high
performance. The Intel Core microarchitecture as well as the AMD design indicates a new
trend away from RISC and back to the CISC principle. The RISC-like design of the P4 with
its very long pipeline and long execution latencies was not convincing, and the trace cache
appears to have been inefficient. The advantages of going back to CISC design are
threefold:
1. The compact CISC code gives better utilization of the limited code cache area.
2. Fewer μops per instruction gives higher bandwidth in the pipeline.
3. Fewer μops per instruction gives lower power consumption.
The main disadvantage of a CISC design is that instruction decoding becomes a bottleneck.
The AMD processor has a higher decoding bandwidth than the Intel design because of the
technique of storing instruction boundaries in the code cache. The Intel design is still limited
to a decoding rate of 16 bytes of code per clock cycle, which is insufficient in many cases.-------------
"There is a remarkable convergence between the Intel and AMD microarchitectures thanks
to a patent sharing agreement between the two companies. Intel's stack engine and the
mechanism for predicting indirect branches have been copied by AMD. We can also expect
Intel to some day match AMD's 32-bytes instruction fetch rateThere is unfortunately not always convergence on the instruction set extensions. Intel's
SSE4.1 and SSE4.2 instruction sets are very different from AMD's SSE4.A and XOP
(formerly known as SSE5), and the intersection between these two sets is quite small. Intel
has never copied AMD's 3DNow instruction set which is now obsolete, but they have copied
the successful x64 extension from AMD and a few other instructions. AMD has traditionally
copied all Intel instructions, but sometimes with a lag of several years. Fortunately, AMD
has revised their proposed SSE5 instruction set to make it compatible with the AVX coding
scheme (as I have previously argued that it would be wise of them to do). The latest AMD
processors now support Intel's AVX2 instruction set, including the 256-bit YMM vector
registers. It may take several years before AMD supports AVX512.
"
Új hozzászólás Aktív témák
- Milyen belső merevlemezt vegyek?
- TCL LCD és LED TV-k
- Háztartási gépek
- PlayStation 5
- Hyundai, Kia topik
- Reklámblokkolók topikja
- gban: Ingyen kellene, de tegnapra
- One otthoni szolgáltatások (TV, internet, telefon)
- nVidia GeForce GTX 1070 / 1070 Ti / 1080 (GP104)
- Elképesztően drága az új Ryzen Threadripper PRO generáció
- További aktív témák...
- Új Honor 400 pro + 66 wattos töltőfej
- ASUS TUF Gaming A620M-Plus + AMD Ryzen 5 8400F + 16GB 5600MHz Kingston Fury Beast + hűtő
- Eladó Erős Gamer PC - Akár Cyberpunk 2077-re Is Készen! (Ryzen 3 2200G RX 570 8GB 32GB RAM AIO
- MacBook Pro 2019 '16 2,6 Ghz 6mag Intel Core i7 16/512gb - Space Gray Touch Bar
- Xiaomi 13T 256GB, Kártyafüggetlen, 1 Év Garanciával
- PS3 Játékok 1500Ft/db - RÉSZLETEK A LEÍRÁSBAN
- AKCIÓ! MSI B450 R5 5500 16GB DDR4 512GB SSD RTX 2060 Super 8GB GDDR6 Rampage Shiva Zalman 500W
- BESZÁMÍTÁS! Samsung Galaxy S24 Ultra 5G 256GB okostelefon garanciával hibátlan működéssel
- Bomba ár! HP ProBook 440 G7 - i5-10GEN I 8GB I 256SSD I HDMI I 14" FHD I Cam I W11 I Gar
- Wilbur Smith könyvek (15 db) egyben
Állásajánlatok
Cég: PCMENTOR SZERVIZ KFT.
Város: Budapest