x86 Instruction Set Reference
MOVSHDUP
Move Packed Single-FP High and Duplicate
| Opcode | Mnemonic | Description |
|---|---|---|
F3 0F 16 /r |
MOVSHDUP xmm1, xmm2/m128 |
Move two single-precision floating-point values from the higher 32-bit operand of each qword in xmm2/m128 to xmm1 and duplicate each 32-bit operand to the lower 32-bits of each qword. |
| Description |
|---|
|
The linear address corresponds to the address of the least-significant byte of the referenced memory data. When a memory address is indicated, the 16 bytes of data at memory location m128 are loaded and the single-precision elements in positions 1 and 3 are duplicated. When the register-register form of this operation is used, the same operation is performed but with data coming from the 128-bit source register. |
| Operation |
|---|
if(source == m128) { //load instruction xmm1[0..31] = m128[32..63]; xmm1[32..63] = m128[32..63] xmm1[64..95] = m128[96..127]; xmm1[96..127] = m128[96..127]; } else { //move instruction xmm1[0..31] = xmm2[32..63]; xmm1[32..63] = xmm2[32..63]; xmm1[64..95] = xmm2[96..127]; xmm1[96..127] = xmm2[96..127]; } |
| Exceptions |
|---|
| General protection exception if not aligned on 16-byte boundary, regardless of segment. |
| Numeric Exceptions |
|---|
| None. |
| Protected Mode Exceptions | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
|
| Real-Address Mode Exceptions | ||||||
|---|---|---|---|---|---|---|
|
| Virtual-8086 Mode Exceptions | ||||||||
|---|---|---|---|---|---|---|---|---|
|
| Instruction | Latency | Throughput | Execution Unit |
|---|---|---|---|
CPUID | 0F3n | 0F3n | 0F3n |
MOVSHDUP xmm1, xmm2 | 6 | 2 | FP_MOVE |