# CMPPS
# Compare Packed Single-Precision Floating-Point Values
Opcode | Mnemonic | Description |
---|---|---|
0F C2 /r ib | CMPPS xmm1, xmm2/m128, imm8 | Compare packed single-precision floating-point values in xmm2/mem and xmm1 using imm8 as comparison predicate. |
# Description
Performs an SIMD compare of the four packed single-precision floating-point values in the source operand (second operand) and the destination operand (first operand) and returns the results of the comparison to the destination operand. The comparison predicate operand (third operand) specifies the type of comparison performed on each of the pairs of packed values. The result of each comparison is a doubleword mask of all 1s (comparison true) or all 0s (comparison false).

The source operand can be an XMM register or a 128-bit memory location. The destination operand is an XMM register. The comparison predicate operand is an 8-bit immediate the first 3 bits of which define the type of comparison to be made (see the following table); bits 4 through 7 of the immediate are reserved.

The unordered relationship is true when at least one of the two source operands being compared is a NaN; the ordered relationship is true when neither source operand is a NaN. A subsequent computational instruction that uses the mask result in the destination operand as an input operand will not generate a fault, because a mask of all 0s corresponds to a floating-point value of +0.0 and a mask of all 1s corresponds to a QNaN.

Some of the comparisons listed in the following table (such as the greater-than, greater-than-or-equal, not-greater-than, and not-greater-than-or-equal relations) can be made only through software emulation. For these comparisons the program must swap the operands (copying registers when necessary to protect the data that will now be in the destination), and then perform the compare using a different predicate. The predicate to be used for these emulations is listed in the following table under the heading Emulation.

Compilers and assemblers may implement the following two-operand pseudo-ops in addition to the three-operand CMPPS instruction. See the following table.

Pseudo-Op | Implementation |
---|---|
CMPEQPS xmm1, xmm2 | CMPPS xmm1, xmm2, 0 |
CMPLTPS xmm1, xmm2 | CMPPS xmm1, xmm2, 1 |
CMPLEPS xmm1, xmm2 | CMPPS xmm1, xmm2, 2 |
CMPUNORDPS xmm1, xmm2 | CMPPS xmm1, xmm2, 3 |
CMPNEQPS xmm1, xmm2 | CMPPS xmm1, xmm2, 4 |
CMPNLTPS xmm1, xmm2 | CMPPS xmm1, xmm2, 5 |
CMPNLEPS xmm1, xmm2 | CMPPS xmm1, xmm2, 6 |
CMPORDPS xmm1, xmm2 | CMPPS xmm1, xmm2, 7 |

The greater-than relations not implemented by the processor require more than one instruction to emulate in software and therefore should not be implemented as pseudo-ops. (For these, the programmer should reverse the operands of the corresponding less-than relations and use move instructions to ensure that the mask is moved to the correct destination register and that the source operand is left intact.)
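The predicate encoding and the operand-swap emulation described above can be sketched in portable C. This is a minimal scalar model under stated assumptions, not the processor's implementation: `cmpps_predicate`, `cmpps_model`, and `cmpgtps_model` are hypothetical names introduced here for illustration, the model only looks at the low three bits of the immediate, and floating-point exception signaling is not modeled.

```c
#include <math.h>
#include <stdint.h>

/* Hypothetical helper: evaluate one CMPPS predicate (low 3 bits of imm8)
 * on a single pair of values. a comes from the destination operand,
 * b from the source operand. */
static int cmpps_predicate(float a, float b, unsigned imm8)
{
    int unordered = isnan(a) || isnan(b);

    switch (imm8 & 7u) {
    case 0:  return a == b;      /* EQ:    false when unordered */
    case 1:  return a < b;       /* LT:    false when unordered */
    case 2:  return a <= b;      /* LE:    false when unordered */
    case 3:  return unordered;   /* UNORD                       */
    case 4:  return !(a == b);   /* NEQ:   true when unordered  */
    case 5:  return !(a < b);    /* NLT:   true when unordered  */
    case 6:  return !(a <= b);   /* NLE:   true when unordered  */
    default: return !unordered;  /* ORD                         */
    }
}

/* Hypothetical model of CMPPS: one doubleword mask per lane,
 * all 1s when the comparison is true, all 0s when it is false. */
static void cmpps_model(uint32_t out[4], const float dst[4],
                        const float src[4], unsigned imm8)
{
    for (int i = 0; i < 4; i++)
        out[i] = cmpps_predicate(dst[i], src[i], imm8) ? 0xFFFFFFFFu : 0u;
}

/* Hypothetical greater-than emulation: swap the operands and use
 * the less-than predicate (1), since a > b exactly when b < a. */
static void cmpgtps_model(uint32_t out[4], const float a[4], const float b[4])
{
    cmpps_model(out, b, a, 1);
}
```

In this model the swap happens in the argument order, so neither input array is modified; in real code the same effect needs a register copy whenever the original destination value must survive the compare.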
# Operation
switch(ComparisonPredicate) {
case 0:
Operator = OperatorEqual;
break;
case 1:
Operator = OperatorLessThan;
break;
case 2:
Operator = OperatorLessOrEqual;
break;
case 3:
Operator = OperatorUnordered;
break;
case 4:
Operator = OperatorNotEqual;
break;
case 5:
Operator = OperatorNotLessThan;
break;
case 6:
Operator = OperatorNotLessOrEqual;
break;
case 7:
Operator = OperatorOrdered;
break;
}
CMP0 = Destination[0..31] Operator Source[0..31];
CMP1 = Destination[32..63] Operator Source[32..63];
CMP2 = Destination[64..95] Operator Source[64..95];
CMP3 = Destination[96..127] Operator Source[96..127];
if(CMP0 == true) Destination[0..31] = 0xFFFFFFFF;
else Destination[0..31] = 0;
if(CMP1 == true) Destination[32..63] = 0xFFFFFFFF;
else Destination[32..63] = 0;
if(CMP2 == true) Destination[64..95] = 0xFFFFFFFF;
else Destination[64..95] = 0;
if(CMP3 == true) Destination[96..127] = 0xFFFFFFFF;
else Destination[96..127] = 0;
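The two mask values written by the operation can be checked against the claim in the description that all 1s reads as a QNaN and all 0s as +0.0. The sketch below assumes `float` is a 32-bit IEEE 754 single, which holds on x86 targets; `mask_as_float` and `is_quiet_nan_bits` are hypothetical helper names, not part of any instruction set or library.

```c
#include <math.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: reinterpret a 32-bit lane mask as a float.
 * memcpy copies the bits unchanged; no numeric conversion happens. */
static float mask_as_float(uint32_t mask)
{
    float f;
    memcpy(&f, &mask, sizeof f);
    return f;
}

/* Hypothetical helper: true when the bit pattern encodes a quiet NaN,
 * i.e. the exponent field is all 1s and the top fraction bit is set. */
static int is_quiet_nan_bits(uint32_t bits)
{
    return (bits & 0x7F800000u) == 0x7F800000u   /* exponent all 1s    */
        && (bits & 0x00400000u) != 0u;           /* quiet bit (bit 22) */
}
```

With these helpers, `is_quiet_nan_bits(0xFFFFFFFFu)` is true and `mask_as_float(0u)` compares equal to `0.0f` with a clear sign bit, which is why feeding a mask into a later floating-point instruction does not fault.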
# SIMD Floating-Point Exceptions
Invalid if an operand is an SNaN; Invalid if an operand is a QNaN and the predicate is one listed in the above table; Denormal.
# Protected Mode Exceptions
# Real-Address Mode Exceptions
# Virtual-8086 Mode Exceptions
Same exceptions as in Real-Address Mode.
#PF(fault-code) | For a page fault. |
Instruction | Latency | Throughput | Execution Unit |
---|---|---|---|
CPUID | 0F3n/0F2n/069n | 0F3n/0F2n/069n | 0F2n |
CMPPS xmm, xmm | 5/4/4 | 2/2/2 | FP_ADD |