Taint analysis in Double-Precision Floating-Point instruction #471

pfsun · 2017-01-18T16:15:24Z

@JonathanSalwan For taint analysis of double-precision floating-point instructions (e.g. subsd, mulsd, addsd, divsd), I first have to provide the correct semantics, and only then can I do taint analysis on these instructions, right?

JonathanSalwan · 2017-01-18T16:21:03Z

Right, but I'm very curious about how you will provide good semantics :). If you have ideas, go ahead!

pfsun · 2017-01-18T16:33:25Z

Actually, I don't have a good idea yet. I just wanted to confirm with you first, and then explore and try possible approaches. Do you have any tips to share? I see you have built the semantics for movsd.

JonathanSalwan · 2017-01-18T16:38:39Z

Actually, if you only want taint analysis, you can just call taintAssignment, taintUnion, etc. But if you want symbolic execution, you have to define the semantics. The problem is described here.
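
To illustrate the difference between the two calls mentioned above, here is a toy model in plain Python. It does not use Triton: `ToyTaintEngine` and its method names are hypothetical, and the sketch only assumes the usual meaning of the two operations (an assignment overwrites the destination's taint with the source's, a union keeps the destination tainted if either operand is).

```python
# Toy taint engine: a set of tainted location names.
# ToyTaintEngine and its methods are illustrative, not Triton's classes.
class ToyTaintEngine:
    def __init__(self):
        self.tainted = set()

    def is_tainted(self, loc):
        return loc in self.tainted

    def taint_assignment(self, dst, src):
        """dst takes the taint state of src (dst is fully overwritten)."""
        if self.is_tainted(src):
            self.tainted.add(dst)
        else:
            self.tainted.discard(dst)
        return self.is_tainted(dst)

    def taint_union(self, dst, src):
        """dst stays tainted if it already was, or becomes tainted if src is."""
        if self.is_tainted(src):
            self.tainted.add(dst)
        return self.is_tainted(dst)


engine = ToyTaintEngine()
engine.tainted.add("xmm0")

# An arithmetic instruction such as addsd reads both operands,
# so the destination keeps its taint even though xmm1 is clean.
union_result = engine.taint_union("xmm0", "xmm1")

# An instruction that overwrites the whole destination replaces its taint.
assign_result = engine.taint_assignment("xmm0", "xmm1")
```

In this model a union is the natural choice for addsd, subsd, mulsd and divsd, because the result depends on both the destination and the source.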

pfsun · 2017-01-18T16:43:25Z

I know about the problem for symbolic execution. For now I only need taint analysis, not symbolic execution. I will try taintAssignment and taintUnion. Thanks Jonathan.

JonathanSalwan · 2017-01-18T16:46:06Z

Okay. Below is an example of tainting for addsd:

+      void x86Semantics::addsd_s(triton::arch::Instruction& inst) {
+        auto& dst = inst.operands[0];
+        auto& src = inst.operands[1];
+
+        /* Spread taint */
+        this->taintEngine->taintUnion(dst, src);
+
+        /* Update the symbolic control flow */
+        this->controlFlow_s(inst);
+      }
+
+

pfsun · 2017-01-18T19:53:50Z

Great. I have to go to a class now; I will try this and let you know. But it looks like isTainted() also checks the symbolic expression and therefore needs the semantics, right?

JonathanSalwan · 2017-01-18T20:11:19Z

Indeed. You have to set instruction->tainted to true. That probably needs a patch.

JonathanSalwan · 2017-01-18T20:17:09Z

Please git pull origin dev-next. Then:

+      void x86Semantics::addsd_s(triton::arch::Instruction& inst) {
+        auto& dst = inst.operands[0];
+        auto& src = inst.operands[1];
+
+        /* Spread taint */
+        inst.setTaint(this->taintEngine->taintUnion(dst, src));
+
+        /* Update the symbolic control flow */
+        this->controlFlow_s(inst);
+      }
+
+
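
As a rough illustration of the pattern in the patch above (the handler stores the result of the union on the instruction itself), here is a toy model in plain Python. `ToyInstruction`, `set_taint` and `scalar_double_handler` are hypothetical names for this sketch and are not Triton's API; the trace is made up.

```python
# Toy model: each instruction carries its own "tainted" flag, which the
# handler sets from the value returned by the taint union.
class ToyInstruction:
    def __init__(self, mnemonic, dst, src):
        self.mnemonic = mnemonic
        self.dst = dst
        self.src = src
        self.tainted = False

    def set_taint(self, state):
        self.tainted = bool(state)


def taint_union(tainted, dst, src):
    """Taint dst if src is tainted; return the resulting state of dst."""
    if src in tainted:
        tainted.add(dst)
    return dst in tainted


def scalar_double_handler(inst, tainted):
    """Shared taint-only handler for addsd, subsd, mulsd and divsd."""
    inst.set_taint(taint_union(tainted, inst.dst, inst.src))


tainted_regs = {"xmm1"}
trace = [
    ToyInstruction("addsd", "xmm0", "xmm1"),
    ToyInstruction("mulsd", "xmm2", "xmm3"),
    ToyInstruction("subsd", "xmm2", "xmm0"),
    ToyInstruction("divsd", "xmm4", "xmm5"),
]
for inst in trace:
    scalar_double_handler(inst, tainted_regs)

flags = [(inst.mnemonic, inst.tainted) for inst in trace]
```

Without the flag on the instruction, a later query at instruction level would have nothing to report even though the register-level taint was propagated correctly.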

pfsun · 2017-01-18T20:44:39Z

The taint works perfectly now. Thanks for your help.

pfsun · 2017-01-18T21:13:48Z

Thanks Jonathan. Now I can taint the four floating-point instructions. One more thing: I don't need symbolic execution for floating-point instructions, but I do need the semantic expressions for them. Do you plan to add them at some point? I am also exploring this now.

JonathanSalwan · 2017-01-19T08:49:10Z

> Do you plan to add them at some point?

It's a long-term goal, but it's a bit complex to do and I have so many easier things to do first :)

pfsun · 2017-02-17T19:18:04Z

Hi @JonathanSalwan, I have started to explore SUBSD xmm1, xmm2/m64, and I am stuck on the SUBSD xmm1, m64 form.
e.g.
4050a0: subsd xmm0, qword ptr [rbp - 0x60]
ref_87 = (ref_85 - ref_31) # SUBSD operation //Simplified Symbolic Expression
SymVar_0:128 for xmm0, SymVar_1:64 for qword ptr [rbp - 0x60]
I convert xmm0 to the symbolic variable SymVar_0 and the memory operand to the symbolic variable SymVar_1.
When I call getFullAst(), I only get SymVar_0 and SymVar_1 is missing, whereas the correct output should be SymVar_0 - SymVar_1. I suspect the cause is the size difference between xmm1 (128 bits) and m64 (64 bits). How can I avoid this problem? Thanks.
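
For reference, the concrete behaviour of the legacy-encoded SUBSD xmm1, m64 can be sketched in plain Python: only the low 64 bits of the destination take part in the subtraction (as an IEEE-754 double), and the upper 64 bits of the destination are left unchanged. This is a minimal sketch that ignores MXCSR rounding modes and floating-point exceptions; the helper names and the sample values are made up for the illustration.

```python
import struct

MASK64 = (1 << 64) - 1


def bits_to_double(bits):
    """Reinterpret a 64-bit integer as an IEEE-754 double."""
    return struct.unpack("<d", struct.pack("<Q", bits & MASK64))[0]


def double_to_bits(value):
    """Reinterpret an IEEE-754 double as a 64-bit integer."""
    return struct.unpack("<Q", struct.pack("<d", value))[0]


def subsd(xmm_dst, src64):
    """Concrete SUBSD: xmm_dst is a 128-bit integer, src64 a 64-bit integer.

    The low double of the destination is replaced by (low - src);
    bits 127..64 of the destination are preserved.
    """
    low = bits_to_double(xmm_dst & MASK64) - bits_to_double(src64)
    upper = xmm_dst >> 64
    return (upper << 64) | double_to_bits(low)


# Sample values: the upper half holds an arbitrary marker pattern.
xmm0 = (0xDEADBEEF << 64) | double_to_bits(5.0)
mem = double_to_bits(1.5)
result = subsd(xmm0, mem)
```

This is why the two operands have different widths in the question: the register operand is 128 bits wide, while the operation itself only involves 64 of them.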

JonathanSalwan · 2017-02-18T10:36:09Z

Paste your code.

pfsun · 2017-02-18T18:09:09Z

For now, I simply copied the code from sub_s for the SUBSD instruction and disabled the check in BvsubNode::init that verifies the two nodes have the same size. I know subsd_s should differ from sub_s; this is just a first attempt that I will change as I learn more.

 void subsd_s(triton::arch::Instruction& inst) {
          auto& dst = inst.operands[0];
          auto& src = inst.operands[1];
          //auto bvSize = src.getBitSize();

          /* Create symbolic operands */
          auto op1 = triton::api.buildSymbolicOperand(inst, dst);
          auto op2 = triton::api.buildSymbolicOperand(inst, src);

          /* Create the semantics */
          auto node = triton::ast::bvsub(op1, op2);

          /* Create symbolic expression */
          auto expr = triton::api.createSymbolicExpression(inst, node, dst, "SUBSD operation");

          /* Spread taint */
          expr->isTainted = triton::api.taintUnion(dst, src);

          /* Update symbolic flags */
          triton::arch::x86::semantics::af_s(inst, expr, dst, op1, op2);
          triton::arch::x86::semantics::cfSub_s(inst, expr, dst, op1, op2);
          triton::arch::x86::semantics::ofSub_s(inst, expr, dst, op1, op2);
          triton::arch::x86::semantics::pf_s(inst, expr, dst);
          triton::arch::x86::semantics::sf_s(inst, expr, dst);
          triton::arch::x86::semantics::zf_s(inst, expr, dst);

          /* Update the symbolic control flow */
          triton::arch::x86::semantics::controlFlow_s(inst);
        }

For instruction: 4050a0: subsd xmm0, qword ptr [rbp - 0x60]

In my script, in the callback registered with insertCall(cbefore, INSERT_POINT.BEFORE), I do the following:

for expr in inst.getSymbolicExpressions():
    print expr
    print "AST", expr.getAst()
print
if inst.getAddress() == 0x4050a0:
    var1 = convertRegisterToSymbolicVariable(REG.XMM0)
    print var1  ### SymVar_0:128
    memaddress = inst.getOperands()[1].getAddress()
    var2 = convertMemoryToSymbolicVariable(MemoryAccess(memaddress, CPUSIZE.QWORD))
    print var2  ### SymVar_1:64

The result looks good:

**ref!87** = ((_ zero_extend 0) (bvsub ((_ extract 127 0) ref!85) (concat ((_ extract 7 0) ref!31) ((_ extract 7 0) ref!32) ((_ extract 7 0) ref!33) ((_ extract 7 0) ref!34) ((_ extract 7 0) ref!35) ((_ extract 7 0) ref!36) ((_ extract 7 0) ref!37) ((_ extract 7 0) ref!38)))) ; SUBSD operation
**AST** ((_ zero_extend 0) (bvsub ((_ extract 127 0) ref!85) (concat ((_ extract 7 0) ref!31) ((_ extract 7 0) ref!32) ((_ extract 7 0) ref!33) ((_ extract 7 0) ref!34) ((_ extract 7 0) ref!35) ((_ extract 7 0) ref!36) ((_ extract 7 0) ref!37) ((_ extract 7 0) ref!38))))

In the callback registered with insertCall(cafter, INSERT_POINT.AFTER), I also print the expression and getAst(). The result is:

ref!87 = SymVar_0 ; SUBSD operation
SymVar_0

Based on this result, I wonder whether the problem lies in the symbolic simplification callback:
createSymbolicExpression -> createSymbolicRegisterExpression -> newSymbolicExpression -> processSimplification.

If you need my script and test binary, I can send them to you later. But I think the main problem is on the symbolic side. Thanks.
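
As a side note on the size mismatch discussed in this thread, one way to keep bit-vector widths consistent without disabling the size check is to extract the low 64 bits of the 128-bit operand, operate on those, and concatenate the untouched upper half back. The sketch below is a toy AST builder in plain Python, not Triton's `triton::ast` API: `Node`, `var`, `extract`, `concat` and `bvsub` are hypothetical helpers that only track widths and render SMT-LIB-like text. The integer `bvsub` stands in as a placeholder, as in the experiment above; it is not a floating-point subtraction, and the sketch makes no claim about why the simplified expression collapsed to SymVar_0.

```python
# Toy width-checked bit-vector AST (illustrative only, not Triton's classes).
class Node:
    def __init__(self, text, size):
        self.text = text
        self.size = size


def var(name, size):
    return Node(name, size)


def extract(high, low, node):
    """Select bits high..low of node (inclusive)."""
    if not (0 <= low <= high < node.size):
        raise ValueError("extract range outside operand")
    return Node("((_ extract %d %d) %s)" % (high, low, node.text), high - low + 1)


def concat(hi, lo):
    return Node("(concat %s %s)" % (hi.text, lo.text), hi.size + lo.size)


def bvsub(a, b):
    """Integer bit-vector subtraction; both operands must have the same width."""
    if a.size != b.size:
        raise ValueError("bvsub operands differ in size: %d vs %d" % (a.size, b.size))
    return Node("(bvsub %s %s)" % (a.text, b.text), a.size)


xmm0 = var("SymVar_0", 128)
mem = var("SymVar_1", 64)

# Subtracting the raw operands is rejected because the widths differ.
try:
    bvsub(xmm0, mem)
    mismatch_error = None
except ValueError as exc:
    mismatch_error = str(exc)

# Operate on the low 64 bits only, then rebuild a 128-bit result.
low = bvsub(extract(63, 0, xmm0), mem)
full = concat(extract(127, 64, xmm0), low)
```

The resulting expression is 128 bits wide and mentions both variables, which matches the shape of a destination-sized result for an xmm register.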

JonathanSalwan added the Discussion label Jan 18, 2017

JonathanSalwan closed this as completed Jan 21, 2017
