Efficient static checker for tainted variable attacks. (English) Zbl 1410.68077

Summary: Tainted flow attacks originate from program inputs maliciously crafted to exploit software vulnerabilities. These attacks are common in server-side scripting languages, such as PHP. In 1997, P. Ørbæk and J. Palsberg [J. Funct. Program. 7, No. 6, 557–591 (1997; Zbl 0918.03013)] formalized the problem of detecting these exploits as an instance of type-checking, and gave an \(O(V^{3})\) algorithm to solve it, where \(V\) is the number of program variables. A similar algorithm was, ten years later, implemented in the Pixy tool [N. Jovanovic et al., “Pixy: a static analysis tool for detecting web application vulnerabilities”, in: Proceedings of the 2006 IEEE symposium on security and privacy, S&P ’06. Los Alamitos, CA: IEEE Computer Society. 258–263 (2006; doi:10.1109/SP.2006.29)]. In this paper we give an \(O(V^{2})\) solution to the same problem. Our solution uses R. Bodik et al.’s [“ABCD: eliminating array bounds checks on demand”, in: Proceedings of the ACM SIGPLAN 2000 conference on programming language design and implementation, PLDI ’00. New York, NY: Association for Computing Machinery (ACM). 321–333 (2000; doi:10.1145/358438.349342)] extended Static Single Assignment (e-SSA) program representation. The e-SSA form can be computed efficiently, and it enables us to solve the problem via a sparse dataflow analysis. Using the same infrastructure, we compared a state-of-the-art dataflow solution with our technique. Both approaches detected 36 vulnerabilities in well-known PHP programs. Our results show that our approach tends to outperform the dataflow algorithm for larger inputs. We have reported the new bugs that we found, and an implementation of our algorithm is publicly available at https://github.com/rimsa/tainted-phc.git.
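
The idea of a sparse analysis can be illustrated with a small sketch. It is not the authors' implementation: it assumes the program has already been reduced to a dependence graph whose nodes are variables (as an SSA-like form would provide) and treats taint detection as plain graph reachability. The function name `tainted_sinks`, the edge-list encoding, and the example node names are all hypothetical. Each edge is followed at most once, and a graph over \(V\) variables has at most \(V^{2}\) edges, which gives one way to see why a quadratic bound is plausible.

```python
from collections import defaultdict, deque


def tainted_sinks(edges, sources, sinks, sanitizers=frozenset()):
    """Return the sinks reachable from a source without passing a sanitizer.

    Hypothetical encoding: `edges` is a list of (from, to) pairs meaning
    "the value of `from` flows into `to`".
    """
    # Build the adjacency list of the dependence graph.
    succ = defaultdict(list)
    for src, dst in edges:
        succ[src].append(dst)

    # Worklist propagation: every node enters the queue at most once.
    tainted = {s for s in sources if s not in sanitizers}
    work = deque(tainted)
    while work:
        node = work.popleft()
        for nxt in succ[node]:
            # A sanitizer stops the flow; a node already marked is skipped.
            if nxt in sanitizers or nxt in tainted:
                continue
            tainted.add(nxt)
            work.append(nxt)

    return sorted(tainted & set(sinks))


# Illustrative graph: user input reaches `query` directly,
# but reaches `echo` only through the sanitizer `clean`.
example_edges = [
    ("get_id", "x1"),
    ("x1", "x2"),
    ("x2", "query"),
    ("x1", "clean"),
    ("clean", "echo"),
]
result = tainted_sinks(example_edges, {"get_id"}, {"query", "echo"}, {"clean"})
print(result)
```

In this example only `query` is reported, because the path to `echo` is cut by the sanitizer node.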


68N20 Theory of compilers and interpreters
68Q25 Analysis of algorithms and problem complexity


Zbl 0918.03013


F4F; AMNESIA; Pixy; GitHub; PHP; TAJ