Automating Vulnerability Discovery in Critical Applications

Created September 2017

Vulnerabilities are too pervasive in software-based systems to find them all manually. CERT researchers develop automated tools that discover and mitigate software vulnerabilities and transfer them to security researchers, procurement specialists, and software vendors.

Automating Vulnerability Discovery

There are many places in the software lifecycle where software vulnerabilities can be discovered and mitigated. We develop new automated tools and techniques and put them in the hands of security researchers, procurement specialists, and software vendors to help them improve and evaluate the security of the software ecosystem used by the U.S. Department of Defense (DoD) and the U.S. government (USG).

These tools and techniques are intended to be used by vendors during software development. Still, vulnerabilities exist in delivered software. In a project called “Automating Vulnerability Discovery,” we focused on the ability to automatically discover vulnerabilities after software is developed and shipped, without assistance from the software vendor. This approach makes the problem more challenging because we must analyze programs at the binary level. But this requirement is critical in practice because many vendors are reluctant to share access to their products’ source code, which they regard as sensitive and proprietary.

There Are Too Many Vulnerabilities Not to Automate

Vulnerabilities are pervasive in software-based systems and protocols, both in traditional IT networks and in those that support critical U.S. infrastructure. Software is written and updated frequently, and there are not enough human analysts to keep pace with the speed of production. Automation is the only way to keep up.

Led by Dr. Edward Schwarz, the Vulnerability Discovery project aims to reduce vulnerabilities in critical DoD and USG systems by improving techniques for automated vulnerability discovery. If vendors adopt our techniques in their software development processes, the DoD will acquire software applications that are hardened—and more secure—before and after they are deployed into the DoD infrastructure.

Automating Vulnerability Discovery in Critical Applications

Our Approach: Automate Discovery, Prioritize Results, Address the Most Important Vulnerabilities First

Our approach to vulnerability discovery builds on an extensive body of work at Carnegie Mellon University and the SEI's CERT Division. This work includes the following techniques:

Black-box mutational fuzzing—Mutational fuzzing randomly introduces mutations into starting inputs called seed files. It then runs these modified inputs on the program to see if it causes a crash. Fuzzing is one of the most effective vulnerability discovery techniques in practice, and the SEI maintains its own black-box fuzzer, the CERT Basic Fuzzing Framework.
Vulnerability uniqueness determination—A primary challenge in vulnerability discovery is understanding which vulnerability triggered a crash and how serious that vulnerability is. For example, in one of our recent experiments, a major vendor's software produced over 40,000 exploitable crashes, but these crashes were caused by only a few vulnerabilities. We use a new technique that applies software patches to identify the scope of a vulnerability, which allows us to precisely count the number of vulnerabilities that a particular vulnerability discovery technique identifies.

Combining Fuzzing with Concolic Execution

ForAllSecure is a start-up company on a mission to make software safer. The ForAllSecure team builds software security tools for developers, enterprises, and end users that automatically find vulnerabilities in software at the binary level, using research transitioned from the Carnegie Mellon University CyLab.

ForAllSecure is one of the world’s leading experts on binary concolic execution and is widely known for its Mayhem concolic executor, which recently won DARPA’s Cyber Grand Challenge competition. Concolic execution is a powerful form of symbolic program analysis that describes program executions as logical formulas and solves them to trigger and test new fragments of program code. Concolic execution is known for its ability to trigger code that is difficult to reach, allowing it to find vulnerabilities that other techniques such as fuzzing might miss. This power comes with a price, however, as concolic execution tends to be slow and has not scaled to large programs.

By joining our expertise in fuzzing with ForAllSecure’s expertise in concolic execution, we have been collaborating on a new technique that brings the best of both fuzzing and concolic execution. Fuzzing brings scalability, speed, and the ability to discover vulnerabilities in large, complex programs. Concolic execution allows analysts to omit seed files, making the process of vulnerability discovery easier than ever. It also allows software analysts to test and detect vulnerabilities in code that is difficult for a fuzzer to reach.

Software and Tools

Big Grep

August 11, 2017

BigGrep is a tool used to index and search a large corpus of binary files and uses a probabilistic N-gram based approach to balance index size and search speed.

Learn More

CERT Tapioca

July 12, 2017

CERT Tapioca is a network-layer MITM proxy utility that checks for apps that fail to validate certificates and investigates content of network traffic, including HTTP and HTTPS.

Learn More

CERT BFF

October 05, 2016

CERT BFF is a software-testing tool that finds defects in applications that run on Microsoft Windows, Linux, Mac OS X, and other unix-like platforms.

Learn More

bgpuma

December 09, 2015

bgpuma is a tool that looks through BGP update files quickly to find direct matches for CIDR blocks and CIDR blocks that contain the initial set and are contained by the initial set.

Learn More

CERT Dranzer

June 01, 2015

Dranzer is a tool that enables users to examine effective techniques for fuzz testing ActiveX controls.

Learn More

CERT Triage Tools

May 05, 2014

CERT Triage Tools consist of a triage script and a GNU Debugger (GDB) extension named 'exploitable' that classify Linux application defects by severity.

Learn More

CERT FOE

September 23, 2013

Failure Observation Engine (FOE) is a mutational file-based fuzz testing tool for finding defects in applications that run on the Windows platform.

Learn More

CERT IPA

September 01, 2011

CERT IPA is an IP address annotation system that provides a repository of IP address information and related tools for accessing the data.

Learn More

Lessons Learned in Coordinated Disclosure for Artificial Intelligence and Machine Learning Systems

August 20, 2024 •White Paper

By
Allen D. Householder, Vijay S. Sarvepalli, Jeff Havrilla, Matt Churilla, Lena Pons, Shing-hon Lau, Nathan M. VanHoudnos, Andrew Kompanek, and Lauren McIlvenny

In this paper, the authors describe lessons learned from coordinating AI and ML vulnerabilities at the SEI's CERT/CC.

Read

Weaknesses and Vulnerabilities in Modern AI: AI Risk, Cyber Risk, and Planning for Test and Evaluation

August 12, 2024 •Blog Post

By
Bill Scherlis

Modern AI systems pose consequential, poorly understood risks. This blog post explores strategies for framing test and evaluation practices based on a holistic approach to AI risk.

READ

Weaknesses and Vulnerabilities in Modern AI: Integrity, Confidentiality, and Governance

August 05, 2024 •Blog Post

By
Bill Scherlis

In the rush to develop AI, it is easy to overlook factors that increase risk. This post explores AI risk through the lens of confidentiality, governance, and integrity.

READ

Weaknesses and Vulnerabilities in Modern AI: Why Security and Safety Are so Challenging

July 29, 2024 •Blog Post

By
Bill Scherlis

This post explores concepts of security and safety for neural-network-based AI, including ML and generative AI, as well as AI-specific challenges in developing safe and secure systems.

READ

Safeguarding Against Recent Vulnerabilities Related to Rust

June 28, 2024 •Podcast

By
David Svoboda

David Svoboda discusses two vulnerabilities related to Rust, their sources, and how to mitigate them.

Listen

Application Programming Interface (API) Vulnerabilities and Risks

June 24, 2024 •Special Report

By
McKinley Sconiers-Hasan

This report describes 11 common vulnerabilities and 3 risks related to application programming interfaces, providing suggestions about how to fix or reduce their impact.

Read

Improve Your AI Classifiers with AIR Using Causal Discovery, Identification, and Estimation

May 02, 2024 •Fact Sheet

The SEI has developed a new AI Robustness (AIR) tool that allows users to gauge AI and ML classifier performance with unprecedented confidence.

Learn More

What Recent Vulnerabilities Mean to Rust

April 29, 2024 •Blog Post

By
David Svoboda

In recent weeks several vulnerabilities have rocked the Rust community causing many to question its safety. This post examines two such vulnerabilities.

READ

Software Engineering Institute