experimental: add test-to-harness conversion logic #495

DavidKorczynski · 2024-07-17T16:51:17Z

Adds a fuzz harness heuristic that relies on converting existing tests. At this stage, it's done without relying on FI, we simply (1) find tests files in the target project; (2) read them; (3) for each test file we use a simple prompt to convert it into a harness.

At this stage, it already out-performs on some existing projects, e.g: https://github.com/jkuhlmann/cgltf/blob/master/test/main.c

In this case, we have a harness generated that looks quite nice:

// Heuristic: TestConverterPrompt :: Target: 
#include <stdlib.h>
#include <stdint.h>
#include <stdio.h>
#include <string.h>

#define CGLTF_IMPLEMENTATION
#include "cgltf.h"

extern "C" int LLVMFuzzerTestOneInput(const uint8_t* data, size_t size) {
    if (size < 1) {
        return 0;
    }

    cgltf_options options;
	memset(&options, 0, sizeof(cgltf_options));
	cgltf_data* parsed_data = NULL;
	cgltf_result result;

    // Parse input data
    result = cgltf_parse(&options, data, size, &parsed_data);

    if (result == cgltf_result_success) {
        result = cgltf_validate(parsed_data);
    }

    if (result == cgltf_result_success) {
        // Use the parsed data in some way
        // For example, print file type and mesh count
		printf("Type: %u\n", parsed_data->file_type);
		printf("Meshes: %u\n", (unsigned)parsed_data->meshes_count);
    }

    cgltf_free(parsed_data);

    return 0;
}

Ref: #494

Signed-off-by: David Korczynski <[email protected]>

DavidKorczynski · 2024-07-17T16:54:49Z

/gcbrun skip

Signed-off-by: David Korczynski <[email protected]>

DavidKorczynski · 2024-07-17T19:54:48Z

/gcbrun skip

DonggeLiu

Thanks! Some nits:

experimental/c-cpp/manager.py

DonggeLiu · 2024-07-17T20:46:11Z

experimental/c-cpp/manager.py

+  name = 'TestConverterPrompt'
+
+  def __init__(self, introspector_report: Dict[str, Any],
+               all_header_files: List[str], test_dir: str):


nit: dict[...] and list[...] in lower cases. Same below.

Can't do, see here: #454 (comment)

experimental/c-cpp/manager.py

DonggeLiu · 2024-07-17T23:16:33Z

experimental/c-cpp/manager.py

+    # was found empirically to be valuable.
+    macros_defined_in_test = []
+    for line in test_case.test_content.split('\n'):
+      if '#define' in line and len(line.split(' ')) == 2:


Nit: Maybe exclude commented lines later?

// #define a b /* #define a b */

I'll leave this for now -- it works well but ultimately I would want some stronger logic for macros (e.g. using IR/AST stuff). Will leave as is and monitor if it shows up as a limitation.

experimental/c-cpp/manager.py

DavidKorczynski · 2024-07-18T10:29:49Z

/gcbrun skip

Signed-off-by: David Korczynski <[email protected]>

Adds a fuzz harness heuristic that relies on converting existing tests. At this stage, it's done without relying on FI, we simply (1) find tests files in the target project; (2) read them; (3) for each test file we use a simple prompt to convert it into a harness. At this stage, it already out-performs on some existing projects, e.g: https://github.com/jkuhlmann/cgltf/blob/master/test/main.c In this case, we have a harness generated that looks quite nice: ```c // Heuristic: TestConverterPrompt :: Target: #include <stdlib.h> #include <stdint.h> #include <stdio.h> #include <string.h> #define CGLTF_IMPLEMENTATION #include "cgltf.h" extern "C" int LLVMFuzzerTestOneInput(const uint8_t* data, size_t size) { if (size < 1) { return 0; } cgltf_options options; memset(&options, 0, sizeof(cgltf_options)); cgltf_data* parsed_data = NULL; cgltf_result result; // Parse input data result = cgltf_parse(&options, data, size, &parsed_data); if (result == cgltf_result_success) { result = cgltf_validate(parsed_data); } if (result == cgltf_result_success) { // Use the parsed data in some way // For example, print file type and mesh count printf("Type: %u\n", parsed_data->file_type); printf("Meshes: %u\n", (unsigned)parsed_data->meshes_count); } cgltf_free(parsed_data); return 0; } ``` Ref: google#494 --------- Signed-off-by: David Korczynski <[email protected]>

DavidKorczynski added 2 commits July 17, 2024 09:25

experimental: enable test-to-harness conversion

5085382

Signed-off-by: David Korczynski <[email protected]>

fix styling

6eeaf92

Signed-off-by: David Korczynski <[email protected]>

DavidKorczynski mentioned this pull request Jul 17, 2024

cgltf: init google/oss-fuzz#12236

Draft

DavidKorczynski mentioned this pull request Jul 17, 2024

Logic for test-to-harness conversion #494

Open

DavidKorczynski requested review from DonggeLiu and oliverchang July 17, 2024 16:57

cleanup

218a4a6

Signed-off-by: David Korczynski <[email protected]>

DonggeLiu approved these changes Jul 17, 2024

View reviewed changes

address review

3214fd7

Signed-off-by: David Korczynski <[email protected]>

DavidKorczynski merged commit 4309def into main Jul 18, 2024
6 checks passed

DavidKorczynski deleted the enable-test-writer branch July 18, 2024 10:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

experimental: add test-to-harness conversion logic #495

experimental: add test-to-harness conversion logic #495

DavidKorczynski commented Jul 17, 2024 •

edited

Loading

DavidKorczynski commented Jul 17, 2024

DavidKorczynski commented Jul 17, 2024

DonggeLiu left a comment

DonggeLiu Jul 17, 2024

DavidKorczynski Jul 18, 2024

DonggeLiu Jul 17, 2024

DavidKorczynski Jul 18, 2024

DavidKorczynski commented Jul 18, 2024

experimental: add test-to-harness conversion logic #495

experimental: add test-to-harness conversion logic #495

Conversation

DavidKorczynski commented Jul 17, 2024 • edited Loading

DavidKorczynski commented Jul 17, 2024

DavidKorczynski commented Jul 17, 2024

DonggeLiu left a comment

Choose a reason for hiding this comment

DonggeLiu Jul 17, 2024

Choose a reason for hiding this comment

DavidKorczynski Jul 18, 2024

Choose a reason for hiding this comment

DonggeLiu Jul 17, 2024

Choose a reason for hiding this comment

DavidKorczynski Jul 18, 2024

Choose a reason for hiding this comment

DavidKorczynski commented Jul 18, 2024

DavidKorczynski commented Jul 17, 2024 •

edited

Loading