Reconstruction of Forensic Timelines Using Graph Theory
recongraph is a Python library designed to reconstruct and visualize system behaviors and activities based on logs from various devices, such as Windows and Linux systems. It converts Plaso log2timeline CSV files into a forensic graph timeline. By parsing sequential log data and mapping them to defined events, recongraph builds a MultiDiGraph (Multi-Directed Graph) that represents the state transitions and operational flow of the target system. This graph-based approach aids in forensic analysis, anomaly detection, and understanding complex system behaviors across diverse platforms.
- Sigma Rule-Based Pattern Matching: Leverages standardized Sigma rules to identify and label security-relevant events in raw logs.
- Forensic Graph Construction: Transforms sequential log entries from Plaso (log2timeline) into a directed graph, where nodes represent detected events and edges represent temporal transitions.
- Intelligent Log Detection: Automatically identifies various log formats (e.g., Apache, Linux auth, Syslog) and extracts relevant metadata like HTTP methods, URIs, and status codes.
- Weighted Behavioral Mapping: Edges are weighted by transition frequency, helping to distinguish common flows from rare or suspicious sequences.
- Anomaly-Focused Reconstruction: Specifically isolates and maps behaviors based on rule severity levels (Critical, High, Medium, Low).
- Multi-Format Export: Exports graphs to GraphML for visualization (Gephi, Cytoscape) and detailed forensic timelines to CSV.
- Python 3.13 or higher
- Git
- Python virtual environment (venv or conda)
Recongraph uses several Python packages to function properly. It is recommended to install the package in a virtual environment to avoid dependency conflicts. Here is a simple example of how to create and activate a virtual environment:
-
Anaconda or Miniconda
conda create -n recongraph python conda activate recongraph
Or using venv (recommended):
-
Venv
python -m venv venv source venv/bin/activate
Recongraph package installation can be done directly from PyPI using pip or by cloning this repository
pip install recongraphOr installing by cloning this repository:
-
Clone the Repository
git clone https://github.com/forensic-timeline/recongraph
-
Install Depedencies
cd recongraph pip install -e .
To use the recongraph tools, sigma rules are needed to label and detect events in the log files. Sigma rules can be downloaded from https://github.com/SigmaHQ/sigma. The sigma rules are released under the Detection Rule License (DRL) 1.1.
Using git clone, you can use the sigma rules folder:
git clone https://github.com/SigmaHQ/sigmaHere is a simple example of how to use recongraph to reconstruct a forensic timeline:
recongraph -f /path/to/your/plaso-file.csv -r /path/to/your/sigma-rules-folder -o output-filename.graphmlTo ensure that the installation is correct and the code is functioning as expected, you can run the test suite provided in the tests/ directory.
-
Install Test Dependencies: Ensure you have
pytestinstalled.pip install pytest pandas pyyaml
-
Run Tests: Navigate to the project root directory and execute:
pytest -v
You should see output indicating that all tests have passed.
recongraph processes raw log data and applies Sigma rules to identify significant security events.
A sequential log file containing system activities. The tool supports supports CSV format from Plaso (log2timeline).
A directory containing standardized Sigma rules in .yml format. These rules define the logic used to detect and label events within the logs.
Sigma rules are downloaded from https://github.com/SigmaHQ/sigma.
The content of that repository is released under the following licenses:
- The Sigma specification (https://github.com/SigmaHQ/sigma-specification) and the Sigma logo are public domain
- The rules contained in the SigmaHQ repository (https://github.com/SigmaHQ) are released under the Detection Rule License (DRL) 1.1
The tool generates several files to aid in analysis:
- GraphML File (
reconstruction_edge_graph.graphml): A directed graph where nodes are detected events and edges represent the flow between them. Suitable for visualization in Gephi or Cytoscape. - Event Logs CSV (
reconstruction_event_logs.csv): A detailed breakdown of every log entry associated with a graph node, including timestamps and raw message content. - Sigma Labeled CSV (
<filename>_sigma_labeled.csv): The input log file augmented with matching Sigma rule titles and severity levels.
Full documentation is available at ReadTheDocs.
This project is licensed under the MIT License.
This project uses Sigma Rules for event detection.
- The Sigma specification and logo are public domain.
- The detection rules from the SigmaHQ repository are released under the Detection Rule License (DRL) 1.1.