Lucent Programming Language Implementation

Introduction

Welcome to Lucent, a modern programming language designed to be both lucid and fluent. Lucent combines strong type safety with an intuitive syntax, making it ideal for developers of all experience levels. This guide will help you get started with Lucent and explore its powerful features.

Writing Your First Program

Create a new file, hello_world.txt, and add the following code:

println("Hello, World!");

Run the program using the following command:

./run.sh hello_world.txt

You should see the output:

Hello, World!

Key Features

Lucent is designed to be user-friendly and powerful. Here are some of its core features:

Control Flow

Lucent provides comprehensive flow control with strict type checking:

If-then-else expressions with boolean conditions
While loops with break/continue statements
Function definitions with type annotations and return values
Let expressions for variable declarations
Return statements for explicit function output
Multiple statement blocks with proper scoping

Type System

Lucent's static type system ensures compile-time guarantees:

Basic types: int, bool, string
Array types with indexing and slicing: int[], string[], etc.
Multi-dimensional arrays: int[][], etc.
Dictionary type with string keys: dict
User-defined types (structs) with field access
Type annotations for function parameters and return types
Explicit type conversion using str() and parseInt()
Array initialization using new operator

Operations

Lucent supports type-safe operations with overflow protection:

Arithmetic: +, -, *, /, %, ** (power)
String: ++ (concatenation with strict type checking)
Comparison: <, >, <=, >=, ==, !=
Logical: and, or
Array concatenation using + operator
Array and string slicing using [start:end] syntax

Data Structures

Lucent includes built-in collections with strong type guarantees:

Arrays with indexing, slicing, and length operation
Dictionaries with string keys and arbitrary value types
User-defined types (structs) with named fields
Multi-dimensional arrays

Advanced Features

First-class functions and closures
Function inlining optimization
Peephole optimization (constant folding)
Bytecode compilation for efficient execution
Proper lexical scoping
Variable capture in closures

User-Defined Functions

Lucent provides comprehensive support for user-defined functions, a central feature of the language. The subsections below cover definition and calling, return values, recursion, array parameters, closures, and scoping.

Function Definition and Calling

Functions are defined using the fun keyword, with explicit parameter types and return type:

fun add(x: int, y: int): int {
    return x + y;  # Type-checked return value
}

int result = add(5, 10);
println(result);  # Prints 15

Function Return Values

Explicit return types with compile-time verification:

fun max(a: int, b: int): int {
    if (a > b) {
        return a;
    } else {
        return b;
    }
}

Recursive Functions

Lucent fully supports recursive function calls:

fun factorial(n: int): int {
    if (n <= 1) {
        return 1;
    } else {
        return n * factorial(n-1);
    }
}

println(factorial(5));  # Prints 120

Functions with Arrays

Functions can take arrays as parameters and return arrays:

fun sumArray(arr: int[]): int {
    int sum = 0;
    int i = 0;
    while(i < len(arr)) {
        sum = sum + arr[i];
        i = i + 1;
    }
    return sum;
}

int[] numbers = [1, 2, 3, 4, 5];
println(sumArray(numbers));  # Prints 15

Function Closures

Lucent supports closures, allowing functions to capture variables from their outer scope:

fun makeAdder(x: int): int {
    fun add(y: int): int {
        return x + y;  # x is captured from outer function
    }
    return add(10);  # Returns x + 10
}

println(makeAdder(5));  # Prints 15
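
One common way to implement variable capture is a chain of environments, where a function value remembers the scope it was defined in. The Python sketch below illustrates that idea only; the names Env and Closure are hypothetical and are not taken from main.py.

```python
class Env:
    """A scope: local bindings plus a link to the enclosing scope."""

    def __init__(self, parent=None):
        self.vars = {}
        self.parent = parent

    def define(self, name, value):
        self.vars[name] = value

    def lookup(self, name):
        # Walk outward through enclosing scopes until the name is found.
        env = self
        while env is not None:
            if name in env.vars:
                return env.vars[name]
            env = env.parent
        raise NameError(f"undefined variable: {name}")


class Closure:
    """A function value: parameter names, a body, and the defining scope."""

    def __init__(self, params, body, env):
        self.params = params
        self.body = body  # in this sketch, a Python callable that takes an Env
        self.env = env    # the captured (defining) scope

    def call(self, *args):
        # The call scope chains to the defining scope, not to the caller's.
        local = Env(parent=self.env)
        for name, value in zip(self.params, args):
            local.define(name, value)
        return self.body(local)


def make_adder(x):
    """Mirror of the Lucent makeAdder example above."""
    outer = Env()
    outer.define("x", x)
    add = Closure(["y"], lambda env: env.lookup("x") + env.lookup("y"), outer)
    return add.call(10)


result = make_adder(5)  # x is found through the captured scope
```

Here the lookup of x succeeds because the call scope of add is linked to the scope in which add was defined.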

Proper Variable Scoping

Functions create their own scope, and parameters are local to the function:

int x = 1;
fun foo(x: int): int {
    x = 2;         # Changes local x, not global x
    return x;
}
println(foo(x));   # Prints 2
println(x);        # Prints 1 (global x is unchanged)

User-Defined Types

Lucent's type system supports custom data structures through a struct-like syntax. Key features include:

Dictionary-style field access for clean syntax
Strong type checking for field assignments
Support for nested type definitions
First-class type system integration
Immutable field names with mutable values
Full interoperability with functions and arrays

Type Definition and Creation

# Define a Person type
type Person { "name": string, "age": int };

# Create an instance
Person p = Person { "name": "Alice", "age": 30 };

# Access fields
println(p{"name"});  # Prints "Alice"
println(p{"age"});   # Prints 30

Field Modification

The language provides controlled mutability for struct fields:

Type-safe field updates
Runtime bounds checking
Maintains type consistency
Prevents undefined field access

type Counter { "value": int };
Counter c = Counter { "value": 0 };

# Update the field
c{"value"} = c{"value"} + 1;
println(c{"value"});  # Prints 1
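
The guarantees listed above (type-safe updates, no access to undefined fields) can be pictured with a small Python model. This is a sketch under assumptions: the Struct class and the PY_TYPES table are hypothetical, and the checks run at run time here, whereas this README does not say when Lucent performs each check.

```python
# Hypothetical mapping from Lucent type names to Python types.
PY_TYPES = {"int": int, "string": str, "bool": bool}


class Struct:
    """An instance of a user-defined type with a fixed set of typed fields."""

    def __init__(self, type_name, schema, values):
        if set(values) != set(schema):
            raise TypeError(f"{type_name}: fields must be exactly {sorted(schema)}")
        self.type_name = type_name
        self.schema = schema  # field name -> Lucent type name
        self.values = {}
        for field, value in values.items():
            self.set(field, value)

    def get(self, field):
        if field not in self.schema:
            raise KeyError(f"{self.type_name} has no field {field!r}")
        return self.values[field]

    def set(self, field, value):
        if field not in self.schema:
            raise KeyError(f"{self.type_name} has no field {field!r}")
        expected = PY_TYPES[self.schema[field]]
        # Use type() rather than isinstance(): Python's bool is a subclass of int.
        if type(value) is not expected:
            raise TypeError(f"{field!r} expects {self.schema[field]}")
        self.values[field] = value


# Mirror of the Counter example above.
counter = Struct("Counter", {"value": "int"}, {"value": 0})
counter.set("value", counter.get("value") + 1)
```

Field names are fixed when the instance is created, while the values stay mutable, which matches the "immutable field names with mutable values" property described earlier.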

Nested Types

Types can be nested within other types:

type Point { "x": int, "y": int };
type Circle { "center": Point, "radius": int };

Circle c = Circle { 
    "center": Point { "x": 5, "y": 10 }, 
    "radius": 15 
};

println(c{"center"}{"x"});  # Prints 5

Using Types with Functions

Types can be used as function parameters and return values:

type Rectangle { "width": int, "height": int };

fun area(rect: Rectangle): int {
    return rect{"width"} * rect{"height"};
}

Rectangle r = Rectangle { "width": 5, "height": 10 };
println("Area: " ++ str(area(r)));  # Prints "Area: 50"

Array Operations

Arrays are first-class citizens with comprehensive features:

Zero-based indexing with bounds checking
Dynamic size allocation with new operator
Static initialization with literals
Multi-dimensional support
Efficient memory management
Built-in length operations
Type-safe concatenation and slicing

Array Creation and Access

# Static array initialization
int[] numbers = [1, 2, 3, 4, 5];
string[] words = ["hello", "world"];

# Dynamic array initialization using new
int[] nums = new int[10];          # Creates [0,0,0,0,0,0,0,0,0,0]
bool[] flags = new bool[5];        # Creates [false,false,false,false,false]
string[] names = new string[3];    # Creates ["","",""]

# Multi-dimensional array initialization
int[][] matrix = new int[3][3];    # Creates 3x3 matrix of zeros
int[][][] cube = new int[2][2][2]; # Creates 2x2x2 3D array of zeros

# Empty array initialization 
int[] empty = [];

# Array indexing (zero-based)
println(numbers[0]);  # Prints 1
println(words[0]);    # Prints "hello"

# Array assignment
numbers[1] = 10;
words[1] = "WORLD";
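
The default values produced by the new operator can be modeled as a recursive allocation. The Python sketch below is illustrative only: new_array and DEFAULTS are hypothetical names, and the default for each type is taken from the comments in the example above.

```python
# Default element values, as shown in the comments of the example above.
DEFAULTS = {"int": 0, "bool": False, "string": ""}


def new_array(elem_type, *dims):
    """Allocate a (possibly multi-dimensional) array filled with defaults."""
    if not dims:
        raise ValueError("at least one dimension is required")
    size, rest = dims[0], dims[1:]
    if size < 0:
        raise ValueError("array size must be non-negative")
    if not rest:
        return [DEFAULTS[elem_type] for _ in range(size)]
    # Build each row separately so that rows do not alias one another.
    return [new_array(elem_type, *rest) for _ in range(size)]


matrix = new_array("int", 3, 3)
matrix[0][0] = 1  # only the first row changes
```

Allocating every inner array separately matters: if all rows shared one list, writing to matrix[0][0] would appear to change every row.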

Array Length

int[] arr = [1, 2, 3, 4, 5];
println(len(arr));  # Prints 5

Array Slicing

int[] arr = [1, 2, 3, 4, 5];
int[] sliced = arr[1:4];  # Creates [2, 3, 4]
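
The example implies half-open slices: arr[start:end] includes start and excludes end. A Python sketch of that rule follows. The bounds check is an assumption (this README promises bounds checking for indexing but does not say how out-of-range slices behave), and slice_checked is a hypothetical name.

```python
def slice_checked(seq, start, end):
    """Return seq[start:end] (end exclusive), rejecting out-of-range bounds."""
    if not (0 <= start <= end <= len(seq)):
        raise IndexError(f"slice [{start}:{end}] out of range for length {len(seq)}")
    return seq[start:end]


array_slice = slice_checked([1, 2, 3, 4, 5], 1, 4)   # elements at indices 1, 2, 3
string_slice = slice_checked("hello world", 1, 5)    # characters at indices 1..4
```

The same rule covers both arrays and strings, and the length of the result is always end - start.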

Array Concatenation

int[] a = [1, 2, 3];
int[] b = [4, 5, 6];
int[] combined = a + b;  # Results in [1, 2, 3, 4, 5, 6]

Multi-dimensional Arrays

int[][] matrix = [[1, 2, 3], [4, 5, 6], [7, 8, 9]];
println(matrix[0][1]);  # Prints 2

String Operations

String handling includes:

Immutable string values
Efficient concatenation
Unicode support
Built-in length and indexing
Type-safe operations
Automatic memory management

String Concatenation

string greeting = "Hello";
string target = "World";
string message = greeting ++ " " ++ target;  # "Hello World"

String Indexing

string text = "Hello";
println(text[0]);  # Prints "H"
println(text[4]);  # Prints "o"

String Length

string text = "Hello";
println(len(text));  # Prints 5

String Slicing

string s = "hello world";
string sliced = s[1:5];  # "ello"

String Conversion

# String to int conversion
string numStr = "42";
int num = parseInt(numStr);    # Converts string to int: 42
string invalid = "abc";
int result = parseInt(invalid); # Runtime error: Invalid integer format

# Int to string conversion
int x = 42;
string text = str(x);         # Converts int to string: "42"
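
A strict conversion of this kind can be sketched in Python as follows. The accepted format (an optional sign followed by decimal digits, with no surrounding whitespace) is an assumption, since this README only shows that "42" succeeds and "abc" fails; parse_int is a hypothetical name.

```python
import re

# Assumed format: optional sign, then one or more ASCII decimal digits.
_INT_RE = re.compile(r"[+-]?[0-9]+")


def parse_int(text):
    """Convert text to an int, or fail with the error message shown above."""
    if _INT_RE.fullmatch(text) is None:
        raise RuntimeError("Invalid integer format")
    return int(text)


num = parse_int("42")
round_trip = str(num)  # back to the original string
```

The explicit pattern is used because Python's own int() is more lenient (it accepts surrounding whitespace and underscores), which a strict language-level parseInt may not want.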

Dictionary Operations

Dictionaries provide:

String-keyed lookup tables
Dynamic size allocation
Type-safe value storage
Fast key-based access
Mutable value storage
Runtime key validation

# Dictionary creation
dict person = {"name": "Alice", "age": 30, "city": "Wonderland"};

# Accessing dictionary values
string name = person{"name"};  # "Alice"
int age = person{"age"};       # 30

# Updating dictionary values
person{"age"} = 31;
person{"city"} = "New York";

Control Flow

Control structures feature:

Short-circuit evaluation
Proper block scoping
Structured loop control
Conditional branching
Loop interruption tools
Scope-aware variables

If-Else Statements

int x = 10;
if (x > 5) {
    println("x is greater than 5");
} else {
    println("x is not greater than 5");
}

While Loops

int i = 0;
while (i < 5) {
    println(i);
    i = i + 1;
}

Break and Continue

int i = 0;
while (i < 10) {
    i = i + 1;
    if (i % 2 == 0) {
        continue;  # Skip even numbers
    }
    if (i > 7) {
        break;     # Exit loop when i > 7
    }
    println(i);    # Prints 1, 3, 5, 7
}

Optimizations

The compiler implements several optimization strategies:

Constant expression evaluation
Dead code elimination
Function inlining for performance
Loop optimization
Common subexpression elimination
Register allocation

Function Inlining

The compiler automatically inlines small, non-recursive functions to improve performance:

fun triple(x: int): int {
    return 3 * x;
}

# This will be optimized by inlining the triple function
int result = triple(5);  # Becomes effectively: int result = 3 * 5;
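
Inlining of this kind can be pictured as substituting the argument expressions for the parameters in the function body. The Python sketch below works on a toy tuple-based AST; the node shapes and the inlinable table are hypothetical and not taken from main.py. It assumes the table contains only small, non-recursive functions whose body is a single return expression, and it leaves calls inside an inlined body untouched.

```python
def substitute(expr, bindings):
    """Replace parameter references in expr with the bound argument expressions."""
    kind = expr[0]
    if kind == "const":
        return expr
    if kind == "var":
        return bindings.get(expr[1], expr)
    if kind == "binop":
        _, op, left, right = expr
        return ("binop", op, substitute(left, bindings), substitute(right, bindings))
    if kind == "call":
        _, name, args = expr
        return ("call", name, [substitute(a, bindings) for a in args])
    raise ValueError(f"unknown node kind: {kind}")


def inline_calls(expr, inlinable):
    """Replace calls to functions listed in inlinable with their bodies."""
    kind = expr[0]
    if kind in ("const", "var"):
        return expr
    if kind == "binop":
        _, op, left, right = expr
        return ("binop", op, inline_calls(left, inlinable), inline_calls(right, inlinable))
    if kind == "call":
        _, name, args = expr
        args = [inline_calls(a, inlinable) for a in args]
        if name in inlinable:
            params, body = inlinable[name]
            return substitute(body, dict(zip(params, args)))
        return ("call", name, args)
    raise ValueError(f"unknown node kind: {kind}")


# fun triple(x: int): int { return 3 * x; }
INLINABLE = {"triple": (["x"], ("binop", "*", ("const", 3), ("var", "x")))}
inlined = inline_calls(("call", "triple", [("const", 5)]), INLINABLE)
```

After inlining, the expression is 3 * 5 with no call left. Plain substitution copies an argument expression once per use of its parameter, so an argument with side effects would first have to be stored in a temporary.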

Peephole Optimization

The compiler performs constant folding and other peephole optimizations:

int x = 5 + 3 * 2;  # Compiled as: int x = 11;
bool check = 5 < 10 and 20 > 15;  # Compiled as: bool check = true;
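
Constant folding can be sketched as a bottom-up pass that evaluates any operator whose operands are both constants. The Python example below uses a toy tuple-based AST; the node shapes and the fold function are hypothetical, and only a subset of Lucent's operators is handled (division is omitted so the sketch does not need to deal with division by zero).

```python
import operator

# Operators folded in this sketch, keyed by their Lucent spelling.
OPS = {
    "+": operator.add,
    "-": operator.sub,
    "*": operator.mul,
    "<": operator.lt,
    ">": operator.gt,
    "and": lambda a, b: a and b,
    "or": lambda a, b: a or b,
}


def fold(node):
    """Return an equivalent node with constant subexpressions pre-computed."""
    kind = node[0]
    if kind in ("const", "var"):
        return node
    if kind == "binop":
        _, op, left, right = node
        left, right = fold(left), fold(right)
        if left[0] == "const" and right[0] == "const":
            return ("const", OPS[op](left[1], right[1]))
        return ("binop", op, left, right)
    raise ValueError(f"unknown node kind: {kind}")


# 5 + 3 * 2
arith = fold(("binop", "+", ("const", 5), ("binop", "*", ("const", 3), ("const", 2))))
# 5 < 10 and 20 > 15
logic = fold(("binop", "and",
              ("binop", "<", ("const", 5), ("const", 10)),
              ("binop", ">", ("const", 20), ("const", 15))))
# x + 3 * 2: only the constant part can be folded
partial = fold(("binop", "+", ("var", "x"), ("binop", "*", ("const", 3), ("const", 2))))
```

The two fully constant expressions collapse to single constants (11 and true), while the expression that mentions x keeps its addition and only has the 3 * 2 part folded.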

Bytecode Compilation

Lucent includes a bytecode compiler and a virtual machine for executing programs:

The bytecode compiler transforms the AST into a sequence of bytecode instructions
The bytecode VM interprets these instructions more efficiently than direct AST interpretation
Function inlining and constant folding optimizations are performed during bytecode generation
The bytecode format includes instruction opcodes, constants, and variable information
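
The pipeline described above (AST to bytecode, then execution on a VM) can be illustrated with a minimal stack machine in Python. The opcode names PUSH, LOAD, ADD, SUB, and MUL and the tuple-based AST are hypothetical; the actual instruction set in main.py may differ.

```python
import operator

# Hypothetical opcodes for binary operators, and how the VM evaluates them.
OPCODE_FOR = {"+": "ADD", "-": "SUB", "*": "MUL"}
BINARY_OPS = {"ADD": operator.add, "SUB": operator.sub, "MUL": operator.mul}


def compile_expr(node, code=None):
    """Emit stack-machine instructions for an expression in post-order."""
    if code is None:
        code = []
    kind = node[0]
    if kind == "const":
        code.append(("PUSH", node[1]))
    elif kind == "var":
        code.append(("LOAD", node[1]))
    elif kind == "binop":
        _, op, left, right = node
        compile_expr(left, code)
        compile_expr(right, code)
        code.append((OPCODE_FOR[op], None))
    else:
        raise ValueError(f"unknown node kind: {kind}")
    return code


def execute(code, variables):
    """Run the instructions on an operand stack and return the final value."""
    stack = []
    for opcode, arg in code:
        if opcode == "PUSH":
            stack.append(arg)
        elif opcode == "LOAD":
            stack.append(variables[arg])
        elif opcode in BINARY_OPS:
            right = stack.pop()
            left = stack.pop()
            stack.append(BINARY_OPS[opcode](left, right))
        else:
            raise ValueError(f"unknown opcode: {opcode}")
    return stack.pop()


# (x + 2) * 3
program = compile_expr(("binop", "*", ("binop", "+", ("var", "x"), ("const", 2)), ("const", 3)))
value = execute(program, {"x": 4})
```

The compiler walks the tree once and the VM then runs a flat instruction list, which avoids repeated tree traversal when the same code executes many times, for example inside a loop.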

Example Bytecode Execution

The output below comes from compiling the Project Euler #1 solution and running it on the bytecode VM:

Compiling code...

Bytecode Stats:
Instructions: 21
Constants: 6
Variables: 3
Max Stack Size: 4

Executing bytecode...
Sum of all multiples of 3 or 5 below 1000: 233168

Expected output: Sum of all multiples of 3 or 5 below 1000: 233168
Actual output: Sum of all multiples of 3 or 5 below 1000: 233168
Output matches expected!

Running the Language

Using the run.sh Script

To execute a Lucent program, use the run.sh script:

./run.sh your_program.txt

This script compiles your program and runs it, displaying the output in the terminal.

Command Line Options

The run.sh script accepts the following options:

# Full compiler pipeline with optimizations
./run.sh program.txt

# Run with interpreter only (no bytecode)
./run.sh program.txt --interpret

# Run with debug information
./run.sh program.txt --debug

# Run with optimization disabled
./run.sh program.txt --no-optimize

Project Structure

The project is organized for maintainability and clarity:

Modular component design
Clear separation of concerns
Comprehensive test coverage
Example-driven documentation
Easy-to-use build system
Flexible configuration options

/
├── main.py                 # Core language implementation
├── run.sh                  # Script to run programs
├── tests/                  # Test suites
│   ├── __init__.py        # Makes tests a package
│   ├── test_framework.py  # Testing infrastructure
│   ├── unit_tests.py      # Basic language feature tests
│   ├── error_tests.py     # Error handling tests
│   ├── bytecode_tests.py  # Tests for bytecode compilation
│   ├── project_euler_tests.py  # Complex algorithmic tests
│   └── tests.py           # Test runner
├── examples/               # Example programs
│   ├── basics.txt         # Basic language features
│   ├── closures.txt       # Closure and function examples 
│   └── structs.txt        # User-defined types examples
├── LICENSE                # MIT License
└── README.md             # This documentation

Examples from Project Euler

The language is powerful enough to solve complex algorithmic problems. Here are some examples:

Project Euler #1 - Multiples of 3 or 5

int x = 1;
int sum = 0;
while(x < 1000) {
    if (x % 3 == 0 or x % 5 == 0) {
        sum = sum + x;
    }
    x = x + 1;
}
println("Sum of all multiples of 3 or 5 below 1000: " ++ str(sum));
# Output: Sum of all multiples of 3 or 5 below 1000: 233168
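
The result can be cross-checked independently. The Python sketch below computes the same sum in closed form by inclusion-exclusion (multiples of 3, plus multiples of 5, minus multiples of 15, which would otherwise be counted twice) and compares it with a direct loop. The helper name is hypothetical.

```python
def sum_multiples_below(k, limit):
    """Sum of the positive multiples of k that are strictly below limit."""
    m = (limit - 1) // k          # number of such multiples
    return k * m * (m + 1) // 2   # k * (1 + 2 + ... + m)


closed_form = (sum_multiples_below(3, 1000)
               + sum_multiples_below(5, 1000)
               - sum_multiples_below(15, 1000))
brute_force = sum(x for x in range(1, 1000) if x % 3 == 0 or x % 5 == 0)
```

Both values equal 233168, matching the output of the Lucent program.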

Project Euler #2 - Even Fibonacci Sum

int sum = 0;
int a = 1;
int b = 2;
while (b < 4000000) {
    if (b % 2 == 0) {
        sum = sum + b;
    }
    int temp = a + b;
    a = b;
    b = temp;
}
println("Sum of even-valued Fibonacci terms below 4 million: " ++ str(sum));
# Output: Sum of even-valued Fibonacci terms below 4 million: 4613732
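
As a cross-check, the even Fibonacci terms can be generated directly: every third term is even, and consecutive even terms satisfy E(n) = 4 * E(n-1) + E(n-2), starting from 2 and 8. The Python sketch below uses that recurrence; the function name is hypothetical.

```python
def even_fib_sum(limit):
    """Sum of the even Fibonacci terms strictly below limit."""
    total = 0
    current, following = 2, 8  # the first two even Fibonacci terms
    while current < limit:
        total += current
        current, following = following, 4 * following + current
    return total


even_sum = even_fib_sum(4000000)
```

This visits only the eleven even terms below four million (2, 8, 34, ..., 3524578) and gives the same total as the Lucent loop, 4613732.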

Project Euler #4 - Largest Palindrome Product

int maxpalindrome = 0;
int i = 99;  

while (i >= 10) {
    int j = 99;  
    while (j >= i) {  
        int product = i * j;
        
        if (product <= maxpalindrome) {
            break;
        }
        
        int reversed = 0;
        int temp = product;
        while (temp > 0) {
            reversed = reversed * 10 + temp % 10;
            temp = temp / 10;
        }
        
        if (product == reversed and product > maxpalindrome) {
            maxpalindrome = product;
        }
        j = j - 1;
    }
    i = i - 1;
}

println("Largest palindrome made from the product of two 2-digit numbers: " ++ str(maxpalindrome));
# Output: Largest palindrome made from the product of two 2-digit numbers: 9009
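
The same algorithm can be ported to Python as a cross-check. The sketch below assumes that Lucent's / on int operands is integer division, which the digit-reversal loop above relies on; the function names are hypothetical.

```python
def reverse_digits(n):
    """Reverse the decimal digits of a non-negative integer."""
    reversed_n = 0
    while n > 0:
        reversed_n = reversed_n * 10 + n % 10
        n //= 10  # integer division, assumed to match Lucent's int division
    return reversed_n


def largest_palindrome_product(low, high):
    """Largest palindromic product i * j with low <= i <= j <= high."""
    best = 0
    for i in range(high, low - 1, -1):
        for j in range(high, i - 1, -1):
            product = i * j
            if product <= best:
                break  # products only shrink as j decreases
            if product == reverse_digits(product):
                best = product
    return best


answer = largest_palindrome_product(10, 99)
```

The early break is safe because, for a fixed i, the product decreases as j decreases, so no later j can beat the current best. The result, 9009, is the product of 91 and 99.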

VenkatReddybathuni/Compilers