bolu.dev | 💻 + 🧬 + 📸

May 30, 2024 / 4 min
`std::ref` and `std::reference_wrapper` in C++

In refactoring legacy C++ codebases we often have to deal with a lot of functions or class methods that takes a pointer as an argument and then does a bunch of null checks. This is a common pattern in C++ codebases that are not modernized yet.

Modern C++ has introduced a few utilities to help with this pattern. One of them is std::ref and std::reference_wrapper. In this post, I wanted to talk about these tools and how they can improve the safety and readability of modern C++ code.

Read more...
May 23, 2024 / 11 min
Let's build an asyncio runtime from scratch in Python

asyncio in Python is a library that provides a way to write concurrent code using the async and await syntax. It is built on top of the asyncio event loop, which is a single-threaded event loop that runs tasks concurrently. Inspired by a similar post by Jacob, we will explore how asyncio works from scratch by implementing our own event loop runtime with Python generators.

Read more...
Mar 1, 2024 / 2 min
jthread in C++20
std::jthread introduced in C++20 is a new thread class that is cancellable and joinable. It is a wrapper around std::thread that provides a few additional features. In this post, I wanted to talk about std::jthread and how it can be used in modern C++ codebases.

Advantages over C++11 std::thread:
- cancellable, can be stopped at any time, unlike std::thread which can only be stopped at the end of the thread function
- works better with RAII pattern, since it can be joined or detached in the destructor
Read more...
Dec 1, 2023 / 7 min
Build a strong type system via Python typehints
Python typehinting system is getting more powerful by each Python version. Projects I’m involved with are now enforcing typehints on all new code. This has been great for a variety of reasons:
- Improves IDE support in terms of linting, autocompletion, and refactoring
- Makes the codebase more readable and maintainable
- Helps catch bugs early in the development cycle
In this post, I’ll share some of the additional features we’ve been able to enable now that most of our codebases are typehinted.

Read more...
Sep 1, 2023 / 7 min
Get the Python GIL play nice with C++
It is no surprise that the GIL is one of the biggest drawbacks of using Python in performance oriented applications. The GIL, or Global Interpreter Lock, is a mutex that protects access to Python objects, preventing multiple threads from executing Python bytecodes at once. This means that even if you have multiple threads running in parallel, only one of them can execute Python code at a time. This can be a major bottleneck for applications that require high performance, as it limits the amount of parallelism that can be achieved.

To defeat the GIL, there are two commonly taken path:
- the first is to opt for multiprocessing instead of threads.
- Re-write the core performance critical code using a lower level language such as C++ or Rust
Today, let’s talk about the 2nd approach. With excellent next generation binding libraries such as pybind11 and pyo3, it has become a lot simpler to support Rust/C++ code in a Python project.

However, often the porting to C++ / Rust from existing application code do not happen overnight. In the beginning, it is mostly a few performance critical functions that are ported to C++ / Rust. In such cases, it is common to see a mix of Python and C++ / Rust code in the same project. In these cases, the threading architecture / parallelism code could still be in Python, while the performance critical code is in C++ / Rust.

I’ve personally dealt with such systems where the GIL became a major bottleneck in the performance of the system due to ill-undertsanding of how it worked. As a result, I’m sharing my findings here.

Read more...
May 1, 2023 / 2 min
Stack optimization for small sized objects in modern C++
I came across a popular technique for providing a handle for storing small objects in the handle itself and larger ones on the heap. Using modern C++, this can be implemented quite nicely at compile time. Here is a simple example:
```
// max bytes to store on the stack
constexpr int on_stack_max = 20;

template<typename T>
struct Scoped {     // store a T in Scoped
        // ...
    T obj;
};

template<typename T>
struct OnHeap {    // store a T on the free store
        // ...
        T* objp;
};

template<typename T>
using Handle = typename std::conditional<(sizeof(T) <= on_stack_max),
                    Scoped<T>,      // first alternative
                    OnHeap<T>      // second alternative
               >::type;

void f()
{
    Handle<double> v1;                   // the double goes on the stack
    Handle<std::array<double, 200>> v2;  // the array goes on the free store
}
```
Let’s break this down
- constexpr int on_stack_max = 20;: This line defines a constant expression for the maximum number of bytes that can be stored on the stack.
- template<typename T> struct Scoped { T obj; };: This is a template struct that can store an object of any type T on the stack.
- template<typename T> struct OnHeap { T* objp; };: This is a template struct that can store a pointer to an object of any type T on the heap.
- template<typename T> using Handle = typename std::conditional<(sizeof(T) <= on_stack_max), Scoped<T>, OnHeap<T>>::type;: This line defines a template alias Handle that uses std::conditional to decide whether to use Scoped<T> or On_heap<T>. If the size of T is less than or equal to on_stack_max, it uses Scoped<T>. Otherwise, it uses On_heap<T>.
- void f() { Handle<double> v1; Handle<std::array<double, 200>> v2; }: This function demonstrates how to use the Handle template. v1 is a Handle that stores a double on the stack, because the size of a double is less than on_stack_max. v2 is a Handle that stores an std::array<double, 200> on the heap, because the size of std::array<double, 200> is greater than on_stack_max.
Of course, this assumes that T can be copied and moved around, and that it has a finite size. If T is not copyable or movable, you will need to adjust the implementation accordingly.

This shows how powerful modern C++ can be in terms of compile-time programming. It allows you to make decisions at compile time based on the properties of types, which can lead to more efficient and flexible code.

Read more...
Oct 14, 2022 / 11 min
Dive into Python asyncio - part 2
In the second part of this series on deep diving into asyncio and async/await in Python, we will be looking at the following topics:
- task, task groups, task cancellation
- async queues
- async locks and semaphores
- async context managers
- async error handling
Read more...

Sep 30, 2022 / 8 min

Dive into Python asyncio - part 1

For as long as I have worked in Python land, I never had to touch the async part of the language. I know that asyncio library has gotten a lot of love in the past few years. Recently I’ve came across an opportunity to do a lot of IO and non-cpu bound work in Python. I decided to take a deep dive into the asyncio library and see what it has to offer.

In part 1 of this series (I originally just wanted to write one post and realized the scope is way too big), we’ll cover:

How async code interfaces with synchronous code in Python
How to convert synchronous code to asynchronous code, including how to prevent blocking of the event loop via custom ThreadPoolExecutor
How to use asyncio to run multiple tasks concurrently

Basic example, async hello world

import asyncio

async def hello_world():
    asyncio.sleep(1)
    print("Hello world")

asyncio.run(hello_world())

>>> Hello world

Running two async functions in parallel

import asyncio

async def foo():
    while True:
        asyncio.sleep(1)
        print("foo")

async def bar():
    while True:
        asyncio.sleep(1)
        print("bar")

asyncio.run(asyncio.gather(foo(), bar()))

What if I have existing synchronous methods?

We can wrap a synchronous function in an async function, an example implementation would be a decorator (i love decorators, btw):

def async_wrap(
    loop: Optional[asyncio.BaseEventLoop] = None, executor: Optional[Executor] = None
) -> Callable:
    def _async_wrap(func: Callable) -> Callable:
        @wraps(func)
        async def run(*args, loop=loop, executor=executor, **kwargs):
            if loop is None:
                loop = asyncio.get_event_loop()
            pfunc = partial(func, *args, **kwargs)
            return await loop.run_in_executor(executor, pfunc)

        return run

    return _async_wrap

The above decorator is a higher order decorator (it takes arguments and then generates another decorator), example usage is the following:

import asyncio
import time

@async_wrap()
def foo():
    while True:
        time.sleep(1)
        print("foo from sync")

async def bar():
    while True:
        asyncio.sleep(1)
        print("bar from async")

asyncio.run(asyncio.gather(foo(), bar()))

Javascript oddities

A collection of weird things in Javascript:

1. `var` scoping rules

for (var i = 0; i < 3; ++i)
{
	const log = () => {
  	console.log(`a ${i}`);
  }
  setTimeout(log, 100);
}

for (let i = 0; i < 3; ++i)
{
	const log = () => {
  	console.log(`b ${i}`);
  }
  setTimeout(log, 100);
}

The output here is:

"a 3"
"a 3"
"a 3"
"b 0"
"b 1"
"b 2"

Why does var cause it to print 3?

2. `const` in Javascript does not mean the same as C/C++. Example:

const value = 3;
value = 4; // error, cannot override a constant
value += 3; // error

const obj = {a : 3};
obj.a += 3; //allowed
obj.a = 5; //allowed

Turns out const in Javascript is more of a “const” reference like const & in C++. It does not mean the value itself is constant - just the reference to the array cannot be changed.

3. Converting time formats can be tricky

Suppose you have a time in yyyy-mm-DD format and you want it in mm/DD/yyyy format.

new Date('2016-06-05').
  toLocaleString('en-us', {year: 'numeric', month: '2-digit', day: '2-digit'})

// Output:
>>> '06/04/2016'

Wait, what happened?, I asked for 2016-06-05 in mm/dd/YYYY but it gave me 06/04/2016 instead! This because all dates by default assumes it’s GMT time, when you convert it to a local timezone, you might get a different date.

The moment library fortunately makes this a lot easier.

var date = new Date('10/01/2021');
var formattedDate = moment(date).format('YYYY-MM-DD');

If we don’t want some extra dependency, it’s probably easier to just not convert the date into a Javascript Date obj and directly do string operations on it to get it to the format you want. Example:

function reformatDateString(dateString) {
    //reformat date string to from YYYY-MM-DD to MM/DD/YYYY
    if (dateString && dateString.indexOf('-') > -1) {
        const dateParts = dateString.split('-');
        return `${dateParts[1]}/${dateParts[2]}/${dateParts[0]}`;
    }
    return dateString;
}

`std::ref` and `std::reference_wrapper` in C++

Let's build an asyncio runtime from scratch in Python

jthread in C++20

Build a strong type system via Python typehints

Get the Python GIL play nice with C++

Stack optimization for small sized objects in modern C++

Dive into Python asyncio - part 2

Dive into Python asyncio - part 1

Basic example, async hello world

What if I have existing synchronous methods?

What is copiable?

What is copiable anyway?

Questions I had:

Dependency injection in Python

Javascript oddities

1. `var` scoping rules

2. `const` in Javascript does not mean the same as C/C++. Example:

3. Converting time formats can be tricky

Rust like enums in C++

Python testing ecosystem

Common, stupid, but non-obvious C++ mistakes I made

1. Capture by reference on transient objects

Rich D3 interactivity in Jekyll posts

Basic example, async hello world

What if I have existing synchronous methods?

What is copiable anyway?

Questions I had:

1. var scoping rules

2. const in Javascript does not mean the same as C/C++. Example:

3. Converting time formats can be tricky

1. Capture by reference on transient objects

1. `var` scoping rules

2. `const` in Javascript does not mean the same as C/C++. Example: