Category: Computer Vision

PyTorch Lightning: How To Keep Your PyTorch Project Clean

As an engineer in perception and computer vision, I tend to work with PyTorch a lot. It’s a very flexible framework and there are already many useful components built into it that prevent you from having to build it yourself. However, if you have already built a few projects with it, you might have noticed…

February 13, 2024
A Starting Point for Learning Visual SLAM

If you are like me, you are probably fascinated by the idea of SLAM and since you are here reading Perception ML articles, you are probably even more excited about Visual SLAM. Along my path to become a better engineer, I have been doing some self-study about Visual SLAM.

January 1, 2024
Always Visualize A Batch From Your Data Loader

Here is one thing that every engineer working with AI hears at some point: Garbage In, Garbage Out However, despite that, something that I still see often enough in the Computer Vision industry and no doubt affects other machine learning disciplines as well is a complete lack of checking what is coming out of the…

December 22, 2023
Einops: Make Your Life Easier (Mostly)

When working in PyTorch, you are often faced with the need to manipulate tensors of multiple dimensions into various shapes or maybe combine dimensions together. There’s a wealth of functions specifically for that too. To name a few, view, permutate, stack, tile, and concat are some of the most common ones, but the list goes…

December 22, 2023
Computer Vision: How Your Laptop Sees Things

You may have already noticed it, but in this day and age computers being able to understand images is a wide spread concept. When you open your photo album app and search for “baby kittens”, somehow the computer magically knows you are looking for adorable little furballs instead of last night’s dinner or whatever else…

October 12, 2023