Category: Computer Vision

  • PyTorch Lightning: How To Keep Your PyTorch Project Clean

    As an engineer in perception and computer vision, I tend to work with PyTorch a lot.  It’s a very flexible framework and there are already many useful components built into it that prevent you from having to build it yourself.  However, if you have already built a few projects with it, you might have noticed…

  • A Starting Point for Learning Visual SLAM

    If you are like me, you are probably fascinated by the idea of SLAM and since you are here reading Perception ML articles, you are probably even more excited about Visual SLAM.  Along my path to become a better engineer, I have been doing some self-study about Visual SLAM.

  • Why You Should Always Visualize A Batch From Your Data Loader

    Here is one thing that every engineer working with AI hears at some point: Garbage In, Garbage Out However, despite that, something that I still see often enough in the Computer Vision industry and no doubt affects other machine learning disciplines as well is a complete lack of checking what is coming out of the…

  • Einops: Making Life Bit Easier (Mostly)

    When working in PyTorch, you are often faced with the need to manipulate tensors of multiple dimensions into various shapes or maybe combine dimensions together.  There’s a wealth of functions specifically for that too.  To name a few, view, permutate, stack, tile, and concat are some of the most common ones, but the list goes…

  • Computer Vision: How Your Laptop Sees Things

    You may have already noticed it, but in this day and age computers being able to understand images is a wide spread concept.  When you open your photo album app and search for “baby kittens”, somehow the computer magically knows you are looking for adorable little furballs instead of last night’s dinner or whatever else…

en_USEN