NA-MIC Project Weeks

Back to Projects List

Methods for self-supervised depth estimation and motion estimation in coloscopy under deformation

Key Investigators

Megha Kalia (Brigham and Women’s Hospital, Harvard Medical School)

Presenter location: In-person

Project Description

Estimating Depth and localizing endoscope in surgical environment is critical in many tasks such as, intra-operative registration, augmented reality, surgical automation, among many others. Monocular self-supervised depth and pose estimation methods can estimate depth and camera pose without requiring labels. However, how do these methods perform in presence of deformation while endoscope moves through the lumen is not known. Therefore, through this project we want to evaluate the effect of addition of primarily two modules on depth and pose estimation accuracy. These modules are TransUnet and Optical Flow Module. Optical Flow can capture the image intensity changes in the scene because of deformation. And TransUnet can potentially capture the temporal correlations between the image frames to give better pose and depth predictions. For the project open sources dataset and github codes will be utilized.

Objective

Objective A. To build and run and train the FlowNet on colonoscopy dataset
Objective B. To integrate the flowNet module in the Monodepth2 framework
Objective C. To integrate and evaluate TransUnet blocks in Monodepth2 framework.

Approach and Plan

Run the Monodepth2 on the colonoscopy dataset.
Train the optical flow network on the coloscopic dataset

Progress and Next Steps

Run the model on the coloscopic dataset.
Run TransUnet
Replace the TransUnet in Monodepth2 and run on the dataset
Describe specific steps you have actually done.
Initial implementation of optical flow network
Replace the Unet with TransUnet and see its effect of depth and pose prediction

Illustrations

Coming Soon!