About

Hello! I'm Soheib Takhtardeshir currently a PhD Student at Mid Sweden University, Sweden 🇸🇪 & Université de Rennes I (Under INRIA), France 🇫🇷 as a dual degree PhD Student. My research targets learning-based compression of light field images, aiming to outperform traditional codecs like HEVC (H.265) and VVC (H.266). I design VAE architectures that disentangle spatial and angular features, guided by dual hyperpriors for efficient entropy modeling and mutual information regularization. To support real-time applications in AR/VR and telepresence, I develop fast post-encoding techniques such as latent reordering and non-uniform quantization.

Email
Publications
Subjective Visual Quality Assessment of Compressed Light Field Images: Learning-based vs. Conventional Methods
The IEEE 27th International Workshop on Multimedia Signal Processing (MMSP 2025)
Beijing, China 🇨🇳
Image

Even though many LF compression techniques have been proposed and evaluated in recent years, the leading-edge learning-based approaches have not been subject to the same level of scrutiny. This paper presents a subjective quality assessment study on four different LF compression methods, including two learning-based LF compression methods, which have not been studied before from a subjective quality point of view. For this purpose, subjective opinion scores were collected from viewers in two different universities for a cross-lab study.

Project Page

IEEE Access

Efficient and Fast Light Field Compression via VAE-Based Spatial and Angular Disentanglement
IEEE Access
Image

We propose a Variational Autoencoder (VAE)-based framework to disentangle the spatial and angular features of light field images, focusing on fast and efficient compression. Our method uses two separate sub-encoders: one for spatial and one for angular features to allow for independent processing in the latent space, which facilitates more streamlined compression workflows.

Project Page

QoMEX 2024

A Spherical Light Field Database for Immersive Telecommunication and Telepresence Applications
The 16th International Conference on Quality of Multimedia Experience (QoMEX 2024)
Karlshamn, Sweden 🇸🇪


Light field is a promising technology that enables the capture and reproduction of real light rays from the scene. There are still many challenges in transmitting and reproducing light field data. This paper proposes a spherical light field dataset that can be used as a foundation for developing telepresence applications. The Spherical Light Field Database (SLFDB) consists of a light field of 60 views captured with an omnidirectional camera in 20 scenes. To show the usefulness of the proposed database, we provide two use cases: compression and viewpoint estimation.

Paper

EL 2024

A Deep Learning based Light Field Image Compression as Pseudo Video Sequences with Additional in-loop Filtering
3D Imaging and Applications 2024 | IS&T International Symposium on Electronic Imaging 2024
San Francisco, USA 🇺🇸
Image

In this paper, we introduce a deep learning video compression network, adapted and optimized specifically for LF image compression. We enhance this network by incorporating an in-loop filtering block, along with additional adjustments and fine-tuning. By treating LF images as pseudo video sequences and deploying our adapted network, we manage to address challenges presented by the unique features of LF images, such as high resolution and large data sizes.

Paper

CTRO 2025

Fully automatic reconstruction of prostate high-dose-rate brachytherapy interstitial needles using two-phase deep learning-based segmentation and object tracking algorithms
Clinical and Translational Radiation Oncology 2025
Image

In this paper, we present a two-phase deep learning method to automatically reconstruct prostate HDR brachytherapy needles from CT images. The first phase segments the needles using a Pix2Pix GAN, while the second employs a GOTURN tracker to predict their 3D trajectories. Trained and evaluated on clinical datasets, our approach achieved high segmentation accuracy and sub-millimeter localization error. By automating the time-consuming manual reconstruction process, this method has the potential to improve treatment planning efficiency and consistency in clinical practice.

Paper

arXiv

On-Chip Learning with Memristor-Based Neural Networks: Assessing Accuracy and Efficiency Under Device Variations, Conductance Errors, and Input Noise
arXiv
Image

In this paper, we design and evaluate a memristor-based neural network hardware accelerator capable of on-chip training and inference. Using realistic SPICE models of silver-based memristors, we account for device variations, conductance errors, and noisy inputs. Our system, implemented with 30 memristors and 4 neurons, performs binary image classification while maintaining high accuracy and efficiency under non-ideal conditions. Results show that training with moderate noise improves robustness, and chromium-based memristors achieve the best balance of speed and energy efficiency. This work highlights the potential of memristor-based architectures for low-power, reliable edge computing applications.

Paper

MWSCAS 2024

Hardware Implementation of Memristor-Based in-Memory Computing for Classification Tasks
IEEE 67th International Midwest Symposium on Circuits and Systems (MWSCAS) 2024
Image

In this paper, we develop a hardware implementation of a memristor-based in-memory computing system for classification tasks. By integrating real memristor models into the Proteus simulation environment, we design a two-layer neural network with 30 memristors and 4 neurons, controlled by an Arduino microcontroller. An on-chip training algorithm is introduced to tune memristor conductance values, enabling efficient learning directly on the hardware. Tested on simple image classification tasks, the system achieves up to 97% accuracy with noisy inputs, demonstrating both robustness and practicality. This work provides a foundation for scalable memristor-based neural networks in edge computing.

Paper

IHBMC 2020

How Can Deep Learning Track Brain Metastasis Using Convolutional Neural Network?
7th Iranian Human Brain Mapping congress-9-12 Nov 2020
Image

In this paper, we propose a convolutional neural network (CNN)-based approach to track brain metastases from MRI data. Using three-dimensional T1-weighted MRI scans from 74 patients with metastases originating from breast, lung, prostate, and melanoma cancers, we trained and tested a CNN model to detect and trace tumor regions slice by slice. The network was evaluated against established methods such as Siam-FC and RT-MDNet, showing promising results in terms of success, precision, and processing speed. While initial slice detection remains challenging, the method demonstrates strong potential to assist radiologists in accurate diagnosis and reduce human error in clinical practice.

Skills

Languages

C
C++
Python
Java
JavaScript
MySQL
Visual Basic
MATLAB
Bash

Tools & Libraries

PyTorch
TensorFlow
OpenCV
CompressAI
Git
GitHub
Docker
HPC
VS Code/Cursor
LLMs

AI & Imaging

Light Field Compression
Variational Autoencoders
Feature Disentangling
RD Optimization
Deep Learning for Vision
Latent Entropy Models
VQ & Bitstream Design
QoE & Perceptual Metric
AR/VR Systems
GAN-based Enhancement

Research

Light Field Imaging & Compression
Deep Learning for Vision
Representation & Disentanglement
VAEs & Latent Modeling
Computer Vision & Understanding
Rate-Distortion & Entropy Coding
Machine Learning & AI
QoE & Perceptual Quality
Work Experience

Jan 2022

to

Current

Doctoral Researcher in Video/Image Compression
Mid Sweden University
Sundsvall, Sweden 🇸🇪
&
Université de Rennes I (Under INRIA)
Rennes, France 🇫🇷
Funding: (EU Project Horizon)
  • My main research work focuses on compression of multidimensional data including light field video and images.

  • The specialization of our research group is in the area of Representation of Light Fields, Compression, and Rendering

March 2025

to

May 2025

PhD Research Intern
RISE
Stockholm, Sweden 🇸🇪
  • Worked at RISE for 2 months as an research intern

  • Worked on adapting high-dimensional visual data pipelines for AR/VR environments, collaborating with Toyota-affiliated researchers

  • Key areas of focus were introducing and optimizing the VAE based end-to-end compression methods, and integrating learned light field commpression into immersive telepresence and automative systems

Jun 2020

to

Jun 2021

Computer Vision R&D Research Assistant
Shahid Beheshti University
Tehran, Iran 🇮🇷
  • Developed deep learning models for visual object tracking and video-based occlusion handling.

  • Designed segmentation and classification pipelines for traffic monitoring systems

  • Supported student projects in real-time computer vision.

Aug 2016

to

Jan 2022

Technical Engineer - Optical Networking and IoT
FiberHome Technologies
Tehran, Iran 🇮🇷
  • Led deployment of POTN/OTN and DWDM optical systems and managed end-to-end IoT applications.

  • Configured OTNM2000 platforms, designed high-capacity fiber networks, and implemented DCN using Cisco tools for national-scale infrastructureKey point 2

Education

2019 - 2021

Master's Degree
M.Sc. in Digital Electronic Engineering

Shahid Beheshti University

2009 - 2013

Bachelor's Degree
B.Sc. in Electrical Engineering

Shahid Beheshti University

Other Activity
  • Reviewer: IEEE Transactions on Multimedia (2025 - Present)
  • Reviewer: ACM Multimedia (2025 - Present)
  • Reviewer: IEEE Access (2025 - Present)
  • Reviewer: QoMEX (2024 - Present)