Updated on 2024.11.27
Usage instructions: here
Point Cloud Compression
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-11-18 | Rendering-Oriented 3D Point Cloud Attribute Compression using Sparse Tensor-based Transformer | Xiao Huo et.al. | 2411.07899 | null |
2024-11-09 | Linear Spherical Sliced Optimal Transport: A Fast Metric for Comparing Spherical Data | Xinran Liu et.al. | 2411.06055 | null |
2024-11-01 | PLATYPUS: Progressive Local Surface Estimator for Arbitrary-Scale Point Cloud Upsampling | Donghyun Kim et.al. | 2411.00432 | null |
2024-10-28 | Quality Analysis of the Coding Bitrate Tradeoff Between Geometry and Attributes for Colored Point Clouds | Joao Prazeres et.al. | 2410.21613 | null |
2024-10-09 | Point Cloud Compression with Bits-back Coding | Nguyen Quang Hieu et.al. | 2410.18115 | null |
2024-10-23 | Att2CPC: Attention-Guided Lossy Attribute Compression of Point Clouds | Kai Liu et.al. | 2410.17823 | link |
2024-10-22 | Joint Point Cloud Upsampling and Cleaning with Octree-based CNNs | Jihe Li et.al. | 2410.17001 | link |
2024-10-21 | MBPU: A Plug-and-Play State Space Model for Point Cloud Upsamping with Fast Point Rendering | Jiayi Song et.al. | 2410.15941 | null |
2024-10-13 | Towards Reproducible Learning-based Compression | Jiahao Pang et.al. | 2410.09872 | null |
2024-10-06 | Tensor-Train Point Cloud Compression and Efficient Approximate Nearest-Neighbor Search | Georgii Novikov et.al. | 2410.04462 | null |
2024-10-01 | Can We Remove the Ground? Obstacle-aware Point Cloud Compression for Remote Object Detection | Pengxi Zeng et.al. | 2410.00582 | null |
2024-09-19 | PVContext: Hybrid Context Model for Point Cloud Compression | Guoqing Zhang et.al. | 2409.12724 | null |
2024-09-12 | The JPEG Pleno Learning-based Point Cloud Coding Standard: Serving Man and Machine | André F. R. Guarda et.al. | 2409.08130 | null |
2024-09-08 | GET-UP: GEomeTric-aware Depth Estimation with Radar Points UPsampling | Huawei Sun et.al. | 2409.02720 | link |
2024-09-03 | GaussianPU: A Hybrid 2D-3D Upsampling Framework for Enhancing Color Point Clouds via 3D Gaussian Splatting | Zixuan Guo et.al. | 2409.01581 | null |
2024-08-20 | End-to-end learned Lossy Dynamic Point Cloud Attribute Compression | Dat Thanh Nguyen et.al. | 2408.10665 | null |
2024-08-20 | Diff-PCC: Diffusion-based Neural Compression for 3D Point Clouds | Kai Liu et.al. | 2408.10543 | null |
2024-08-16 | LLM-PCGC: Large Language Model-based Point Cloud Geometry Compression | Yuqi Ye et.al. | 2408.08682 | null |
2024-08-06 | Fast Point Cloud Geometry Compression with Context-based Residual Coding and INR-based Refinement | Hao Xu et.al. | 2408.02966 | null |
2024-08-01 | Learned Compression of Point Cloud Geometry and Attributes in a Single Model through Multimodal Rate-Control | Michael Rudolph et.al. | 2408.00599 | null |
2024-07-22 | Double Deep Learning-based Event Data Coding and Classification | Abdelrahman Seleem et.al. | 2407.15531 | null |
2024-07-11 | Enhancing octree-based context models for point cloud geometry compression with attention-based child node number prediction | Chang Sun et.al. | 2407.08528 | null |
2024-07-11 | Enhancing context models for point cloud geometry compression with context feature residuals and multi-loss | Chang Sun et.al. | 2407.08520 | null |
2024-07-19 | PCAC-GAN: A Sparse-Tensor-Based Generative Adversarial Network for 3D Point Cloud Attribute Compression | Xiaolong Mao et.al. | 2407.05677 | null |
2024-07-05 | Rethinking Data Input for Point Cloud Upsampling | Tongxu Zhang et.al. | 2407.04476 | null |
2024-08-26 | TSC-PCAC: Voxel Transformer and Sparse Convolution Based Point Cloud Attribute Compression for 3D Broadcasting | Zixi Guo et.al. | 2407.04284 | link |
2024-06-15 | Full reference point cloud quality assessment using support vector regression | Ryosuke Watanabe et.al. | 2406.10520 | link |
2024-09-25 | Bits-to-Photon: End-to-End Learned Scalable Point Cloud Compression for Direct Rendering | Yueyu Hu et.al. | 2406.05915 | null |
2024-06-02 | Towards Point Cloud Compression for Machine Perception: A Simple and Strong Baseline by Learning the Octree Depth Level Predictor | Lei Liu et.al. | 2406.00791 | null |
2024-05-23 | NeuroGauss4D-PCI: 4D Neural Fields and Gaussian Deformation Fields for Point Cloud Interpolation | Chaokang Jiang et.al. | 2405.14241 | link |
2024-05-19 | Point Cloud Compression with Implicit Neural Representations: A Unified Framework | Hongning Ruan et.al. | 2405.11493 | null |
2024-05-02 | PointCompress3D – A Point Cloud Compression Framework for Roadside LiDARs in Intelligent Transportation Systems | Walter Zimmer et.al. | 2405.01750 | null |
2024-04-21 | Pointsoup: High-Performance and Extremely Low-Decoding-Latency Learned Geometry Codec for Large-Scale Point Cloud Scenes | Kang You et.al. | 2404.13550 | link |
2024-04-16 | Reward Learning from Suboptimal Demonstrations with Applications in Surgical Electrocautery | Zohre Karimi et.al. | 2404.07185 | null |
2024-04-10 | Efficient and Generic Point Model for Lossless Point Cloud Attribute Compression | Kang You et.al. | 2404.06936 | link |
2024-04-09 | Diffusion-Based Point Cloud Super-Resolution for mmWave Radar Data | Kai Luan et.al. | 2404.06012 | null |
2024-03-13 | Point Cloud Compression via Constrained Optimal Transport | Zezeng Li et.al. | 2403.08236 | link |
2024-03-08 | Arbitrary-Scale Point Cloud Upsampling by Voxel-Based Network with Latent Geometric-Consistent Learning | Hang Du et.al. | 2403.05117 | link |
2024-03-01 | Assessing objective quality metrics for JPEG and MPEG point cloud coding | Davi Lazzarotto et.al. | 2403.00410 | null |
2024-02-23 | Scalable Human-Machine Point Cloud Compression | Mateen Ulhaq et.al. | 2402.12532 | link |
2024-02-18 | 3D Point Cloud Compression with Recurrent Neural Network and Image Compression Methods | Till Beemelmanns et.al. | 2402.11680 | link |
2024-02-17 | Hierarchical Prior-based Super Resolution for Point Cloud Geometry Compression | Dingquan Li et.al. | 2402.11250 | link |
2024-02-11 | PIVOT-Net: Heterogeneous Point-Voxel-Tree-based Framework for Point Cloud Compression | Jiahao Pang et.al. | 2402.07243 | null |
2024-02-07 | Performance analysis of Deep Learning-based Lossy Point Cloud Geometry Compression Coding Solutions | Joao Prazeres et.al. | 2402.05192 | null |
2024-02-08 | Subjective performance evaluation of bitrate allocation strategies for MPEG and JPEG Pleno point cloud compression | Davi Lazzarotto et.al. | 2402.04760 | null |
2024-02-15 | LiDAR-Forest Dataset: LiDAR Point Cloud Simulation Dataset for Forestry Application | Yawen Lu et.al. | 2402.04546 | null |
2023-12-23 | Learning Continuous Implicit Field with Local Distance Indicator for Arbitrary-Scale Point Cloud Upsampling | Shujuan Li et.al. | 2312.15133 | null |
2024-03-13 | DiffPMAE: Diffusion Masked Autoencoders for Point Cloud Reconstruction | Yanlong Li et.al. | 2312.03298 | link |
2023-12-03 | A Conditional Denoising Diffusion Probabilistic Model for Point Cloud Upsampling | Wentao Qu et.al. | 2312.02719 | link |
2023-11-22 | Learned Nonlinear Predictor for Critically Sampled 3D Point Cloud Attribute Compression | Tam Thuc Do et.al. | 2311.13539 | null |
2023-11-22 | Volumetric 3D Point Cloud Attribute Compression: Learned polynomial bilateral filter for prediction | Tam Thuc Do et.al. | 2311.13533 | null |
2023-11-22 | Test-Time Augmentation for 3D Point Cloud Classification and Segmentation | Tuan-Anh Vu et.al. | 2311.13152 | null |
2023-11-03 | PDF: Point Diffusion Implicit Function for Large-scale Scene Neural Representation | Yuhan Ding et.al. | 2311.01773 | null |
2023-11-02 | Lightweight super resolution network for point cloud geometry compression | Wei Zhang et.al. | 2311.00970 | link |
2023-11-17 | Deep Learning-based Compressed Domain Multimedia for Man and Machine: A Taxonomy and Application to Point Cloud Classification | Abdelrahman Seleem et.al. | 2310.18849 | null |
2023-10-13 | iPUNet:Iterative Cross Field Guided Point Cloud Upsampling | Guangshun Wei et.al. | 2310.09092 | link |
2024-03-15 | PU-Ray: Domain-Independent Point Cloud Upsampling via Ray Marching on Neural Implicit Surface | Sangwon Lim et.al. | 2310.08755 | link |
2024-02-16 | Quasi-Monte Carlo for 3D Sliced Wasserstein | Khai Nguyen et.al. | 2309.11713 | link |
2023-09-08 | Poster: Making Edge-assisted LiDAR Perceptions Robust to Lossy Point Cloud Compression | Jin Heo et.al. | 2309.04549 | null |
2023-09-01 | Test-Time Adaptation for Point Cloud Upsampling Using Meta-Learning | Ahmed Hatem et.al. | 2308.16484 | null |
2024-02-08 | SCP: Spherical-Coordinate-based Learned Point Cloud Compression | Ao Luo et.al. | 2308.12535 | null |
2023-08-22 | Learning a More Continuous Zero Level Set in Unsigned Distance Fields through Level Set Projection | Junsheng Zhou et.al. | 2308.11441 | link |
2023-08-11 | Learned Point Cloud Compression for Classification | Mateen Ulhaq et.al. | 2308.05959 | link |
2023-07-27 | FLiCR: A Fast and Lightweight LiDAR Point Cloud Compression Based on Lossy RI | Jin Heo et.al. | 2307.15005 | null |
2023-07-20 | Aggressive saliency-aware point cloud compression | Eleftheria Psatha et.al. | 2307.10741 | null |
2023-07-18 | Arbitrary point cloud upsampling via Dual Back-Projection Network | Zhi-Song Liu et.al. | 2307.08992 | null |
2023-06-01 | 4DSR-GCN: 4D Video Point Cloud Upsampling using Graph Convolutional Networks | Lorenzo Berlincioni et.al. | 2306.01081 | null |
2023-05-16 | Learning Dynamic Point Cloud Compression via Hierarchical Inter-frame Block Matching | Shuting Xia et.al. | 2305.05356 | null |
2023-05-02 | Geometric Prior Based Deep Human Point Cloud Geometry Compression | Xinju Wu et.al. | 2305.01309 | null |
2023-05-02 | PU-EdgeFormer: Edge Transformer for Dense Prediction in Point Cloud Upsampling | Dohoon Kim et.al. | 2305.01148 | link |
2023-04-24 | Grad-PU: Arbitrary-Scale Point Cloud Upsampling via Gradient Descent with Learned Distance Functions | Yun He et.al. | 2304.11846 | link |
2023-04-01 | Volumetric Attribute Compression for 3D Point Clouds using Feedforward Network with Geometric Attention | Tam Thuc Do et.al. | 2304.00335 | null |
2023-03-27 | NeuralPCI: Spatio-temporal Neural Field for 3D Point Cloud Multi-frame Non-linear Interpolation | Zehan Zheng et.al. | 2303.15126 | link |
2023-11-07 | GQE-Net: A Graph-based Quality Enhancement Network for Point Cloud Color Attribute | Jinrui Xing et.al. | 2303.13764 | link |
2023-03-22 | Lossless Point Cloud Attribute Compression Using Cross-scale, Cross-group, and Cross-color Prediction | Jianqiang Wang et.al. | 2303.12917 | null |
2023-12-28 | Progressive Frame Patching for FoV-based Point Cloud Video Streaming | Tongyu Zong et.al. | 2303.08336 | null |
2023-12-03 | Parametric Surface Constrained Upsampler Network for Point Cloud | Pingping Cai et.al. | 2303.08240 | link |
2024-03-20 | Lossless Point Cloud Geometry and Attribute Compression Using a Learned Conditional Probability Model | Dat Thanh Nguyen et.al. | 2303.06519 | link |
2023-03-11 | Deep probabilistic model for lossless scalable point cloud attribute compression | Dat Thanh Nguyen et.al. | 2303.06517 | null |
2023-03-09 | BIRD-PCC: Bi-directional Range Image-based Deep LiDAR Point Cloud Compression | Chia-Sheng Liu et.al. | 2303.04027 | null |
2023-02-13 | gpcgc: a green point cloud geometry coding method | Qingyang Zhou et.al. | 2302.06062 | null |
2023-02-09 | BASICS: Broad quality Assessment of Static point clouds In Compression Scenarios | Ali Ak et.al. | 2302.04796 | null |
2023-04-27 | Linear Optimal Partial Transport Embedding | Yikun Bai et.al. | 2302.03232 | link |
2023-01-31 | Lidar Upsampling with Sliced Wasserstein Distance | Artem Savkin et.al. | 2301.13558 | null |
2023-01-28 | Dynamic Point Cloud Geometry Compression Using Multiscale Inter Conditional Coding | Jianqiang Wang et.al. | 2301.12165 | null |
2023-01-27 | Joint Geometry and Attribute Upsampling of Point Clouds Using Frequency-Selective Models with Overlapped Support | Viktoria Heimann et.al. | 2301.11630 | null |
2023-01-03 | Reduced Reference Quality Assessment for Point Cloud Compression | Yipeng Liu et.al. | 2301.01009 | null |
2023-04-06 | Neural Shape Compiler: A Unified Framework for Transforming between Text, Point Cloud, and Program | Tiange Luo et.al. | 2212.12952 | null |
2022-12-11 | Learning Neural Volumetric Field for Point Cloud Geometry Compression | Yueyu Hu et.al. | 2212.05589 | link |
2022-12-01 | Low-Rank Tensor Function Representation for Multi-Dimensional Data Recovery | Yisi Luo et.al. | 2212.00262 | null |
2023-12-09 | ECM-OPCC: Efficient Context Model for Octree-based Point Cloud Compression | Yiqi Jin et.al. | 2211.10916 | null |
2022-11-19 | Rate-Distortion Modeling for Bit Rate Constrained Point Cloud Compression | Pan Gao et.al. | 2211.10646 | null |
2022-10-21 | Motion Policy Networks | Adam Fishman et.al. | 2210.12209 | link |
2022-10-28 | Motion estimation and filtered prediction for dynamic point cloud attribute compression | Haoran Hong et.al. | 2210.08262 | null |
2022-10-08 | Point Cloud Upsampling via Cascaded Refinement Network | Hang Du et.al. | 2210.03942 | link |
2023-02-14 | Multiscale Latent-Guided Entropy Model for LiDAR Point Cloud Compression | Tingyu Fan et.al. | 2209.12512 | null |
2022-09-17 | CARNet:Compression Artifact Reduction for Point Cloud Attribute | Dandan Ding et.al. | 2209.08276 | null |
2022-11-16 | CU-Net: Real-Time High-Fidelity Color Upsampling for Point Clouds | Lingdong Wang et.al. | 2209.06112 | link |
2022-09-09 | GRASP-Net: Geometric Residual Analysis and Synthesis for Point Cloud Compression | Jiahao Pang et.al. | 2209.04401 | link |
2022-09-06 | Learning to Predict on Octree for Scalable Point Cloud Geometry Coding | Yixiang Mao et.al. | 2209.02226 | null |
2022-08-26 | Efficient LiDAR Point Cloud Geometry Compression Through Neighborhood Point Attention | Ruixiang Xue et.al. | 2208.12573 | null |
2022-08-17 | Efficient dynamic point cloud coding using Slice-Wise Segmentation | Faranak Tohidi et.al. | 2208.08061 | null |
2023-01-10 | Arbitrary Point Cloud Upsampling with Spherical Mixture of Gaussians | Anthony Dell’Eva et.al. | 2208.05274 | link |
2022-08-04 | IT/IST/IPLeiria Response to the Call for Proposals on JPEG Pleno Point Cloud Coding | André F. R. Guarda et.al. | 2208.02716 | null |
2022-08-04 | IPDAE: Improved Patch-Based Deep Autoencoder for Lossy Point Cloud Geometry Compression | Kang You et.al. | 2208.02519 | link |
2022-07-25 | Inter-Frame Compression for Dynamic Point Cloud Geometry Coding | Anique Akhtar et.al. | 2207.12554 | null |
2022-07-20 | GIPSO: Geometrically Informed Propagation for Online Adaptation in 3D LiDAR Segmentation | Cristiano Saltori et.al. | 2207.09763 | link |
2022-06-25 | BIMS-PU: Bi-Directional and Multi-Scale Point Cloud Upsampling | Yechao Bai et.al. | 2206.12648 | null |
2022-06-24 | Rate-Distortion Optimal Transform Coefficient Selection for Unoccupied Regions in Video-Based Point Cloud Compression | Christian Herglotz et.al. | 2206.12186 | null |
2022-05-24 | A Rate Control Algorithm for Video-based Point Cloud Compression | Fangyu Shen et.al. | 2205.11825 | null |
2022-05-19 | A Comparative Study of Feature Expansion Unit for 3D Point Cloud Upsampling | Qiang Li et.al. | 2205.09594 | null |
2022-05-02 | D-DPCC: Deep Dynamic Point Cloud Compression via 3D Motion Prediction | Tingyu Fan et.al. | 2205.01135 | link |
2022-05-02 | Point Cloud Compression with Sibling Context and Surface Priors | Zhili Chen et.al. | 2205.00760 | link |
2022-04-29 | Deep Geometry Post-Processing for Decompressed Point Clouds | Xiaoqing Fan et.al. | 2204.13952 | link |
2022-04-27 | Density-preserving Deep Point Cloud Compression | Yun He et.al. | 2204.12684 | null |
2022-04-25 | 4DAC: Learning Attribute Compression for Dynamic Point Clouds | Guangchi Fang et.al. | 2204.11723 | null |
2022-04-25 | Dynamic Point Cloud Compression with Cross-Sectional Approach | Faranak Tohidi et.al. | 2204.11409 | null |
2022-04-22 | PU-EVA: An Edge Vector based Approximation Solution for Flexible-scale Point Cloud Upsampling | Luqing Luo et.al. | 2204.10750 | null |
2022-04-18 | Self-Supervised Arbitrary-Scale Point Clouds Upsampling via Implicit Neural Representation | Wenbo Zhao et.al. | 2204.08196 | link |
2022-06-22 | Learning-based Lossless Point Cloud Geometry Coding using Sparse Tensors | Dat Thanh Nguyen et.al. | 2204.05043 | null |
2022-04-03 | Sparse Tensor-based Point Cloud Attribute Compression | Jianqiang Wang et.al. | 2204.01023 | link |
2022-03-22 | IDEA-Net: Dynamic 3D Point Cloud Interpolation via Deep Embedding Alignment | Yiming Zeng et.al. | 2203.11590 | link |
2022-03-21 | Upsampling Autoencoder for Self-Supervised Point Cloud Learning | Cheng Zhang et.al. | 2203.10768 | null |
2022-05-03 | Frequency-Selective Mesh-to-Mesh Resampling for Color Upsampling of Point Clouds | Viktoria Heimann et.al. | 2203.09224 | null |
2022-03-02 | PUFA-GAN: A Frequency-Aware Generative Adversarial Network for 3D Point Cloud Upsampling | Hao Liu et.al. | 2203.00914 | null |
2022-05-16 | Variable Rate Compression for Raw 3D Point Clouds | Md Ahmed Al Muzaddid et.al. | 2202.13862 | link |
2022-09-14 | Point cloud completion via structured feature maps using a feedback network | Zejia Su et.al. | 2202.08583 | null |
2022-05-08 | OctAttention: Octree-Based Large-Scale Contexts Model for Point Cloud Compression | Chunyang Fu et.al. | 2202.06028 | link |
2022-02-01 | Point Cloud Compression for Efficient Data Broadcasting: A Performance Comparison | Francesco Nardo et.al. | 2202.00719 | null |
2022-02-01 | Fractional Motion Estimation for Point Cloud Compression | Haoran Hong et.al. | 2202.00172 | null |
2022-01-17 | SimIPU: Simple 2D Image and 3D Point Cloud Unsupervised Pre-Training for Spatial-Aware Visual Representations | Zhenyu Li et.al. | 2112.04680 | link |
2022-03-31 | Neural Points: Point Cloud Representation with Neural Fields for Arbitrary Upsampling | Wanquan Feng et.al. | 2112.04148 | link |
2022-03-01 | Attribute Artifacts Removal for Geometry-based Point Cloud Compression | Xihua Sheng et.al. | 2112.00560 | null |
2022-10-03 | PU-Transformer: Point Cloud Upsampling Transformer | Shi Qiu et.al. | 2111.12242 | link |
2022-10-21 | Sparse Tensor-based Multiscale Representation for Point Cloud Geometry Compression | Jianqiang Wang et.al. | 2111.10633 | link |
2021-10-18 | Patch-Based Deep Autoencoder for Point Cloud Geometry Compression | Kang You et.al. | 2110.09109 | link |
2022-07-12 | PC $^2$ -PU: Patch Correlation and Point Correlation for Effective Point Cloud Upsampling | Chen Long et.al. | 2109.09337 | link |
2021-09-16 | R-PCC: A Baseline for Range Image-based Point Cloud Compression | Sukai Wang et.al. | 2109.07717 | link |
2021-09-15 | Which One is Better: Assessing Objective Metrics for Point Cloud Compression | Yipeng Liu et.al. | 2109.07158 | null |
2021-08-05 | Joint Geometry and Color Projection-based Point Cloud Quality Metric | Alireza Javaheri et.al. | 2108.02481 | link |
2021-08-03 | SSPU-Net: Self-Supervised Point Cloud Upsampling via Differentiable Rendering | Yifan Zhao et.al. | 2108.00454 | link |
2021-07-29 | Video-based Point Cloud Compression Artifact Removal | Anique Akhtar et.al. | 2107.14179 | null |
2024-02-28 | Score-Based Point Cloud Denoising | Shitong Luo et.al. | 2107.10981 | link |
2022-06-08 | PU-Flow: a Point Cloud Upsampling Network with Normalizing Flows | Aihua Mao et.al. | 2107.05893 | link |
2022-04-18 | “Zero-Shot” Point Cloud Upsampling | Kaiyue Zhou et.al. | 2106.13765 | link |
2021-06-23 | Lossless Point Cloud Attribute Compression with Normal-based Intra Prediction | Qian Yin et.al. | 2106.12236 | null |
2021-06-21 | Cylindrical coordinates for LiDAR point cloud compression | Shashank N. Sridhara et.al. | 2106.11237 | null |
2021-10-11 | Neural Network Modeling of Probabilities for Coding the Octree Representation of Point Clouds | Emre Can Kaya et.al. | 2106.06482 | link |
2021-06-09 | Point Cloud Upsampling via Disentangled Refinement | Ruihui Li et.al. | 2106.04779 | link |
2021-06-02 | DeepCompress: Efficient Point Cloud Geometry Compression | Ryan Killea et.al. | 2106.01504 | null |
2021-06-01 | RAI-Net: Range-Adaptive LiDAR Point Cloud Frame Interpolation Network | Lili Zhao et.al. | 2106.00496 | null |
2021-05-28 | An Unsupervised Optical Flow Estimation For LiDAR Image Sequences | Xuezhou Guo et.al. | 2105.13879 | null |
2021-05-05 | VoxelContext-Net: An Octree based Framework for Point Cloud Compression | Zizheng Que et.al. | 2105.02158 | null |
2021-04-20 | Multiscale deep context modeling for lossless point cloud geometry compression | Dat Thanh Nguyen et.al. | 2104.09859 | link |
2021-04-12 | Towards Efficient Graph Convolutional Networks for Point Cloud Handling | Yawei Li et.al. | 2104.05706 | null |
2021-03-11 | Advanced Geometry Surface Coding for Dynamic Point Cloud Compression | Jian Xiong et.al. | 2103.06549 | null |
2021-03-05 | Hybrid Point Cloud Semantic Compression for Automotive Sensors: A Performance Evaluation | Andrea Varischio et.al. | 2103.03819 | null |
2021-02-26 | Point Cloud Upsampling and Normal Estimation using Deep Learning for Robust Surface Reconstruction | Rajat Sharma et.al. | 2102.13391 | link |
2021-02-25 | A deep perceptual metric for 3D point clouds | Maurice Quach et.al. | 2102.12839 | link |
2021-02-08 | Meta-PU: An Arbitrary-Scale Upsampling Network for Point Cloud | Shuquan Ye et.al. | 2102.04317 | null |
2020-12-15 | NeuralQAAD: An Efficient Differentiable Framework for High Resolution Point Cloud Compression | Nicolas Wagner et.al. | 2012.08143 | null |
2022-06-11 | SPU-Net: Self-Supervised Point Cloud Upsampling by Coarse-to-Fine Reconstruction with Self-Projection Optimization | Xinhai Liu et.al. | 2012.04439 | link |
2021-11-18 | Vehicular Cooperative Perception Through Action Branching and Federated Reinforcement Learning | Mohamed K. Abdel-Aziz et.al. | 2012.03414 | null |
2020-12-05 | ParaNet: Deep Regular Representation for 3D Point Clouds | Qijian Zhang et.al. | 2012.03028 | null |
2020-11-27 | Spherical Interpolated Convolutional Network with Distance-Feature Density for 3D Semantic Segmentation of Point Clouds | Guangming Wang et.al. | 2011.13784 | null |
2020-11-25 | Reduced Reference Perceptual Quality Model and Application to Rate Control for 3D Point Cloud Compression | Qi Liu et.al. | 2011.12688 | null |
2020-11-07 | Multiscale Point Cloud Geometry Compression | Jianqiang Wang et.al. | 2011.03799 | link |
2020-10-29 | Point Cloud Attribute Compression via Successive Subspace Graph Transform | Yueru Chen et.al. | 2010.15302 | null |
2020-08-16 | Real-Time Spatio-Temporal LiDAR Point Cloud Compression | Yu Feng et.al. | 2008.06972 | link |
2021-08-03 | Subjective Quality Database and Objective Study of Compressed Point Clouds With 6DoF Head-Mounted Display | Xinju Wu et.al. | 2008.02501 | null |
2020-06-20 | Pseudo-LiDAR Point Cloud Interpolation Based on 3D Motion Representation and Spatial Supervision | Haojie Liu et.al. | 2006.11481 | null |
2020-06-24 | Improved Deep Point Cloud Geometry Compression | Maurice Quach et.al. | 2006.09043 | link |
2020-04-03 | Intrinsic Point Cloud Interpolation via Dual Latent Space Navigation | Marie-Julie Rakotosaona et.al. | 2004.01661 | link |
2020-03-30 | A generalized Hausdorff distance based quality metric for point cloud geometry | Alireza Javaheri et.al. | 2003.13669 | null |
2020-03-30 | Optimizing Geometry Compression using Quantum Annealing | Sebastian Feld et.al. | 2003.13253 | null |
2020-03-27 | Model-based Joint Bit Allocation between Geometry and Color for Video-based 3D Point Cloud Compression | Qi Liu et.al. | 2002.10798 | null |
2020-03-07 | PUGeo-Net: A Geometry-centric Network for 3D Point Cloud Upsampling | Yue Qian et.al. | 2002.10277 | null |
2020-06-22 | Folding-based compression of point cloud attributes | Maurice Quach et.al. | 2002.04439 | null |
2020-01-13 | Efficient 3D Road Map Data Exchange for Intelligent Vehicles in Vehicular Fog Networks | Ivan Wang-Hei Ho et.al. | 2001.04057 | null |
2020-01-12 | Linear Model based Geometry Coding for Lidar Acquired Point Clouds | Xiang Zhang et.al. | 2001.03871 | null |
2021-04-09 | PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection | Shaoshuai Shi et.al. | 1912.13192 | link |
2019-12-20 | A Comprehensive Study and Comparison of Core Technologies for MPEG 3D Point Cloud Compression | Hao Liu et.al. | 1912.09674 | null |
2020-10-15 | Point Cloud Rendering after Coding: Impacts on Subjective and Objective Quality | Alireza Javaheri et.al. | 1912.09137 | null |
2021-03-29 | PU-GCN: Point Cloud Upsampling using Graph Convolutional Networks | Guocheng Qian et.al. | 1912.03264 | link |
2019-11-04 | Video-based compression for plenoptic point clouds | Li Li et.al. | 1911.01355 | null |
2019-09-26 | Learned Point Cloud Geometry Compression | Jianqiang Wang et.al. | 1909.12037 | link |
2019-09-16 | PLIN: A Network for Pseudo-LiDAR Point Cloud Interpolation | Haojie Liu et.al. | 1909.07137 | null |
2019-08-17 | 3D Point Cloud Super-Resolution via Graph Total Variation on Surface Normals | Chinthaka Dinesh et.al. | 1908.06261 | null |
2019-08-06 | Point Cloud Super Resolution with Adversarial Residual Graph Networks | Huikai Wu et.al. | 1908.02111 | link |
2020-08-10 | Predictive Generalized Graph Fourier Transform for Attribute Compression of Dynamic Point Clouds | Yiqun Xu et.al. | 1908.01970 | null |
2019-07-25 | PU-GAN: a Point Cloud Upsampling Adversarial Network | Ruihui Li et.al. | 1907.10844 | null |
2019-06-27 | A Convolutional Decoder for Point Clouds using Adaptive Instance Normalization | Isaak Lim et.al. | 1906.11478 | null |
2019-04-18 | Deep AutoEncoder-based Lossy Geometry Compression for Point Clouds | Wei Yan et.al. | 1905.03691 | null |
2019-05-22 | Learning Convolutional Transforms for Lossy Point Cloud Geometry Compression | Maurice Quach et.al. | 1903.08548 | link |
2019-09-30 | Variational Graph Methods for Efficient Point Cloud Sparsification | Daniel Tenbrinck et.al. | 1903.02858 | null |
2019-03-05 | Pose Estimation of Vehicles Over Uneven Terrain | Yingchong Ma et.al. | 1903.02052 | null |
2019-02-11 | Occupancy-map-based rate distortion optimization for video-based point cloud compression | Li Li et.al. | 1902.04169 | null |
2018-09-30 | A Volumetric Approach to Point Cloud Compression | Maja Krivokuća et.al. | 1810.00484 | null |
2018-05-29 | Surface Light Field Compression using a Point Cloud Codec | Xiang Zhang et.al. | 1805.11203 | null |
2018-05-23 | Comments on “Compression of 3D Point Clouds Using a Region-Adaptive Hierarchical Transform” | Gustavo Sandri et.al. | 1805.09146 | null |
2018-04-28 | Hybrid Point Cloud Attribute Compression Using Slice-based Layered Structure and Block-based Intra Prediction | Yiting Shao et.al. | 1804.10783 | null |
2018-03-26 | PU-Net: Point Cloud Upsampling Network | Lequan Yu et.al. | 1801.06761 | link |
2017-10-10 | Attribute Compression of 3D Point Clouds Using Laplacian Sparsity Optimized Graph Transform | Yiting Shao et.al. | 1710.03532 | null |
2017-03-08 | Dynamic Polygon Clouds: Representation and Compression for VR/AR | Philip A. Chou et.al. | 1610.00402 | null |
Compression
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-11-26 | Evaluating the Overhead of the Performance Profiler Cloudprofiler With MooBench | Shinhyung Yang et.al. | 2411.17413 | null |
2024-11-26 | Motion Free B-frame Coding for Neural Video Compression | Van Thang Nguyen et.al. | 2411.17160 | null |
2024-11-23 | An Information-Theoretic Regularizer for Lossy Neural Image Compression | Yingwen Zhang et.al. | 2411.16727 | null |
2024-11-25 | WTDUN: Wavelet Tree-Structured Sampling and Deep Unfolding Network for Image Compressed Sensing | Kai Han et.al. | 2411.16336 | null |
2024-11-25 | Learning Optimal Lattice Vector Quantizers for End-to-end Neural Image Compression | Xi Zhang et.al. | 2411.16119 | null |
2024-11-25 | TransCompressor: LLM-Powered Multimodal Data Compression for Smart Transportation | Huanqi Yang et.al. | 2411.16020 | null |
2024-11-24 | Variable-size Symmetry-based Graph Fourier Transforms for image compression | Alessandro Gnutti et.al. | 2411.15824 | null |
2024-11-24 | M3-CVC: Controllable Video Compression with Multimodal Generative Models | Rui Wan et.al. | 2411.15798 | null |
2024-11-24 | Advanced Learning-Based Inter Prediction for Future Video Coding | Yanchen Zhao et.al. | 2411.15759 | null |
2024-11-24 | PEnG: Pose-Enhanced Geo-Localisation | Tavis Shore et.al. | 2411.15742 | null |
2024-11-21 | U-Motion: Learned Point Cloud Video Compression with U-Structured Motion Estimation | Tingyu Fan et.al. | 2411.14501 | null |
2024-11-21 | Differentiable SVD based on Moore-Penrose Pseudoinverse for Inverse Imaging Problems | Yinghao Zhang et.al. | 2411.14141 | link |
2024-11-21 | Compact Visual Data Representation for Green Multimedia – A Human Visual System Perspective | Peilin Chen et.al. | 2411.14135 | null |
2024-11-21 | Image Compression Using Novel View Synthesis Priors | Luyuan Peng et.al. | 2411.13862 | null |
2024-11-20 | Sparse Input View Synthesis: 3D Representations and Reliable Priors | Nagabhushan Somraj et.al. | 2411.13631 | null |
2024-11-20 | Benchmarking Quantum Convolutional Neural Networks for Classification and Data Compression Tasks | Jun Yong Khoo et.al. | 2411.13468 | null |
2024-11-20 | Practical Compact Deep Compressed Sensing | Bin Chen et.al. | 2411.13081 | link |
2024-11-20 | LMM-driven Semantic Image-Text Coding for Ultra Low-bitrate Learned Image Compression | Shimon Murai et.al. | 2411.13033 | link |
2024-11-22 | Large Language Models for Lossless Image Compression: Next-Pixel Prediction in Language Space is All You Need | Kecheng Chen et.al. | 2411.12448 | null |
2024-11-19 | Breathless: An 8-hour Performance Contrasting Human and Robot Expressiveness | Catie Cuan et.al. | 2411.12361 | null |
2024-11-18 | Variable Rate Neural Compression for Sparse Detector Data | Yi Huang et.al. | 2411.11942 | link |
2024-11-18 | Exploring adversarial robustness of JPEG AI: methodology, comparison and new methods | Egor Kovalev et.al. | 2411.11795 | null |
2024-11-18 | Additional Tests for TV 3.0 | Eduardo Peixoto et.al. | 2411.11755 | null |
2024-11-18 | Towards fast DBSCAN via Spectrum-Preserving Data Compression | Yongyu Wang et.al. | 2411.11421 | null |
2024-11-17 | BVI-CR: A Multi-View Human Dataset for Volumetric Video Compression | Ge Gao et.al. | 2411.11199 | null |
2024-11-16 | An End-to-End Real-World Camera Imaging Pipeline | Kepeng Xu et.al. | 2411.10773 | null |
2024-11-16 | Deep Learning-Based Image Compression for Wireless Communications: Impacts on Reliability,Throughput, and Latency | Mostafa Naseri et.al. | 2411.10650 | null |
2024-11-15 | Efficient Progressive Image Compression with Variance-aware Masking | Alberto Presta et.al. | 2411.10185 | link |
2024-11-15 | A Multi-Scale Spatial-Temporal Network for Wireless Video Transmission | Xinyi Zhou et.al. | 2411.09936 | null |
2024-11-14 | Application of signal separation to diffraction image compression and serial crystallography | Jérôme Kieffer et.al. | 2411.09515 | link |
2024-11-14 | DT-JRD: Deep Transformer based Just Recognizable Difference Prediction Model for Video Coding for Machines | Junqi Liu et.al. | 2411.09308 | null |
2024-11-14 | Towards efficient compression and communication for prototype-based decentralized learning | Pablo Fernández-Piñeiro et.al. | 2411.09267 | null |
2024-11-13 | Learning Optimal and Interpretable Summary Statistics of Galaxy Catalogs with SBI | Kai Lehman et.al. | 2411.08957 | null |
2024-11-13 | LSH-MoE: Communication-efficient MoE Training via Locality-Sensitive Hashing | Xiaonan Nie et.al. | 2411.08446 | null |
2024-11-18 | Rendering-Oriented 3D Point Cloud Attribute Compression using Sparse Tensor-based Transformer | Xiao Huo et.al. | 2411.07899 | null |
2024-11-11 | Accelerating radio astronomy imaging with RICK | Emanuele De Rubeis et.al. | 2411.07321 | link |
2024-11-11 | Low Complexity Learning-based Lossless Event-based Compression | Ahmadreza Sezavar et.al. | 2411.07155 | null |
2024-11-11 | JPEG AI Image Compression Visual Artifacts: Detection Methods and Dataset | Daria Tsereh et.al. | 2411.06810 | null |
2024-11-11 | Machine vision-aware quality metrics for compressed image and video assessment | Mikhail Dremin et.al. | 2411.06776 | null |
2024-11-11 | High-Frequency Enhanced Hybrid Neural Representation for Video Compression | Li Yu et.al. | 2411.06685 | null |
2024-11-09 | HiHa: Introducing Hierarchical Harmonic Decomposition to Implicit Neural Compression for Atmospheric Data | Zhewen Xu et.al. | 2411.06155 | null |
2024-11-08 | A method based on Generative Adversarial Networks for disentangling physical and chemical properties of stars in astronomical spectra | Raúl Santoveña et.al. | 2411.05960 | null |
2024-11-07 | Don’t Look Twice: Faster Video Transformers with Run-Length Tokenization | Rohan Choudhury et.al. | 2411.05222 | null |
2024-11-05 | Tuning into spatial frequency space: Satellite and space debris detection in the ZTF alert stream | J. P. Carvajal et.al. | 2411.03258 | null |
2024-11-15 | ZipCache: A DRAM/SSD Cache with Built-in Transparent Compression | Rui Xie et.al. | 2411.03174 | null |
2024-11-05 | Learning-based Lossless Event Data Compression | Ahmadreza Sezavar et.al. | 2411.03010 | null |
2024-11-04 | Neural optical flow for planar and stereo PIV | Andrew I. Masker et.al. | 2411.02373 | null |
2024-11-04 | The evolution of volumetric video: A survey of smart transcoding and compression approaches | Preetish Kakkar et.al. | 2411.02095 | null |
2024-11-03 | Efficient Deep Learning Infrastructures for Embedded Computing Systems: A Comprehensive Survey and Future Envision | Xiangzhong Luo et.al. | 2411.01431 | null |
2024-11-02 | Autoencoders for At-Source Data Reduction and Anomaly Detection in High Energy Particle Detectors | Alexander Yue et.al. | 2411.01118 | null |
2024-11-01 | SANN-PSZ: Spatially Adaptive Neural Network for Head-Tracked Personal Sound Zones | Yue Qiao et.al. | 2411.00772 | null |
2024-10-28 | MultiTok: Variable-Length Tokenization for Efficient LLMs Adapted from LZW Compression | Noel Elias et.al. | 2410.21548 | link |
2024-10-29 | Enhancing Learned Image Compression via Cross Window-based Attention | Priyanka Mudgal et.al. | 2410.21144 | null |
2024-10-26 | Cross-Platform Neural Video Coding: A Case Study | Ruhan Conceição et.al. | 2410.20145 | null |
2024-10-25 | Conditional Hallucinations for Image Compression | Till Aczel et.al. | 2410.19493 | null |
2024-10-29 | Integration of Communication and Computational Imaging | Zhenming Yu et.al. | 2410.19415 | null |
2024-10-24 | DMVC: Multi-Camera Video Compression Network aimed at Improving Deep Learning Accuracy | Huan Cui et.al. | 2410.18400 | null |
2024-10-23 | Predicting total time to compress a video corpus using online inference systems | Xin Shu et.al. | 2410.18260 | null |
2024-10-23 | FIPER: Generalizable Factorized Fields for Joint Image Compression and Super-Resolution | Yang-Che Sun et.al. | 2410.18083 | null |
2024-10-23 | Learning Lossless Compression for High Bit-Depth Volumetric Medical Image | Kai Wang et.al. | 2410.17814 | null |
2024-10-21 | Variable Rate Learned Wavelet Video Coding with Temporal Layer Adaptivity | Anna Meyer et.al. | 2410.15873 | link |
2024-10-20 | Extensions on low-complexity DCT approximations for larger blocklengths based on minimal angle similarity | A. P. Radünz et.al. | 2410.15244 | null |
2024-10-19 | Standardizing Generative Face Video Compression using Supplemental Enhancement Information | Bolin Chen et.al. | 2410.15105 | null |
2024-10-16 | MatryoshkaKV: Adaptive KV Compression via Trainable Orthogonal Projection | Bokai Lin et.al. | 2410.14731 | null |
2024-10-18 | Design and Prototype of a Unified Framework for Error-robust Compression and Encryption in IoT | Gajraj Kuldeep et.al. | 2410.14396 | null |
2024-10-18 | Compression using Discrete Multi-Level Divisor Transform for Heterogeneous Sensor Data | Gajraj Kuldeep et.al. | 2410.14287 | null |
2024-10-17 | In-context learning and Occam’s razor | Eric Elmoznino et.al. | 2410.14086 | link |
2024-10-17 | Co-Segmentation without any Pixel-level Supervision with Application to Large-Scale Sketch Classification | Nikolaos-Antonios Ypsilantis et.al. | 2410.13582 | null |
2024-10-16 | Test-time adaptation for image compression with distribution regularization | Kecheng Chen et.al. | 2410.12191 | null |
2024-10-16 | Joint Data Compression, Secure Multi-Part Collaborative Task Offloading and Resource Assignment in Ultra-Dense Networks | Tianqing Zhou et.al. | 2410.12186 | null |
2024-10-14 | Large Language Model Evaluation via Matrix Nuclear-Norm | Yahan Li et.al. | 2410.10672 | link |
2024-10-14 | QIANets: Quantum-Integrated Adaptive Networks for Reduced Latency and Improved Inference Times in CNN Models | Zhumazhan Balapanov et.al. | 2410.10318 | link |
2024-10-14 | Generative Human Video Compression with Multi-granularity Temporal Trajectory Factorization | Shanzhi Yin et.al. | 2410.10171 | null |
2024-10-13 | Towards Reproducible Learning-based Compression | Jiahao Pang et.al. | 2410.09872 | null |
2024-10-13 | Compressing Scene Dynamics: A Generative Approach | Shanzhi Yin et.al. | 2410.09768 | link |
2024-10-13 | ECVC: Exploiting Non-Local Correlations in Multiple Frames for Contextual Video Compression | Wei Jiang et.al. | 2410.09706 | link |
2024-10-12 | Fine-grained subjective visual quality assessment for high-fidelity compressed images | Michela Testolina et.al. | 2410.09501 | null |
2024-10-11 | Fast Data-independent KLT Approximations Based on Integer Functions | A. P. Radünz et.al. | 2410.09227 | null |
2024-10-10 | Compressing high-resolution data through latent representation encoding for downscaling large-scale AI weather forecast model | Qian Liu et.al. | 2410.09109 | null |
2024-10-11 | Data-Driven Neural Estimation of Indirect Rate-Distortion Function | Zichao Yu et.al. | 2410.09018 | null |
2024-10-11 | Compressing regularised dynamics improves link prediction in sparse networks | Maja Lindström et.al. | 2410.08777 | link |
2024-10-11 | Beyond GFVC: A Progressive Face Video Compression Framework with Adaptive Visual Tokens | Bolin Chen et.al. | 2410.08485 | link |
2024-10-10 | What is Left After Distillation? How Knowledge Transfer Impacts Fairness and Bias | Aida Mohammadshahi et.al. | 2410.08407 | null |
2024-10-16 | Delta-ICM: Entropy Modeling with Delta Function for Learned Image Compression | Takahiro Shindo et.al. | 2410.07669 | null |
2024-10-10 | MotionAura: Generating High-Quality and Motion Consistent Videos using Discrete Diffusion | Onkar Susladkar et.al. | 2410.07659 | null |
2024-10-10 | R-Adaptive Mesh Optimization to Enhance Finite Element Basis Compression | Graham Harper et.al. | 2410.07646 | null |
2024-10-09 | JPEG Inspired Deep Learning | Ahmed H. Salamah et.al. | 2410.07081 | link |
2024-10-09 | SHRINK: Data Compression by Semantic Extraction and Residuals Encoding | Guoyou Sun et.al. | 2410.06713 | null |
2024-10-09 | Convex Distillation: Efficient Compression of Deep Networks via Convex Optimization | Prateek Varshney et.al. | 2410.06567 | null |
2024-10-09 | Efficient and Robust Knowledge Distillation from A Stronger Teacher Based on Correlation Matching | Wenqi Niu et.al. | 2410.06561 | null |
2024-10-08 | Covering Numbers for Deep ReLU Networks with Applications to Function Approximation and Nonparametric Regression | Weigutian Ou et.al. | 2410.06378 | null |
2024-10-08 | Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach | Sha Guo et.al. | 2410.06149 | null |
2024-10-08 | Resolution limit of the eye: how many pixels can we see? | Maliha Ashraf et.al. | 2410.06068 | null |
2024-10-07 | Transformers learn variable-order Markov chains in-context | Ruida Zhou et.al. | 2410.05493 | null |
2024-10-07 | Salient Store: Enabling Smart Storage for Continuous Learning Edge Servers | Cyan Subhra Mishra et.al. | 2410.05435 | null |
2024-10-07 | Causal Context Adjustment Loss for Learned Image Compression | Minghao Han et.al. | 2410.04847 | link |
2024-10-06 | Channel-Aware Throughput Maximization for Cooperative Data Fusion in CAV | Haonan An et.al. | 2410.04320 | null |
2024-10-05 | Robust Task-Oriented Communication Framework for Real-Time Collaborative Vision Perception | Zhengru Fang et.al. | 2410.04168 | null |
2024-10-04 | On the Rate-Distortion-Complexity Trade-offs of Neural Video Coding | Yi-Hsin Chen et.al. | 2410.03898 | null |
2024-10-04 | A Framework for Automatic Validation and Application of Lossy Data Compression in Ensemble Data Assimilation | Kai Keller et.al. | 2410.03184 | null |
2024-10-03 | GABIC: Graph-based Attention Block for Image Compression | Gabriele Spadaro et.al. | 2410.02981 | link |
2024-10-03 | Diffusion-based Extreme Image Compression with Compressed Feature Initialization | Zhiyuan Li et.al. | 2410.02640 | link |
2024-10-03 | High-Efficiency Neural Video Compression via Hierarchical Predictive Learning | Ming Lu et.al. | 2410.02598 | link |
2024-10-02 | A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation | Liang Chen et.al. | 2410.01912 | link |
2024-10-02 | COSMIC: Compress Satellite Images Efficiently via Diffusion Compensation | Ziyuan Zhang et.al. | 2410.01698 | link |
2024-10-03 | Releasing the Parameter Latency of Neural Representation for High-Efficiency Video Compression | Gai Zhang et.al. | 2410.01654 | null |
2024-10-02 | Task-Oriented Edge-Assisted Cooperative Data Compression, Communications and Computing for UGV-Enhanced Warehouse Logistics | Jiaming Yang et.al. | 2410.01515 | null |
2024-10-01 | STanH : Parametric Quantization for Variable Rate Learned Image Compression | Alberto Presta et.al. | 2410.00557 | null |
2024-09-30 | LaMMA-P: Generalizable Multi-Agent Long-Horizon Task Allocation and Planning with LM-Driven PDDL Planner | Xiaopan Zhang et.al. | 2409.20560 | null |
2024-09-30 | PerCo (SD): Open Perceptual Compression | Nikolai Körber et.al. | 2409.20255 | link |
2024-09-29 | All-in-One Image Coding for Joint Human-Machine Vision with Multi-Path Aggregation | Xu Zhang et.al. | 2409.19660 | link |
2024-09-28 | Fast Encoding and Decoding for Implicit Video Representation | Hao Chen et.al. | 2409.19429 | null |
2024-09-27 | Learning-Based Image Compression for Machines | Kartik Gupta et.al. | 2409.19184 | link |
2024-09-27 | Effectiveness of learning-based image codecs on fingerprint storage | Daniele Mari et.al. | 2409.18730 | link |
2024-09-27 | Decoding Complexity-Rate-Quality Pareto-Front for Adaptive VVC Streaming | Angeliki Katsenou et.al. | 2409.18713 | null |
2024-09-27 | Neural Video Representation for Redundancy Reduction and Consistency Preservation | Taiga Hayami et.al. | 2409.18497 | null |
2024-09-20 | Blockchain-Enabled Variational Information Bottleneck for Data Extraction Based on Mutual Information in Internet of Vehicles | Cui Zhang et.al. | 2409.17287 | null |
2024-09-25 | Streaming Neural Images | Marcos V. Conde et.al. | 2409.17134 | null |
2024-09-25 | PhD Forum: Efficient Privacy-Preserving Processing via Memory-Centric Computing | Mpoki Mwaisela et.al. | 2409.16777 | null |
2024-09-25 | The Effect of Lossy Compression on 3D Medical Images Segmentation with Deep Learning | Anvar Kurmukov et.al. | 2409.16733 | null |
2024-09-24 | AIM 2024 Challenge on UHD Blind Photo Quality Assessment | Vlad Hosu et.al. | 2409.16271 | null |
2024-09-25 | COHERENT: Collaboration of Heterogeneous Multi-Robot System with Large Language Models | Kehui Liu et.al. | 2409.15146 | link |
2024-09-23 | AlphaZip: Neural Network-Enhanced Lossless Text Compression | Swathi Shree Narashiman et.al. | 2409.15046 | link |
2024-09-23 | Anomaly Detection from a Tensor Train Perspective | Alejandro Mata Ali et.al. | 2409.15030 | null |
2024-09-23 | AIM 2024 Challenge on Video Saliency Prediction: Methods and Results | Andrey Moskalenko et.al. | 2409.14827 | link |
2024-09-21 | Window-based Channel Attention for Wavelet-enhanced Learned Image Compression | Heng Xu et.al. | 2409.14090 | null |
2024-09-20 | Reduced bit median quantization: A middle process for Efficient Image Compression | Fikresilase Wondmeneh Abebayew et.al. | 2409.13789 | null |
2024-09-20 | Data Compression using Rank-1 Lattices for Parameter Estimation in Machine Learning | Michael Gnewuch et.al. | 2409.13453 | null |
2024-09-19 | Breaking the Barriers of One-to-One Usage of Implicit Neural Representation in Image Compression: A Linear Combination Approach with Performance Guarantees | Sai Sanjeet et.al. | 2409.13117 | link |
2024-09-19 | Optimal Coding for Randomized Kolmogorov Complexity and Its Applications | Shuichi Hirahara et.al. | 2409.12744 | null |
2024-09-19 | Multi-Scale Feature Prediction with Auxiliary-Info for Neural Image Compression | Chajin Shin et.al. | 2409.12719 | null |
2024-09-18 | One Map to Find Them All: Real-time Open-Vocabulary Mapping for Zero-shot Multi-Object Navigation | Finn Lukas Busch et.al. | 2409.11764 | null |
2024-09-18 | LFIC-DRASC: Deep Light Field Image Compression Using Disentangled Representation and Asymmetrical Strip Convolution | Shiyu Feng et.al. | 2409.11711 | null |
2024-09-18 | k-mer-based approaches to bridging pangenomics and population genetics | Miles D. Roberts et.al. | 2409.11683 | null |
2024-09-17 | Few-Shot Domain Adaptation for Learned Image Compression | Tianyu Zhang et.al. | 2409.11111 | null |
2024-09-17 | Edge-based Denoising Image Compression | Ryugo Morita et.al. | 2409.10978 | null |
2024-09-16 | Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning | Amin Karimi Monsefi et.al. | 2409.10362 | null |
2024-09-14 | Lossy Image Compression with Stochastic Quantization | Anton Kozyriev et.al. | 2409.09488 | null |
2024-09-13 | Fast DCT+: A Family of Fast Transforms Based on Rank-One Updates of the Path Graph | Samuel Fernández-Menduiña et.al. | 2409.08970 | null |
2024-09-13 | On the Computation of BD-Rate over a Set of Videos for Fair Assessment of Performance of Learned Video Codecs | M. Akin Yilmaz et.al. | 2409.08772 | null |
2024-09-13 | USTC-TD: A Test Dataset and Benchmark for Image and Video Coding in 2020s | Zhuoyuan Li et.al. | 2409.08481 | null |
2024-09-12 | Learned Compression for Images and Point Clouds | Mateen Ulhaq et.al. | 2409.08376 | link |
2024-09-11 | NVRC: Neural Video Representation Compression | Ho Man Kwan et.al. | 2409.07414 | null |
2024-09-11 | Dynamic Error-Bounded Hierarchical Matrices in Neural Network Compression | John Mango et.al. | 2409.07028 | null |
2024-09-10 | Universal End-to-End Neural Network for Lossy Image Compression | Bouzid Arezki et.al. | 2409.06586 | null |
2024-09-10 | Rate-Constrained Quantization for Communication-Efficient Federated Learning | Shayan Mohajer Hamidi et.al. | 2409.06319 | null |
2024-09-09 | Design and Implementation of TAO DAQ System | Shuihan Zhang et.al. | 2409.05522 | null |
2024-09-09 | A Taxonomy of Miscompressions: Preparing Image Forensics for Neural Compression | Nora Hofer et.al. | 2409.05490 | null |
2024-09-09 | Attention Based Machine Learning Methods for Data Reduction with Guaranteed Error Bounds | Xiao Li et.al. | 2409.05357 | null |
2024-09-06 | Convolutional Transformer-Based Image Compression | Bouzid Arezki et.al. | 2409.04118 | null |
2024-09-06 | 3D-GP-LMVIC: Learning-based Multi-View Image Coding with 3D Gaussian Geometric Priors | Yujun Huang et.al. | 2409.04013 | link |
2024-09-05 | TropNNC: Structured Neural Network Compression Using Tropical Geometry | Konstantinos Fotopoulos et.al. | 2409.03945 | null |
2024-09-05 | Unified Framework for Neural Network Compression via Decomposition and Optimal Rank Selection | Ali Aghababaei-Harandi et.al. | 2409.03555 | null |
2024-09-05 | Efficient Image Compression Using Advanced State Space Models | Bouzid Arezki et.al. | 2409.02743 | null |
2024-09-10 | FrameCorr: Adaptive, Autoencoder-based Neural Compression for Video Reconstruction in Resource and Timing Constrained Network Settings | John Li et.al. | 2409.02453 | null |
2024-09-03 | Compressed learning based onboard semantic compression for remote sensing platforms | Protim Bhattacharjee et.al. | 2409.01988 | link |
2024-09-03 | Map-Assisted Remote-Sensing Image Compression at Extremely Low Bitrates | Yixuan Ye et.al. | 2409.01935 | link |
2024-09-03 | Privacy-Preserving Multimedia Mobile Cloud Computing Using Protective Perturbation | Zhongze Tang et.al. | 2409.01710 | null |
2024-09-02 | Multi-Reference Generative Face Video Compression with Contrastive Learning | Goluck Konuko et.al. | 2409.01029 | null |
2024-09-02 | Accelerating block-level rate control for learned image compression | Muchen Dong et.al. | 2409.01009 | null |
2024-09-02 | PNVC: Towards Practical INR-based Video Compression | Ge Gao et.al. | 2409.00953 | null |
2024-09-01 | BWT construction and search at the terabase scale | Heng Li et.al. | 2409.00613 | link |
2024-08-30 | Prioritized Information Bottleneck Theoretic Framework with Distributed Online Learning for Edge Video Analytics | Zhengru Fang et.al. | 2409.00146 | null |
2024-08-28 | Quantum Kernel Principal Components Analysis for Compact Readout of Chemiresistive Sensor Arrays | Zeheng Wang et.al. | 2409.00115 | null |
2024-08-30 | NDP: Next Distribution Prediction as a More Broad Target | Junhao Ruan et.al. | 2408.17377 | null |
2024-08-30 | Approximately Invertible Neural Network for Learned Image Compression | Yanbo Gao et.al. | 2408.17073 | null |
2024-08-29 | UAV-Based Human Body Detector Selection and Fusion for Geolocated Saliency Map Generation | Piotr Rudol et.al. | 2408.16501 | null |
2024-08-29 | Convolutional Neural Network Compression Based on Low-Rank Decomposition | Yaping He et.al. | 2408.16289 | null |
2024-08-27 | Bandwidth-Aware and Overlap-Weighted Compression for Communication-Efficient Federated Learning | Zichen Tang et.al. | 2408.14736 | null |
2024-08-25 | Condensed Sample-Guided Model Inversion for Knowledge Distillation | Kuluhan Binici et.al. | 2408.13850 | null |
2024-08-12 | Semantic Variational Bayes Based on a Semantic Information Theory for Solving Latent Variables | Chenguang Lu et.al. | 2408.13122 | null |
2024-08-22 | Quantization-free Lossy Image Compression Using Integer Matrix Factorization | Pooya Ashtari et.al. | 2408.12691 | link |
2024-08-22 | DeepHQ: Learned Hierarchical Quantizer for Progressive Deep Image Coding | Jooyoung Lee et.al. | 2408.12150 | null |
2024-08-28 | AIM 2024 Challenge on Compressed Video Quality Assessment: Methods and Results | Maksim Smirnov et.al. | 2408.11982 | link |
2024-08-20 | Trustworthy Compression? Impact of AI-based Codecs on Biometrics for Law Enforcement | Sandra Bergmann et.al. | 2408.10823 | null |
2024-08-20 | Diff-PCC: Diffusion-based Neural Compression for 3D Point Clouds | Kai Liu et.al. | 2408.10543 | null |
2024-08-16 | LLM-PCGC: Large Language Model-based Point Cloud Geometry Compression | Yuqi Ye et.al. | 2408.08682 | null |
2024-08-16 | Bi-Directional Deep Contextual Video Compression | Xihua Sheng et.al. | 2408.08604 | null |
2024-08-16 | Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs | Jinming Liu et.al. | 2408.08575 | null |
2024-08-15 | Algebraic Vertex Ordering of a Sparse Graph for Adjacency Access Locality and Graph Compression | Dimitris Floros et.al. | 2408.08439 | null |
2024-08-15 | When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding | Pingping Zhang et.al. | 2408.08093 | null |
2024-08-15 | DM2RM: Dual-Mode Multimodal Ranking for Target Objects and Receptacles Based on Open-Vocabulary Instructions | Ryosuke Korekata et.al. | 2408.07910 | null |
2024-08-14 | Towards Real-time Video Compressive Sensing on Mobile Devices | Miao Cao et.al. | 2408.07530 | link |
2024-08-14 | Encoding and Decoding Algorithms of ANS Variants and Evaluation of Their Average Code Lengths | Hirosuke Yamamoto et.al. | 2408.07322 | null |
2024-08-13 | Subjective and Objective Quality Assessment of Rendered Human Avatar Videos in Virtual Reality | Yu-Chih Chen et.al. | 2408.07041 | null |
2024-08-13 | Feature-Preserving Rate-Distortion Optimization in Image Coding for Machines | Samuel Fernández Menduiña et.al. | 2408.07028 | null |
2024-08-19 | Joint Source-Channel Optimization for UAV Video Coding and Transmission | Kesong Wu et.al. | 2408.06667 | null |
2024-08-08 | Flow-Lenia.png: Evolving Multi-Scale Complexity by Means of Compression | Tadashi Adachi et.al. | 2408.06374 | null |
2024-08-09 | Benchmarking Conventional and Learned Video Codecs with a Low-Delay Configuration | Siyue Teng et.al. | 2408.05042 | null |
2024-08-08 | SG-JND: Semantic-Guided Just Noticeable Distortion Predictor For Image Compression | Linhan Cao et.al. | 2408.04273 | null |
2024-08-07 | Bi-Level Spatial and Channel-aware Transformer for Learned Image Compression | Hamidreza Soltani et.al. | 2408.03842 | null |
2024-08-07 | BVI-AOM: A New Training Dataset for Deep Video Compression Optimization | Jakub Nawała et.al. | 2408.03265 | link |
2024-08-06 | Enabling High-Throughput Parallel I/O in Particle-in-Cell Monte Carlo Simulations with openPMD and Darshan I/O Monitoring | Jeremy J. Williams et.al. | 2408.02869 | null |
2024-08-05 | Dimensionality Reduction and Nearest Neighbors for Improving Out-of-Distribution Detection in Medical Image Segmentation | McKell Woodland et.al. | 2408.02761 | link |
2024-08-04 | CACE-Net: Co-guidance Attention and Contrastive Enhancement for Effective Audio-Visual Event Localization | Xiang He et.al. | 2408.01952 | link |
2024-08-03 | Channel-Aware Distributed Transmission Control and Video Streaming in UAV Networks | Masoud Ghazikor et.al. | 2408.01885 | null |
2024-08-02 | An Adaptive Tensor-Train Decomposition Approach for Efficient Deep Neural Network Compression | Shiyi Luo et.al. | 2408.01534 | null |
2024-07-31 | Exploiting Change Blindness for Video Coding: Perspectives from a Less Promising User Study | Mitra Amiri et.al. | 2408.00052 | null |
2024-07-31 | Tora: Trajectory-oriented Diffusion Transformer for Video Generation | Zhenghao Zhang et.al. | 2407.21705 | link |
2024-07-30 | Edge Learning Based Collaborative Automatic Modulation Classification for Hierarchical Cognitive Radio Networks | Peihao Dong et.al. | 2407.20772 | link |
2024-07-30 | Understanding the Impact of Synchronous, Asynchronous, and Hybrid In-Situ Techniques in Computational Fluid Dynamics Applications | Yi Ju et.al. | 2407.20717 | null |
2024-07-29 | Homomorphic data compression for real time photon correlation analysis | Sebastian Strempfer et.al. | 2407.20356 | null |
2024-07-24 | Accelerating the Low-Rank Decomposed Models | Habib Hajimolahoseini et.al. | 2407.20266 | null |
2024-07-29 | ComNeck: Bridging Compressed Image Latents and Multimodal LLMs via Universal Transform-Neck | Chia-Hao Kao et.al. | 2407.19651 | null |
2024-07-28 | NVC-1B: A Large Neural Video Coding Model | Xihua Sheng et.al. | 2407.19402 | null |
2024-07-18 | Generative AI Augmented Induction-based Formal Verification | Aman Kumar et.al. | 2407.18965 | null |
2024-07-25 | The seismic purifier: An unsupervised approach to seismic signal detection via representation learning | Onur Efe et.al. | 2407.18402 | link |
2024-07-25 | Adaptable Deep Joint Source-and-Channel Coding for Small Satellite Applications | Olga Kondrateva et.al. | 2407.18146 | null |
2024-07-25 | Scaling Training Data with Lossy Image Compression | Katherine L. Mentzer et.al. | 2407.17954 | link |
2024-07-25 | Towards the Spectral bias Alleviation by Normalizations in Coordinate Networks | Zhicheng Cai et.al. | 2407.17834 | link |
2024-07-24 | Lossy Data Compression By Adaptive Mesh Coarsening | N. Böing et.al. | 2407.17316 | null |
2024-07-24 | High Efficiency Image Compression for Large Visual-Language Models | Binzhe Li et.al. | 2407.17060 | null |
2024-07-23 | Accelerating Learned Video Compression via Low-Resolution Representation Learning | Zidian Qiu et.al. | 2407.16418 | null |
2024-07-24 | FCNR: Fast Compressive Neural Representation of Visualization Images | Yunfei Lu et.al. | 2407.16369 | link |
2024-07-19 | Shapley Pruning for Neural Network Compression | Kamil Adamczewski et.al. | 2407.15875 | null |
2024-07-18 | CIC: Circular Image Compression | Honggui Li et.al. | 2407.15870 | null |
2024-07-22 | Online String Attractors | Philip Whittington et.al. | 2407.15599 | null |
2024-07-22 | Spectral properties of bright deposits in permanently shadowed craters on Ceres | Stefan Schröder et.al. | 2407.15327 | null |
2024-07-21 | Lessons Learned on the Path to Guaranteeing the Error Bound in Lossy Quantizers | Alex Fallin et.al. | 2407.15037 | null |
2024-07-19 | A Benchmark for Gaussian Splatting Compression and Quality Assessment Study | Qi Yang et.al. | 2407.14197 | link |
2024-07-18 | Training Foundation Models as Data Compression: On Information, Model Weights and Copyright Law | Giorgio Franceschelli et.al. | 2407.13493 | null |
2024-07-18 | Learned HDR Image Compression for Perceptually Optimal Storage and Display | Peibei Cao et.al. | 2407.13179 | null |
2024-07-17 | High Frequency Matters: Uncertainty Guided Image Compression with Wavelet Diffusion | Juan Song et.al. | 2407.12538 | link |
2024-07-17 | Enhancing Film Grain Coding in VVC: Improving Encoding Quality and Efficiency | Vignesh V Menon et.al. | 2407.12465 | null |
2024-07-17 | Reliability Function of Classical-Quantum Channels | Ke Li et.al. | 2407.12403 | null |
2024-07-17 | Exploiting Inter-Image Similarity Prior for Low-Bitrate Remote Sensing Image Compression | Junhui Li et.al. | 2407.12295 | null |
2024-07-16 | Tiled Bit Networks: Sub-Bit Neural Network Compression Through Reuse of Learnable Binary Vectors | Matt Gorbett et.al. | 2407.12075 | null |
2024-07-17 | Rate-Distortion-Cognition Controllable Versatile Neural Image Compression | Jinming Liu et.al. | 2407.11700 | null |
2024-07-16 | MINI-LLM: Memory-Efficient Structured Pruning for Large Language Models | Hongrong Cheng et.al. | 2407.11681 | null |
2024-07-17 | Neural Compression of Atmospheric States | Piotr Mirowski et.al. | 2407.11666 | null |
2024-07-16 | Rethinking Learned Image Compression: Context is All You Need | Jixiang Luo et.al. | 2407.11590 | null |
2024-07-16 | The impact of lossy data compression on the power spectrum of the high redshift 21-cm signal with LOFAR | J. K. Chege et.al. | 2407.11557 | null |
2024-07-21 | Uniformly Accelerated Motion Model for Inter Prediction | Zhuoyuan Li et.al. | 2407.11541 | null |
2024-07-15 | M18K: A Comprehensive RGB-D Dataset and Benchmark for Mushroom Detection and Instance Segmentation | Abdollah Zakeri et.al. | 2407.11275 | link |
2024-07-15 | Enhancing Electrocardiogram Signal Analysis Using NLP-Inspired Techniques: A Novel Approach with Embedding and Self-Attention | Prapti Ganguly et.al. | 2407.11102 | null |
2024-07-15 | In-Loop Filtering via Trained Look-Up Tables | Zhuoyuan Li et.al. | 2407.10926 | null |
2024-07-15 | Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model | Zhening Liu et.al. | 2407.10632 | link |
2024-07-14 | UMI on Legs: Making Manipulation Policies Mobile with Manipulation-Centric Whole-body Controllers | Huy Ha et.al. | 2407.10353 | null |
2024-07-13 | WeConvene: Learned Image Compression with Wavelet-Domain Convolution and Entropy Model | Haisheng Fu et.al. | 2407.09983 | null |
2024-07-13 | Zero-Shot Image Compression with Diffusion-Based Posterior Sampling | Noam Elata et.al. | 2407.09896 | link |
2024-07-13 | Image Compression for Machine and Human Vision with Spatial-Frequency Adaptation | Han Li et.al. | 2407.09853 | link |
2024-07-13 | Infinite families of optimal and minimal codes over rings using simplicial complexes | Yanan Wu et.al. | 2407.09783 | null |
2024-07-12 | HPC: Hierarchical Progressive Coding Framework for Volumetric Video | Zihan Zheng et.al. | 2407.09026 | null |
2024-07-12 | Hybrid Temporal Computing for Lower Power Hardware Accelerators | Maliha Tasnim et.al. | 2407.08975 | null |
2024-07-11 | Manipulating a Tetris-Inspired 3D Video Representation | Mihir Godbole et.al. | 2407.08885 | null |
2024-07-11 | OMR-NET: a two-stage octave multi-scale residual network for screen content image compression | Shiqi Jiang et.al. | 2407.08545 | null |
2024-07-11 | CADC: Encoding User-Item Interactions for Compressing Recommendation Model Training Data | Hossein Entezari Zarch et.al. | 2407.08108 | null |
2024-07-10 | Using Low-Discrepancy Points for Data Compression in Machine Learning: An Experimental Comparison | Simone Göttlich et.al. | 2407.07450 | null |
2024-07-10 | Standard compliant video coding using low complexity, switchable neural wrappers | Yueyu Hu et.al. | 2407.07395 | null |
2024-07-10 | MNeRV: A Multilayer Neural Representation for Videos | Qingling Chang et.al. | 2407.07347 | link |
2024-07-11 | Entropy Law: The Story Behind Data Compression and LLM Performance | Mingjia Yin et.al. | 2407.06645 | link |
2024-07-08 | A Hybrid Algorithm for Computing a Partial Singular Value Decomposition Satisfying a Given Threshold | James Baglama et.al. | 2407.06306 | link |
2024-07-08 | TAPVid-3D: A Benchmark for Tracking Any Point in 3D | Skanda Koppula et.al. | 2407.05921 | link |
2024-07-05 | The Impact of Quantization and Pruning on Deep Reinforcement Learning Models | Heng Lu et.al. | 2407.04803 | null |
2024-07-05 | An autoencoder for compressing angle-resolved photoemission spectroscopy data | Steinn Ymir Agustsson et.al. | 2407.04631 | link |
2024-07-05 | Rethinking Image Compression on the Web with Generative AI | Shayan Ali Hassan et.al. | 2407.04542 | null |
2024-07-11 | A High-Quality Workflow for Multi-Resolution Scientific Data Reduction and Visualization | Daoce Wang et.al. | 2407.04267 | null |
2024-07-04 | Autoencoded Image Compression for Secure and Fast Transmission | Aryan Kashyap Naveen et.al. | 2407.03990 | link |
2024-07-03 | Value-Penalized Auxiliary Control from Examples for Learning without Rewards or Demonstrations | Trevor Ablett et.al. | 2407.03311 | link |
2024-07-03 | KeyVideoLLM: Towards Large-scale Video Keyframe Selection | Hao Liang et.al. | 2407.03104 | null |
2024-07-01 | Statistical Analysis of ZFP: Understanding Bias | Alyson Fox et.al. | 2407.01826 | null |
2024-07-01 | An AI-based, Error-bounded Compression Scheme for High-frequency Power Quality Disturbance Data | Markus Stroot et.al. | 2407.01112 | null |
2024-06-28 | Wavelets Are All You Need for Autoregressive Image Generation | Wael Mattar et.al. | 2406.19997 | null |
2024-06-28 | Optimal Video Compression using Pixel Shift Tracking | Hitesh Saai Mananchery Panneerselvam et.al. | 2406.19630 | link |
2024-06-27 | MCNC: Manifold Constrained Network Compression | Chayne Thrash et.al. | 2406.19301 | null |
2024-06-27 | Staggered Quantizers for Perfect Perceptual Quality: A Connection between Quantizers with Common Randomness and Without | Ruida Zhou et.al. | 2406.19248 | null |
2024-06-25 | Asymptotically Minimax Regret by Bayes Mixtures | Jun’ichi Takeuchi et.al. | 2406.17929 | null |
2024-06-24 | Hierarchical B-frame Video Coding for Long Group of Pictures | Ivan Kirillov et.al. | 2406.16544 | null |
2024-06-20 | Ranking LLMs by compression | Peijia Guo et.al. | 2406.14171 | null |
2024-06-21 | Measuring Sample Importance in Data Pruning for Training LLMs from a Data Compression Perspective | Minsang Kim et.al. | 2406.14124 | null |
2024-06-20 | Prediction and Reference Quality Adaptation for Learned Video Compression | Xihua Sheng et.al. | 2406.14118 | null |
2024-06-19 | Convex-hull Estimation using XPSNR for Versatile Video Coding | Vignesh V Menon et.al. | 2406.13712 | null |
2024-06-19 | A Study on the Effect of Color Spaces in Learned Image Compression | Srivatsa Prativadibhayankaram et.al. | 2406.13709 | null |
2024-06-19 | Stability and Generalizability in SDE Diffusion Models with Measure-Preserving Dynamics | Weitong Zhang et.al. | 2406.13652 | null |
2024-06-18 | Learned Image Compression for HE-stained Histopathological Images via Stain Deconvolution | Maximilian Fischer et.al. | 2406.12623 | null |
2024-06-18 | Competitive Learning for Achieving Content-specific Filters in Video Coding for Machines | Honglei Zhang et.al. | 2406.12367 | null |
2024-06-15 | How Should We Extract Discrete Audio Tokens from Self-Supervised Models? | Pooneh Mousavi et.al. | 2406.10735 | null |
2024-06-15 | Object-Attribute-Relation Representation based Video Semantic Communication | Qiyuan Du et.al. | 2406.10469 | null |
2024-06-14 | On Efficient Neural Network Architectures for Image Compression | Yichi Zhang et.al. | 2406.10361 | link |
2024-06-14 | Information Compression in the AI Era: Recent Advances and Future Challenges | Jun Chen et.al. | 2406.10036 | null |
2024-06-13 | CMC-Bench: Towards a New Paradigm of Visual Signal Compression | Chunyi Li et.al. | 2406.09356 | link |
2024-06-13 | Neural NeRF Compression | Tuan Pham et.al. | 2406.08943 | null |
2024-06-14 | Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models | Yi-Fan Zhang et.al. | 2406.08487 | link |
2024-06-12 | On Annotation-free Optimization of Video Coding for Machines | Marc Windsheimer et.al. | 2406.07938 | null |
2024-06-11 | SSNVC: Single Stream Neural Video Compression with Implicit Temporal Information | Feng Wang et.al. | 2406.07645 | null |
2024-06-11 | Image and Video Tokenization with Binary Spherical Quantization | Yue Zhao et.al. | 2406.07548 | link |
2024-06-11 | Optimal Matrix-Mimetic Tensor Algebras via Variable Projection | Elizabeth Newman et.al. | 2406.06942 | link |
2024-06-10 | Deep Generative Modeling Reshapes Compression and Transmission: From Efficiency to Resiliency | Jincheng Dai et.al. | 2406.06446 | null |
2024-06-10 | Image Compression with Isotropic and Anisotropic Shepard Inpainting | Rahul Mohideen Kaja Mohideen et.al. | 2406.06247 | null |
2024-06-10 | Efficient Neural Compression with Inference-time Decoding | C. Metz et.al. | 2406.06237 | null |
2024-06-10 | Fiducial-Cosmology-dependent systematics for the DESI 2024 BAO Analysis | A. Pérez-Fernández et.al. | 2406.06085 | null |
2024-06-10 | Quantum Sparse Coding and Decoding Based on Quantum Network | Xun Ji et.al. | 2406.06012 | null |
2024-06-09 | Region of Interest Loss for Anonymizing Learned Image Compression | Christoph Liebender et.al. | 2406.05726 | link |
2024-06-08 | Regularized Training with Generated Datasets for Name-Only Transfer of Vision-Language Models | Minho Park et.al. | 2406.05432 | link |
2024-06-07 | PatchSVD: A Non-uniform SVD-based Image Compression Algorithm | Zahra Golpayegani et.al. | 2406.05129 | link |
2024-06-07 | SMC++: Masked Learning of Unsupervised Video Semantic Compression | Yuan Tian et.al. | 2406.04765 | link |
2024-06-06 | LDM-RSIC: Exploring Distortion Prior with Latent Diffusion Models for Remote Sensing Image Compression | Junhui Li et.al. | 2406.03961 | link |
2024-06-05 | Lossless Image Compression Using Multi-level Dictionaries: Binary Images | Samar Agnihotri et.al. | 2406.03087 | null |
2024-06-05 | On Jacob Ziv’s Individual-Sequence Approach to Information Theory | Neri Merhav et.al. | 2406.02904 | null |
2024-06-04 | Towards AI-Assisted Sustainable Adaptive Video Streaming Systems: Tutorial and Survey | Reza Farahani et.al. | 2406.02302 | null |
2024-06-03 | Video Coding with Cross-Component Sample Offset | Han Gao et.al. | 2406.01795 | null |
2024-06-05 | Once-for-All: Controllable Generative Image Compression with Dynamic Granularity Adaption | Anqi Li et.al. | 2406.00758 | link |
2024-06-01 | Efficient Massive Black Hole Binary parameter estimation for LISA using Sequential Neural Likelihood | Iván Martín Vílchez et.al. | 2406.00565 | null |
2024-06-01 | A Review of Pulse-Coupled Neural Network Applications in Computer Vision and Image Processing | Nurul Rafi et.al. | 2406.00239 | null |
2024-05-31 | ContextGS: Compact 3D Gaussian Splatting with Anchor Level Context Model | Yufei Wang et.al. | 2405.20721 | link |
2024-05-30 | Quantum encoder for fixed Hamming-weight subspaces | Renato M. S. Farias et.al. | 2405.20408 | null |
2024-05-29 | Implicit Neural Image Field for Biological Microscopy Image Compression | Gaole Dai et.al. | 2405.19012 | link |
2024-05-28 | Deep Network Pruning: A Comparative Study on CNNs in Face Recognition | Fernando Alonso-Fernandez et.al. | 2405.18302 | null |
2024-05-28 | Channel Reciprocity Based Attack Detection for Securing UWB Ranging by Autoencoder | Wenlong Gou et.al. | 2405.18255 | null |
2024-05-27 | Evaluation of Resource-Efficient Crater Detectors on Embedded Systems | Simon Vellas et.al. | 2405.16953 | link |
2024-05-27 | UniCompress: Enhancing Multi-Data Medical Image Compression with Knowledge Distillation | Runzhao Yang et.al. | 2405.16850 | null |
2024-05-27 | Controlling Rate, Distortion, and Realism: Towards a Single Comprehensive Neural Image Compression Model | Shoma Iwai et.al. | 2405.16817 | link |
2024-05-25 | N-BVH: Neural ray queries with bounding volume hierarchies | Philippe Weier et.al. | 2405.16237 | link |
2024-05-25 | A 7K Parameter Model for Underwater Image Enhancement based on Transmission Map Prior | Fuheng Zhou et.al. | 2405.16197 | link |
2024-05-24 | Analytical proxy to families of numerical solutions: the case study of spherical mini-boson stars | Jianzhi Yang et.al. | 2405.15651 | null |
2024-05-24 | SATSense: Multi-Satellite Collaborative Framework for Spectrum Sensing | Haoxuan Yuan et.al. | 2405.15542 | null |
2024-05-24 | Meta-meshing and triangulating lattice structures at a large scale | Qiang Zou et.al. | 2405.15197 | null |
2024-05-23 | NeCGS: Neural Compression for 3D Geometry Sets | Siyu Ren et.al. | 2405.15034 | null |
2024-05-23 | An augmented Lagrangian trust-region method with inexact gradient evaluations to accelerate constrained optimization problems using model hyperreduction | Tianshu Wen et.al. | 2405.14827 | null |
2024-05-23 | Motion-based video compression for resource-constrained camera traps | Malika Nisal Ratnayake et.al. | 2405.14419 | null |
2024-06-01 | I $^2$ VC: A Unified Framework for Intra- & Inter-frame Video Compression | Meiqin Liu et.al. | 2405.14336 | link |
2024-05-23 | Sparse $L^1$ -Autoencoders for Scientific Data Compression | Matthias Chung et.al. | 2405.14270 | null |
2024-05-22 | “Turing Tests” For An AI Scientist | Xiaoxin Yin et.al. | 2405.13352 | null |
2024-05-21 | Efficient Learned Wavelet Image and Video Coding | Anna Meyer et.al. | 2405.12631 | null |
2024-05-24 | Accelerating Relative Entropy Coding with Space Partitioning | Jiajun He et.al. | 2405.12203 | null |
2024-05-20 | Refining Coded Image in Human Vision Layer Using CNN-Based Post-Processing | Takahiro Shindo et.al. | 2405.11894 | null |
2024-05-19 | Effective In-Context Example Selection through Data Compression | Zhongxiang Sun et.al. | 2405.11465 | null |
2024-05-18 | InfRS: Incremental Few-Shot Object Detection in Remote Sensing Images | Wuzhou Li et.al. | 2405.11293 | link |
2024-05-17 | Dark Energy Survey Year 3 results: simulation-based cosmological inference with wavelet harmonics, scattering transforms, and moments of weak lensing mass maps II. Cosmological results | M. Gatti et.al. | 2405.10881 | null |
2024-05-17 | Reduced storage direct tensor ring decomposition for convolutional neural networks compression | Mateusz Gabor et.al. | 2405.10802 | link |
2024-05-17 | Enhancing Perception Quality in Remote Sensing Image Compression via Invertible Neural Network | Junhui Li et.al. | 2405.10518 | null |
2024-05-15 | Properties that allow or prohibit transferability of adversarial attacks among quantized networks | Abhishek Shrestha et.al. | 2405.09598 | link |
2024-05-15 | Sensitivity Decouple Learning for Image Compression Artifacts Reduction | Li Ma et.al. | 2405.09291 | null |
2024-05-18 | Scalable Image Coding for Humans and Machines Using Feature Fusion Network | Takahiro Shindo et.al. | 2405.09152 | link |
2024-05-14 | Parameter-Efficient Instance-Adaptive Neural Video Compression | Hyunmo Yang et.al. | 2405.08530 | link |
2024-05-13 | Goal-oriented compression for $L_p$ -norm-type goal functions: Application to power consumption scheduling | Yifei Sun et.al. | 2405.07808 | null |
2024-05-13 | Neural Network Compression for Reinforcement Learning Tasks | Dmitry A. Ivanov et.al. | 2405.07748 | null |
2024-05-13 | On the Adversarial Robustness of Learning-based Image Compression Against Rate-Distortion Attacks | Chenhao Wu et.al. | 2405.07717 | null |
2024-05-21 | An Efficient Compression Method for Sign Information of DCT Coefficients via Sign Retrieval | Chihiro Tsutake et.al. | 2405.07487 | link |
2024-05-10 | Time-of-arrival Estimation and Phase Unwrapping of Head-related Transfer Functions With Integer Linear Programming | Chin-Yun Yu et.al. | 2405.06804 | link |
2024-05-08 | Urban Boundary Delineation from Commuting Data with Bayesian Stochastic Blockmodeling: Scale, Contiguity, and Hierarchy | Sebastian Morel-Balbi et.al. | 2405.04911 | link |
2024-05-14 | Some Notes on the Sample Complexity of Approximate Channel Simulation | Gergely Flamich et.al. | 2405.04363 | null |
2024-05-07 | Group-aware Parameter-efficient Updating for Content-Adaptive Neural Video Compression | Zhenghao Chen et.al. | 2405.04274 | null |
2024-05-08 | Verified Neural Compressed Sensing | Rudy Bunel et.al. | 2405.04260 | null |
2024-05-15 | Lossy Compression with Data, Perception, and Classification Constraints | Yuhan Wang et.al. | 2405.04144 | null |
2024-05-07 | DMOFC: Discrimination Metric-Optimized Feature Compression | Changsheng Gao et.al. | 2405.04044 | null |
2024-05-06 | Computational ghost imaging with hybrid transforms by integrating Hadamard, discrete cosine, and Haar matrices | Yi-Ning Zhao et.al. | 2405.03729 | null |
2024-05-06 | A Rate-Distortion-Classification Approach for Lossy Image Compression | Yuefeng Zhang et.al. | 2405.03500 | null |
2024-05-06 | Structure-Preserving Network Compression Via Low-Rank Induced Training Through Linear Layers Composition | Xitong Zhang et.al. | 2405.03089 | link |
2024-05-04 | Deep Pulse-Signal Magnification for remote Heart Rate Estimation in Compressed Videos | Joaquim Comas et.al. | 2405.02652 | null |
2024-05-06 | Torch2Chip: An End-to-end Customizable Deep Neural Network Compression and Deployment Toolkit for Prototype Hardware Accelerator Design | Jian Meng et.al. | 2405.01775 | link |
2024-05-02 | PointCompress3D – A Point Cloud Compression Framework for Roadside LiDARs in Intelligent Transportation Systems | Walter Zimmer et.al. | 2405.01750 | null |
2024-04-28 | Lightweight Conceptual Dictionary Learning for Text Classification Using Information Compression | Li Wan et.al. | 2405.01584 | null |
2024-05-02 | GroupedMixer: An Entropy Model with Group-wise Token-Mixers for Learned Image Compression | Daxin Li et.al. | 2405.01170 | null |
2024-04-30 | Analysis and Enhancement of Lossless Image Compression in JPEG-XL | Rustam Mamedov et.al. | 2404.19755 | null |
2024-04-30 | EfficientASR: Speech Recognition Network Compression via Attention Redundancy and Chunk-Level FFN Optimization | Jianzong Wang et.al. | 2404.19214 | null |
2024-04-29 | Towards Extreme Image Compression with Latent Feature Guidance and Diffusion Prior | Zhiyuan Li et.al. | 2404.18820 | link |
2024-04-28 | Joint Reference Frame Synthesis and Post Filter Enhancement for Versatile Video Coding | Weijie Bao et.al. | 2404.18058 | null |
2024-04-25 | Learning Visuotactile Skills with Two Multifingered Hands | Toru Lin et.al. | 2404.16823 | link |
2024-04-24 | Domain Adaptation for Learned Image Compression with Supervised Adapters | Alberto Presta et.al. | 2404.15591 | link |
2024-04-23 | One-Pass Randomized Algorithm with Practical Rangefinder for Low-Rank Approximation to Quaternion Matrices | Chao Chang et.al. | 2404.14783 | link |
2024-04-22 | Neural Compress-and-Forward for the Relay Channel | Ezgi Ozyilkan et.al. | 2404.14594 | null |
2024-04-22 | Taming Server Memory TCO with Multiple Software-Defined Compressed Tiers | Sandeep Kumar et.al. | 2404.13886 | null |
2024-04-20 | HybridFlow: Infusing Continuity into Masked Codebook for Extreme Low-Bitrate Image Compression | Lei Lu et.al. | 2404.13372 | null |
2024-04-18 | Image Compression and Reconstruction Based on Quantum Network | Xun Ji et.al. | 2404.11994 | null |
2024-04-17 | Spatio-Temporal Motion Retargeting for Quadruped Robots | Taerim Yoon et.al. | 2404.11557 | null |
2024-04-17 | Multi-resolution Rescored ByteTrack for Video Object Detection on Ultra-low-power Embedded Systems | Luca Bompani et.al. | 2404.11488 | link |
2024-04-17 | Image Generative Semantic Communication with Multi-Modal Similarity Estimation for Resource-Limited Networks | Eri Hosonuma et.al. | 2404.11280 | null |
2024-04-16 | Tripod: Three Complementary Inductive Biases for Disentangled Representation Learning | Kyle Hsu et.al. | 2404.10282 | link |
2024-04-16 | Compressible and Searchable: AI-native Multi-Modal Retrieval System with Learned Image Compression | Jixiang Luo et.al. | 2404.10234 | null |
2024-04-15 | One-Click Upgrade from 2D to 3D: Sandwiched RGB-D Video Compression for Stereoscopic Teleconferencing | Yueyu Hu et.al. | 2404.09979 | null |
2024-04-15 | Quantization of Large Language Models with an Overdetermined Basis | Daniil Merkulov et.al. | 2404.09737 | null |
2024-04-18 | Post-Training Network Compression for 3D Medical Image Segmentation: Reducing Computational Efforts via Tucker Decomposition | Tobias Weber et.al. | 2404.09683 | link |
2024-04-15 | MarsQE: Semantic-Informed Quality Enhancement for Compressed Martian Image | Chengfeng Liu et.al. | 2404.09433 | null |
2024-04-17 | Incremental data compression for PDE-constrained optimization with a data assimilation application | Xuejian Li et.al. | 2404.09323 | null |
2024-04-14 | A Joint Data Compression and Time-Delay Estimation Method For Distributed Systems via Extremum Encoding | Amir Weiss et.al. | 2404.09244 | null |
2024-04-12 | Lossy Image Compression with Foundation Diffusion Models | Lucas Relic et.al. | 2404.08580 | null |
2024-04-12 | Mitigating Challenges of the Space Environment for Onboard Artificial Intelligence: Design Overview of the Imaging Payload on SpIRIT | Miguel Ortiz del Castillo et.al. | 2404.08399 | null |
2024-04-11 | Video Compression Beyond VVC: Quantitative Analysis of Intra Coding Tools in Enhanced Compression Model (ECM) | Mohsen Abdoli et.al. | 2404.07872 | null |
2024-04-11 | Learning to Classify New Foods Incrementally Via Compressed Exemplars | Justin Yang et.al. | 2404.07507 | null |
2024-04-14 | A comparison between Shapefit compression and Full-Modelling method with PyBird for DESI 2024 and beyond | Y. Lai et.al. | 2404.07283 | link |
2024-04-10 | Exploring Repetitiveness Measures for Two-Dimensional Strings | Giuseppe Romana et.al. | 2404.07030 | null |
2024-04-10 | Fine color guidance in diffusion models and its application to image compression at extremely low bitrates | Tom Bordin et.al. | 2404.06865 | null |
2024-04-09 | Encoder-Quantization-Motion-based Video Quality Metrics | Yixu Chen et.al. | 2404.06620 | null |
2024-04-09 | DiffHarmony: Latent Diffusion Model Meets Image Harmonization | Pengfei Zhou et.al. | 2404.06139 | link |
2024-04-09 | Communication-Efficient Large-Scale Distributed Deep Learning: A Comprehensive Survey | Feng Liang et.al. | 2404.06114 | null |
2024-04-09 | Image and Video Compression using Generative Sparse Representation with Fidelity Controls | Wei Jiang et.al. | 2404.06076 | null |
2024-04-07 | Correcting Diffusion-Based Perceptual Image Compression with Privileged End-to-End Decoder | Yiyang Ma et.al. | 2404.04916 | null |
2024-04-07 | Task-Aware Encoder Control for Deep Video Compression | Xingtong Ge et.al. | 2404.04848 | null |
2024-04-06 | Power-Efficient Image Storage: Leveraging Super Resolution Generative Adversarial Network for Sustainable Compression and Reduced Carbon Footprint | Ashok Mondal et.al. | 2404.04642 | null |
2024-04-05 | ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing | Alec Helbling et.al. | 2404.04376 | link |
2024-04-03 | Convolutional variational autoencoders for secure lossy image compression in remote sensing | Alessandro Giuliano et.al. | 2404.03696 | null |
2024-03-25 | RL for Consistency Models: Faster Reward Guided Text-to-Image Generation | Owen Oertell et.al. | 2404.03673 | link |
2024-04-04 | Training LLMs over Neurally Compressed Text | Brian Lester et.al. | 2404.03626 | null |
2024-04-04 | Leveraging Interpolation Models and Error Bounds for Verifiable Scientific Machine Learning | Tyler Chang et.al. | 2404.03586 | link |
2024-04-04 | Semantic Compression with Information Lattice Learning | Haizi Yu et.al. | 2404.03131 | null |
2024-04-01 | Accounting for contact network uncertainty in epidemic inferences with Approximate Bayesian Computation | Maxwell H. Wang et.al. | 2404.02924 | null |
2024-04-03 | Building test batteries based on analysing random number generator tests within the framework of algorithmic information theory | Boris Ryabko et.al. | 2404.02708 | null |
2024-04-03 | Optimizing traffic signs and lights visibility for the teleoperation of autonomous vehicles through ROI compression | I. Dror et.al. | 2404.02481 | null |
2024-04-03 | MOPAR: A Model Partitioning Framework for Deep Learning Inference Services on Serverless Platforms | Jiaang Duan et.al. | 2404.02445 | null |
2024-04-02 | NeRFCodec: Neural Feature Compression Meets Neural Radiance Fields for Memory-Efficient Scene Representation | Sicheng Li et.al. | 2404.02185 | null |
2024-04-01 | The Rate-Distortion-Perception Trade-off: The Role of Private Randomness | Yassine Hamdi et.al. | 2404.01111 | null |
2024-03-31 | Metric dimensions of generalized Sierpiński graphs over squares | Savari Prabhu et.al. | 2404.00771 | null |
2024-03-27 | Computationally and Memory-Efficient Robust Predictive Analytics Using Big Data | Daniel Menges et.al. | 2403.19721 | null |
2024-03-28 | RootInteractive tool for multidimensional statistical analysis, machine learning and analytical model validation | Marian Invanov et.al. | 2403.19330 | null |
2024-03-28 | Uncertainty-Aware Deep Video Compression with Ensembles | Wufei Ma et.al. | 2403.19158 | null |
2024-04-08 | Neural Embedding Compression For Efficient Multi-Task Earth Observation Modelling | Carlos Gomes et.al. | 2403.17886 | link |
2024-03-26 | Low-Latency Neural Stereo Streaming | Qiqi Hou et.al. | 2403.17879 | null |
2024-03-26 | Fully-fused Multi-Layer Perceptrons on Intel Data Center GPUs | Kai Yuan et.al. | 2403.17607 | link |
2024-03-25 | Neural Image Compression with Quantization Rectifier | Wei Luo et.al. | 2403.17236 | null |
2024-03-25 | Invertible Diffusion Models for Compressed Sensing | Bin Chen et.al. | 2403.17006 | null |
2024-03-25 | Virtual Cylindrical PET for Efficient DOI Image Reconstruction with Sub-millimetre Resolution | Francisco E Enríquez-Mier-y-Terán et.al. | 2403.16465 | null |
2024-03-25 | Impact of Video Compression Artifacts on Fisheye Camera Visual Perception Tasks | Madhumitha Sakthi et.al. | 2403.16338 | null |
2024-03-24 | Laplacian-guided Entropy Model in Neural Codec with Blur-dissipated Synthesis | Atefeh Khoshkhahtinat et.al. | 2403.16258 | null |
2024-03-23 | Understanding The Effectiveness of Lossy Compression in Machine Learning Training Sets | Robert Underwood et.al. | 2403.15953 | null |
2024-03-23 | Droplet shape representation using Fourier series and autoencoders | Mihir Durve et.al. | 2403.15797 | null |
2024-03-21 | S2LIC: Learned Image Compression with the SwinV2 Block, Adaptive Channel-wise and Global-inter Attention Context | Yongqiang Wang et.al. | 2403.14471 | link |
2024-03-21 | Tensor network compressibility of convolutional models | Sukhbinder Singh et.al. | 2403.14379 | null |
2024-03-26 | Powerful Lossy Compression for Noisy Images | Shilv Cai et.al. | 2403.14135 | null |
2024-03-20 | String attractors and bi-infinite words | Pierre Béaur et.al. | 2403.13449 | null |
2024-03-19 | Super-High-Fidelity Image Compression via Hierarchical-ROI and Adaptive Quantization | Jixiang Luo et.al. | 2403.13030 | null |
2024-03-19 | Privacy-Preserving Face Recognition Using Trainable Feature Subtraction | Yuxi Mi et.al. | 2403.12457 | link |
2024-03-19 | VQ-NeRV: A Vector Quantized Neural Representation for Videos | Yunjie Xu et.al. | 2403.12401 | link |
2024-03-18 | Encoding of linear kinetic plasma problems in quantum circuits via data compression | Ivan Novikau et.al. | 2403.11989 | null |
2024-03-18 | Object Segmentation-Assisted Inter Prediction for Versatile Video Coding | Zhuoyuan Li et.al. | 2403.11694 | null |
2024-03-18 | Overfitted image coding at reduced complexity | Théophile Blard et.al. | 2403.11651 | link |
2024-03-18 | Hierarchical Frequency-based Upsampling and Refining for Compressed Video Quality Enhancement | Qianyu Zhang et.al. | 2403.11556 | null |
2024-03-18 | Earth+: on-board satellite imagery compression leveraging historical earth observations | Kuntai Du et.al. | 2403.11434 | null |
2024-03-17 | Fidelity-preserving Learning-Based Image Compression: Loss Function and Subjective Evaluation Methodology | Shima Mohammadi et.al. | 2403.11241 | link |
2024-03-16 | Channel-wise Feature Decorrelation for Enhanced Learned Image Compression | Farhad Pakdaman et.al. | 2403.10936 | null |
2024-03-16 | NARRATE: Versatile Language Architecture for Optimal Control in Robotics | Seif Ismail et.al. | 2403.10762 | link |
2024-03-15 | Process-and-Forward: Deep Joint Source-Channel Coding Over Cooperative Relay Networks | Chenghong Bian et.al. | 2403.10613 | null |
2024-03-15 | CPGA: Coding Priors-Guided Aggregation Network for Compressed Video Quality Enhancement | Qiang Zhu et.al. | 2403.10362 | link |
2024-03-15 | Interactive Distance Field Mapping and Planning to Enable Human-Robot Collaboration | Usama Ali et.al. | 2403.09988 | link |
2024-03-14 | SketchINR: A First Look into Sketches as Implicit Neural Representations | Hmrishav Bandyopadhyay et.al. | 2403.09344 | link |
2024-03-14 | Noise Dimension of GAN: An Image Compression Perspective | Ziran Zhu et.al. | 2403.09196 | null |
2024-03-20 | Content-aware Masked Image Modeling Transformer for Stereo Image Compression | Xinjie Zhang et.al. | 2403.08505 | null |
2024-03-12 | Approaching Rate-Distortion Limits in Neural Compression with Lattice Transform Coding | Eric Lei et.al. | 2403.07320 | null |
2024-03-11 | Grid Monitoring and Protection with Continuous Point-on-Wave Measurements and Generative AI | Lang Tong et.al. | 2403.06942 | null |
2024-03-16 | Enhancing Adversarial Training with Prior Knowledge Distillation for Robust Image Compression | Zhi Cao et.al. | 2403.06700 | null |
2024-03-13 | FSViewFusion: Few-Shots View Generation of Novel Objects | Rukhshanda Hussain et.al. | 2403.06394 | null |
2024-03-10 | Probing Image Compression For Class-Incremental Learning | Justin Yang et.al. | 2403.06288 | null |
2024-03-10 | Blockchain-Enabled Variational Information Bottleneck for IoT Networks | Qiong Wu et.al. | 2403.06129 | link |
2024-03-09 | Wavelet-Like Transform-Based Technology in Response to the Call for Proposals on Neural Network-Based Image Coding | Cunhui Dong et.al. | 2403.05937 | null |
2024-03-07 | Complexity-constrained quantum thermodynamics | Anthony Munson et.al. | 2403.04828 | null |
2024-03-07 | Image Coding for Machines with Edge Information Learning Using Segment Anything | Takahiro Shindo et.al. | 2403.04173 | link |
2024-03-06 | 3D Diffusion Policy | Yanjie Ze et.al. | 2403.03954 | link |
2024-03-06 | Unifying Generation and Compression: Ultra-low bitrate Image Coding Via Multi-stage Transformer | Naifu Xue et.al. | 2403.03736 | null |
2024-03-06 | ZF Beamforming Tensor Compression for Massive MIMO Fronthaul | Libin Zheng et.al. | 2403.03675 | null |
2024-03-06 | Space Complexity of Euclidean Clustering | Xiaoyi Zhu et.al. | 2403.02971 | null |
2024-03-05 | Neural Image Compression with Text-guided Encoding for both Pixel-level and Perceptual Fidelity | Hagyeong Lee et.al. | 2403.02944 | link |
2024-03-05 | Enhancing the Rate-Distortion-Perception Flexibility of Learned Image Codecs with Conditional Diffusion Decoders | Daniele Mari et.al. | 2403.02887 | null |
2024-03-04 | Dark Energy Survey Year 3 results: likelihood-free, simulation-based $w$ CDM inference with neural compression of weak-lensing map statistics | N. Jeffrey et.al. | 2403.02314 | null |
2024-03-04 | Neural Network Assisted Lifting Steps For Improved Fully Scalable Lossy Image Compression in JPEG 2000 | Xinyue Li et.al. | 2403.01647 | link |
2024-03-03 | On the Compressibility of Quantized Large Language Models | Yu Mao et.al. | 2403.01384 | null |
2024-03-02 | Towards Accurate Lip-to-Speech Synthesis in-the-Wild | Sindhu Hegde et.al. | 2403.01087 | null |
2024-03-01 | Region-Adaptive Transform with Segmentation Prior for Image Compression | Yuxi Liu et.al. | 2403.00628 | link |
2024-03-07 | ODVista: An Omnidirectional Video Dataset for super-resolution and Quality Enhancement Tasks | Ahmed Telili et.al. | 2403.00604 | link |
2024-02-29 | Towards Explaining Deep Neural Network Compression Through a Probabilistic Latent Space | Mahsa Mozafari-Nia et.al. | 2403.00155 | null |
2024-02-29 | Deep Network for Image Compressed Sensing Coding Using Local Structural Sampling | Wenxue Cui et.al. | 2402.19111 | null |
2024-02-29 | Variable-Rate Learned Image Compression with Multi-Objective Optimization and Quantization-Reconstruction Offsets | Fatih Kamisli et.al. | 2402.18930 | link |
2024-02-29 | Towards Backward-Compatible Continual Learning of Image Compression | Zhihao Duan et.al. | 2402.18862 | link |
2024-02-29 | Exploration of Learned Lifting-Based Transform Structures for Fully Scalable and Accessible Wavelet-Like Image Compression | Xinyue Li et.al. | 2402.18761 | null |
2024-01-10 | Motion Guided Token Compression for Efficient Masked Video Modeling | Yukun Feng et.al. | 2402.18577 | null |
2024-02-28 | Tokenization Is More Than Compression | Craig W. Schmidt et.al. | 2402.18376 | link |
2024-02-28 | NERV++: An Enhanced Implicit Neural Video Representation | Ahmed Ghorbel et.al. | 2402.18305 | null |
2024-02-28 | Computing Minimal Absent Words and Extended Bispecial Factors with CDAWG Space | Shunsuke Inenaga et.al. | 2402.18090 | null |
2024-03-03 | Towards Optimal Learning of Language Models | Yuxian Gu et.al. | 2402.17759 | null |
2024-02-27 | $ζ$ -QVAE: A Quantum Variational Autoencoder utilizing Regularized Mixed-state Latent Representations | Gaoyuan Wang et.al. | 2402.17749 | null |
2024-02-27 | Bit Rate Matching Algorithm Optimization in JPEG-AI Verification Model | Panqi Jia et.al. | 2402.17487 | null |
2024-02-27 | Bit Distribution Study and Implementation of Spatial Quality Map in the JPEG-AI Standardization | Panqi Jia et.al. | 2402.17470 | null |
2024-02-29 | Neural Video Compression with Feature Modulation | Jiahao Li et.al. | 2402.17414 | link |
2024-01-19 | MB-RACS: Measurement-Bounds-based Rate-Adaptive Image Compressed Sensing Network | Yujun Huang et.al. | 2402.16855 | null |
2024-02-29 | MISC: Ultra-low Bitrate Image Semantic Compression Driven by Large Multimodal Model | Chunyi Li et.al. | 2402.16749 | link |
2024-02-26 | Enabling robust sensor network design with data processing and optimization making use of local beehive image and video files | Ephrance Eunice Namugenyi et.al. | 2402.16655 | null |
2024-02-26 | Resolution-Agnostic Neural Compression for High-Fidelity Portrait Video Conferencing via Implicit Radiance Fields | Yifei Li et.al. | 2402.16599 | null |
2024-02-26 | Distortion-Controlled Dithering with Reduced Recompression Rate | Morriel Kasher et.al. | 2402.16447 | null |
2024-02-26 | Adaptive Online Learning of Separable Path Graph Transforms for Intra-prediction | Wen-Yang Lu et.al. | 2402.16371 | null |
2024-02-26 | SPC-NeRF: Spatial Predictive Compression for Voxel Based Radiance Field | Zetian Song et.al. | 2402.16366 | null |
2024-02-24 | Traditional Transformation Theory Guided Model for Learned Image Compression | Zhiyuan Li et.al. | 2402.15744 | null |
2024-02-22 | Distributed Radiance Fields for Edge Video Compression and Metaverse Integration in Autonomous Driving | Eugen Šlapak et.al. | 2402.14642 | null |
2024-02-21 | Exploring the Limits of Semantic Image Compression at Micro-bits per Pixel | Jordan Dotzel et.al. | 2402.13536 | null |
2024-02-20 | Compressing the two-particle Green’s function using wavelets: Theory and application to the Hubbard atom | Emin Moghadas et.al. | 2402.13030 | null |
2024-02-20 | RealCompo: Dynamic Equilibrium between Realism and Compositionality Improves Text-to-Image Diffusion Models | Xinchen Zhang et.al. | 2402.12908 | link |
2024-02-20 | Transformer-based Learned Image Compression for Joint Decoding and Denoising | Yi-Hsin Chen et.al. | 2402.12888 | null |
2024-02-19 | Weakly Supervised Object Detection in Chest X-Rays with Differentiable ROI Proposal Networks and Soft ROI Pooling | Philip Müller et.al. | 2402.11985 | link |
2024-02-18 | 3D Point Cloud Compression with Recurrent Neural Network and Image Compression Methods | Till Beemelmanns et.al. | 2402.11680 | link |
2024-02-18 | Learning to Learn Faster from Human Feedback with Language Model Predictive Control | Jacky Liang et.al. | 2402.11450 | null |
2024-02-17 | TinyLIC-High efficiency lossy image compression method | Gaocheng Ma et.al. | 2402.11164 | null |
2024-02-15 | Analysis of Neural Video Compression Networks for 360-Degree Video Coding | Andy Regensky et.al. | 2402.10257 | null |
2024-02-14 | Reducing Texture Bias of Deep Neural Networks via Edge Enhancing Diffusion | Edgar Heinert et.al. | 2402.09530 | link |
2024-02-14 | A Comprehensive Review of Software and Hardware Energy Efficiency of Video Decoders | Matthias Kränzler et.al. | 2402.09001 | null |
2024-02-14 | Extreme Video Compression with Pre-trained Diffusion Models | Bohan Li et.al. | 2402.08934 | link |
2024-02-14 | Saliency-aware End-to-end Learned Variable-Bitrate 360-degree Image Compression | Oguzhan Gungordu et.al. | 2402.08862 | null |
2024-02-13 | Learned Image Compression with Text Quality Enhancement | Chih-Yu Lai et.al. | 2402.08643 | null |
2024-02-13 | Motion-Adaptive Inference for Flexible Learned B-Frame Compression | M. Akin Yilmaz et.al. | 2402.08550 | null |
2024-02-21 | A Neural-network Enhanced Video Coding Framework beyond ECM | Yanchen Zhao et.al. | 2402.08397 | null |
2024-02-13 | Improving Image Coding for Machines through Optimizing Encoder via Auxiliary Loss | Kei Iino et.al. | 2402.08267 | null |
2024-02-12 | Distributed Compression in the Era of Machine Learning: A Review of Recent Advances | Ezgi Ozyilkan et.al. | 2402.07997 | null |
2024-02-13 | Towards Meta-Pruning via Optimal Transport | Alexander Theus et.al. | 2402.07839 | link |
2024-02-09 | Parameter estimation for quantum jump unraveling | Marco Radaelli et.al. | 2402.06556 | link |
2024-02-07 | RAGE for the Machine: Image Compression with Low-Cost Random Access for Embedded Applications | Christian D. Rask et.al. | 2402.05974 | null |
2024-02-08 | Sandwiched Compression: Repurposing Standard Codecs with Neural Network Wrappers | Onur G. Guleryuz et.al. | 2402.05887 | link |
2024-02-08 | Joint End-to-End Image Compression and Denoising: Leveraging Contrastive Learning and Multi-Scale Self-ONNs | Yuxin Xie et.al. | 2402.05582 | null |
2024-02-05 | TexShape: Information Theoretic Sentence Embedding for Language Models | H. Kaan Kale et.al. | 2402.05132 | link |
2024-02-07 | Compression of Structured Data with Autoencoders: Provable Benefit of Nonlinearities and Depth | Kevin Kögler et.al. | 2402.05013 | null |
2024-02-06 | A Novel Local and Hyper-Local Multicast Services Transmission Scheme for Beyond 5G Networks | Sweta Singh et.al. | 2402.03963 | null |
2024-02-06 | Cool-chic video: Learned video coding with 800 parameters | Thomas Leguay et.al. | 2402.03179 | link |
2024-02-05 | Perceptual Learned Image Compression via End-to-End JND-Based Optimization | Farhad Pakdaman et.al. | 2402.02836 | null |
2024-02-04 | Discovering More Effective Tensor Network Structure Search Algorithms via Large Language Models (LLMs) | Junhua Zeng et.al. | 2402.02456 | link |
2024-03-04 | RecNet: An Invertible Point Cloud Encoding through Range Image Embeddings for Multi-Robot Map Sharing and Reconstruction | Nikolaos Stathoulopoulos et.al. | 2402.02192 | null |
2024-02-03 | Generative Visual Compression: A Review | Bolin Chen et.al. | 2402.02140 | null |
2024-02-23 | Immersive Video Compression using Implicit Neural Representations | Ho Man Kwan et.al. | 2402.01596 | link |
2024-02-02 | Efficient Dynamic-NeRF Based Volumetric Video Coding with Rate Distortion Optimization | Zhiyu Zhang et.al. | 2402.01380 | null |
2024-02-02 | UCVC: A Unified Contextual Video Compression Framework with Joint P-frame and B-frame Coding | Jiayu Yang et.al. | 2402.01289 | null |
2024-02-02 | Flexible Variational Information Bottleneck: Achieving Diverse Compression with a Single Training | Sota Kudo et.al. | 2402.01238 | link |
2024-02-02 | The O2 software framework and GPU usage in ALICE online and offline reconstruction in Run 3 | Giulio Eulisse et.al. | 2402.01205 | null |
2024-02-01 | Compressed image quality assessment using stacking | S. Farhad Hosseini-Benvidi et.al. | 2402.00993 | null |
2024-02-04 | Evaluating Large Language Models for Generalization and Robustness via Data Compression | Yucheng Li et.al. | 2402.00861 | link |
2024-03-11 | LVC-LGMC: Joint Local and Global Motion Compensation for Learned Video Compression | Wei Jiang et.al. | 2402.00680 | null |
2024-02-01 | Gain of Grain: A Film Grain Handling Toolchain for VVC-based Open Implementations | Vignesh V Menon et.al. | 2402.00622 | null |
2024-01-31 | EPSD: Early Pruning with Self-Distillation for Efficient Model Compression | Dong Chen et.al. | 2402.00084 | null |
2024-01-31 | A Neural Enhancement Post-Processor with a Dynamic AV1 Encoder Configuration Strategy for CLIC 2024 | Darren Ramsook et.al. | 2401.18021 | null |
2024-01-31 | Robustly overfitting latents for flexible neural image compression | Yura Perugachi-Diaz et.al. | 2401.17789 | null |
2024-01-30 | A Group Theoretic Metric for Robot State Estimation Leveraging Chebyshev Interpolation | Varun Agrawal et.al. | 2401.17463 | null |
2024-01-30 | SLIC: A Learned Image Codec Using Structure and Color | Srivatsa Prativadibhayankaram et.al. | 2401.17246 | link |
2024-01-30 | Large Language Model Evaluation via Matrix Entropy | Lai Wei et.al. | 2401.17139 | link |
2024-01-30 | Local integrals of motion in dipole-conserving models with Hilbert space fragmentation | Patrycja Łydżba et.al. | 2401.17097 | null |
2024-01-29 | On Channel Simulation with Causal Rejection Samplers | Daniel Goc et.al. | 2401.16579 | null |
2024-01-29 | Spatial Decomposition and Temporal Fusion based Inter Prediction for Learned Video Compression | Xihua Sheng et.al. | 2401.15864 | null |
2024-01-29 | Bayesian one- and two-sided inference on the local effective dimension | Eduard Belitser et.al. | 2401.15816 | null |
2024-01-28 | Towards Arbitrary-Scale Histopathology Image Super-resolution: An Efficient Dual-branch Framework via Implicit Self-texture Enhancement | Minghong Duan et.al. | 2401.15613 | null |
2024-01-26 | Shadow simulation of quantum processes | Xuanqiang Zhao et.al. | 2401.14934 | null |
2024-01-26 | Study of the gOMP Algorithm for Recovery of Compressed Sensed Hyperspectral Images | Jon Alvarez Justo et.al. | 2401.14786 | null |
2024-01-26 | A Comparative Study of Compressive Sensing Algorithms for Hyperspectral Imaging Reconstruction | Jon Alvarez Justo et.al. | 2401.14762 | null |
2024-01-26 | Residual Quantization with Implicit Neural Codebooks | Iris Huijben et.al. | 2401.14732 | link |
2024-01-25 | Semantic Ensemble Loss and Latent Refinement for High-Fidelity Neural Image Compression | Daxin Li et.al. | 2401.14007 | null |
2024-02-07 | Perceptual-oriented Learned Image Compression with Dynamic Kernel | Nianxiang Fu et.al. | 2401.13967 | null |
2024-01-25 | Conditional Neural Video Coding with Spatial-Temporal Super-Resolution | Henan Wang et.al. | 2401.13959 | null |
2024-01-24 | FLLIC: Functionally Lossless Image Compression | Xi Zhang et.al. | 2401.13616 | null |
2024-01-23 | Fast Implicit Neural Representation Image Codec in Resource-limited Devices | Xiang Liu et.al. | 2401.12587 | null |
2024-01-22 | PairwiseHist: Fast, Accurate and Space-Efficient Approximate Query Processing with Data Compression | Aaron Hurst et.al. | 2401.12018 | null |
2024-01-22 | A Training-Free Defense Framework for Robust Learned Image Compression | Myungseo Song et.al. | 2401.11902 | null |
2024-01-21 | Another Way to the Top: Exploit Contextual Clustering in Learned Image Coding | Yichi Zhang et.al. | 2401.11615 | null |
2024-01-21 | ColorVideoVDP: A visual difference predictor for image, video and display distortions | Rafal K. Mantiuk et.al. | 2401.11485 | link |
2024-01-21 | Data-driven compression of electron-phonon interactions | Yao Luo et.al. | 2401.11393 | null |
2024-01-20 | Learned Image Compression with Dual-Branch Encoder and Conditional Information Coding | Haisheng Fu et.al. | 2401.11093 | null |
2024-01-19 | NN-VVC: Versatile Video Coding boosted by self-supervisedly learned image coding for machines | Jukka I. Ahonen et.al. | 2401.10761 | null |
2024-01-19 | Bridging the gap between image coding for machines and humans | Nam Le et.al. | 2401.10732 | null |
2024-01-18 | Attack and Defense Analysis of Learned Image Compression | Tianyu Zhu et.al. | 2401.10345 | null |
2024-01-18 | Explaining the Implicit Neural Canvas: Connecting Pixels to Neurons by Tracing their Contributions | Namitha Padmanabhan et.al. | 2401.10217 | null |
2024-01-18 | Depth Over RGB: Automatic Evaluation of Open Surgery Skills Using Depth Camera | Ido Zuckerman et.al. | 2401.10037 | null |
2024-01-18 | Memory Efficient Corner Detection for Event-driven Dynamic Vision Sensors | Pao-Sheng Vincent Sun et.al. | 2401.09797 | null |
2024-01-18 | Compressing MIMO Channel Submatrices with Tucker Decomposition: Enabling Efficient Storage and Reducing SINR Computation Overhead | Yuanwei Zhang et.al. | 2401.09792 | null |
2024-01-17 | Idempotence and Perceptual Image Compression | Tongda Xu et.al. | 2401.08920 | link |
2024-01-16 | End-to-End Optimized Image Compression with the Frequency-Oriented Transform | Yuefeng Zhang et.al. | 2401.08194 | null |
2024-01-17 | Learned Image Compression with ROI-Weighted Distortion and Bit Allocation | Wei Jiang et.al. | 2401.08154 | null |
2024-01-15 | Convolutional Neural Network Compression via Dynamic Parameter Rank Pruning | Manish Sharma et.al. | 2401.08014 | null |
2024-01-15 | Machine Perceptual Quality: Evaluating the Impact of Severe Lossy Compression on Audio and Image Models | Dan Jacobellis et.al. | 2401.07957 | link |
2024-01-14 | Exploring Compressed Image Representation as a Perceptual Proxy: A Study | Chen-Hsiu Huang et.al. | 2401.07200 | link |
2024-01-13 | Progressive Feature Fusion Network for Enhancing Image Quality Assessment | Kaiqun Wu et.al. | 2401.06992 | null |
2024-01-12 | Efficient Parallel Algorithms for Inpainting-Based Representations of 4K Images – Part II: Spatial and Tonal Data Optimization | Niklas Kämper et.al. | 2401.06747 | null |
2024-03-18 | LiDAR Depth Map Guided Image Compression Model | Alessandro Gnutti et.al. | 2401.06517 | null |
2024-01-11 | Transformer Masked Autoencoders for Next-Generation Wireless Communications: Architecture and Opportunities | Abdullah Zayat et.al. | 2401.06274 | null |
2024-01-11 | MGARD: A multigrid framework for high-performance, error-controlled data compression and refactoring | Qian Gong et.al. | 2401.05994 | null |
2024-01-10 | SnapCap: Efficient Snapshot Compressive Video Captioning | Jianqiao Sun et.al. | 2401.04903 | null |
2024-01-09 | Modified Levenberg-Marquardt Algorithm For Tensor CP Decomposition in Image Compression | Ramin Goudarzi Karim et.al. | 2401.04670 | null |
2024-01-09 | Optimal Transcoding Resolution Prediction for Efficient Per-Title Bitrate Ladder Estimation | Jinhai Yang et.al. | 2401.04405 | null |
2024-01-08 | Low-light Image Enhancement via CLIP-Fourier Guided Wavelet Diffusion | Minglong Xue et.al. | 2401.03788 | link |
2024-01-08 | A Video Coding Method Based on Neural Network for CLIC2024 | Zhengang Li et.al. | 2401.03623 | null |
2024-01-06 | Spatiotemporally adaptive compression for scientific dataset with feature preservation – a case study on simulation data with extreme climate events analysis | Qian Gong et.al. | 2401.03317 | null |
2024-01-06 | Comparison of spectrum models as applied to single-particle $\bf p_t$ spectra from high-energy p-p collisions and their physical interpretations | Thomas A. Trainor et.al. | 2401.03290 | null |
2024-01-06 | Transferable Learned Image Compression-Resistant Adversarial Perturbations | Yang Sui et.al. | 2401.03115 | null |
2024-01-05 | MsDC-DEQ-Net: Deep Equilibrium Model (DEQ) with Multi-scale Dilated Convolution for Image Compressive Sensing (CS) | Youhao Yu et.al. | 2401.02884 | null |
2024-03-08 | Importance Matching Lemma for Lossy Compression with Side Information | Buu Phan et.al. | 2401.02609 | null |
2024-01-04 | Cool-Chic: Perceptually Tuned Low Complexity Overfitted Image Coder | Théo Ladune et.al. | 2401.02156 | link |
2024-01-04 | ED: Perceptually tuned Enhanced Compression Model | Pierrick Philippe et.al. | 2401.02145 | null |
2024-01-02 | NU-Class Net: A Novel Deep Learning-based Approach for Video Quality Enhancement | Parham Zilouchian Moghaddam et.al. | 2401.01163 | null |
2024-01-28 | Higher-Order Cellular Automata Generated Symmetry-Protected Topological Phases and Detection Through Multi-Point Strange Correlators | Jie-Yu Zhang et.al. | 2401.00505 | null |
2023-12-28 | Selective Run-Length Encoding | Xutan Peng et.al. | 2312.17024 | null |
2023-12-29 | FFCA-Net: Stereo Image Compression via Fast Cascade Alignment of Side Information | Yichong Xia et.al. | 2312.16963 | null |
2023-12-26 | Range Entropy Queries and Partitioning | Sanjay Krishnan et.al. | 2312.15959 | null |
2023-12-25 | MaskCRT: Masked Conditional Residual Transformer for Learned Video Compression | Yi-Hsin Chen et.al. | 2312.15829 | null |
2023-12-25 | On Robust Wasserstein Barycenter: The Model and Algorithm | Xu Wang et.al. | 2312.15762 | null |
2023-12-25 | Scalable Face Image Coding via StyleGAN Prior: Towards Compression for Human-Machine Collaborative Vision | Qi Mao et.al. | 2312.15622 | null |
2023-12-22 | The Rate-Distortion-Perception-Classification Tradeoff: Joint Source Coding and Modulation via Inverse-Domain GANs | Junli Fang et.al. | 2312.14792 | null |
2024-01-09 | Enhanced Color Palette Modeling for Lossless Screen Content Compression | Hannah Och et.al. | 2312.14491 | null |
2023-12-30 | Efficient Communication in Federated Learning Using Floating-Point Lossy Compression | Grant Wilkins et.al. | 2312.13461 | null |
2023-12-19 | A Huffman based short message service compression technique using adjacent distance array | Pranta Sarker et.al. | 2312.12495 | null |
2023-12-19 | Full-reference Video Quality Assessment for User Generated Content Transcoding | Zihao Qi et.al. | 2312.12317 | null |
2023-12-19 | Low-Consumption Partial Transcoding by HEVC | Mohsen Abdoli et.al. | 2312.12174 | link |
2023-12-19 | Comparative Study of Hardware and Software Power Measurements in Video Compression | Angeliki Katsenou et.al. | 2312.12150 | null |
2023-12-18 | Blind-Touch: Homomorphic Encryption-Based Distributed Neural Network Inference for Privacy-Preserving Fingerprint Authentication | Hyunmin Choi et.al. | 2312.11575 | link |
2024-01-11 | Quantized Decoder in Learned Image Compression for Deterministic Reconstruction | Esin Koyuncu et.al. | 2312.11209 | null |
2023-12-19 | A Computationally Efficient Neural Video Compression Accelerator Based on a Sparse CNN-Transformer Hybrid Network | Siyu Zhang et.al. | 2312.10716 | null |
2023-12-17 | IntraSeismic: a coordinate-based learning approach to seismic inversion | Juan Romero et.al. | 2312.10568 | null |
2023-12-17 | Light-weight CNN-based VVC Inter Partitioning Acceleration | Yiqun Liu et.al. | 2312.10567 | null |
2023-12-16 | Statistical Analysis of Inter Coding in VVC Test Model (VTM) | Yiqun Liu et.al. | 2312.10406 | null |
2023-12-15 | IQNet: Image Quality Assessment Guided Just Noticeable Difference Prefiltering For Versatile Video Coding | Yu-Han Sun et.al. | 2312.09799 | null |
2023-12-15 | Towards Neuromorphic Compression based Neural Sensing for Next-Generation Wireless Implantable Brain Machine Interface | Vivek Mohan et.al. | 2312.09503 | null |
2023-12-14 | Geometry-Corrected Geodesic Motion Modeling with Per-Frame Camera Motion for 360-Degree Video Compression | Andy Regensky et.al. | 2312.09266 | link |
2023-12-14 | Efficient Online Learning of Contact Force Models for Connector Insertion | Kevin Tracy et.al. | 2312.09190 | null |
2023-12-13 | Balanced and Deterministic Weight-sharing Helps Network Performance | Oscar Chang et.al. | 2312.08401 | null |
2023-12-13 | Preparing VVC for Streaming: A Fast Multi-Rate Encoding Approach | Yiqun Liu et.al. | 2312.08330 | null |
2023-12-13 | CenterGrasp: Object-Aware Implicit Representation Learning for Simultaneous Shape Reconstruction and 6-DoF Grasp Estimation | Eugenio Chisari et.al. | 2312.08240 | null |
2023-12-13 | Explainable Trajectory Representation through Dictionary Learning | Yuanbo Tang et.al. | 2312.08052 | null |
2023-12-12 | Deep Hierarchical Video Compression | Ming Lu et.al. | 2312.07126 | null |
2023-12-12 | Communication Cost Reduction for Subgraph Counting under Local Differential Privacy via Hash Functions | Quentin Hillebrand et.al. | 2312.07055 | link |
2023-12-11 | RAFIC: Retrieval-Augmented Few-shot Image Classification | Hangfei Lin et.al. | 2312.06868 | link |
2023-12-11 | A New Projection Pursuit Index for Big Data | Yajie Duan et.al. | 2312.06465 | null |
2023-12-11 | Variational Auto-Encoder Based Deep Learning Technique For Filling Gaps in Reacting PIV Data | Shashank Yellapantula et.al. | 2312.06461 | null |
2023-12-07 | Analysis of Coding Gain Due to In-Loop Reshaping | Chau-Wai Wong et.al. | 2312.04022 | null |
2023-12-05 | C3: High-performance and low-complexity neural compression from a single image or video | Hyunjik Kim et.al. | 2312.02753 | null |
2023-12-05 | Unified learning-based lossy and lossless JPEG recompression | Jianghui Zhang et.al. | 2312.02705 | null |
2023-12-05 | Accelerating Learnt Video Codecs with Gradient Decay and Layer-wise Distillation | Tianhao Peng et.al. | 2312.02605 | null |
2023-12-04 | Hyperspectral Image Compression Using Sampling and Implicit Neural Representations | Shima Rezasoltani et.al. | 2312.01558 | null |
Quality Assessment
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-11-26 | Perceptually Optimized Super Resolution | Volodymyr Karpenko et.al. | 2411.17513 | null |
2024-11-26 | Puzzle Similarity: A Perceptually-guided No-Reference Metric for Artifact Detection in 3D Scene Reconstructions | Nicolai Hermann et.al. | 2411.17489 | null |
2024-11-26 | Structure-Guided MR-to-CT Synthesis with Spatial and Semantic Alignments for Attenuation Correction of Whole-Body PET/MR Imaging | Jiaxu Zheng et.al. | 2411.17488 | null |
2024-11-26 | Dual-Representation Interaction Driven Image Quality Assessment with Restoration Assistance | Jingtong Yue et.al. | 2411.17390 | link |
2024-11-26 | InsightEdit: Towards Better Instruction Following for Image Editing | Yingjing Xu et.al. | 2411.17323 | null |
2024-11-26 | Reward Incremental Learning in Text-to-Image Generation | Maorong Wang et.al. | 2411.17310 | null |
2024-11-26 | Grounding-IQA: Multimodal Language Grounding Model for Image Quality Assessment | Zheng Chen et.al. | 2411.17237 | link |
2024-11-26 | AIGV-Assessor: Benchmarking and Evaluating the Perceptual Quality of Text-to-Video Generation with LMM | Jiarui Wang et.al. | 2411.17221 | link |
2024-11-26 | ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting | Chengyou Jia et.al. | 2411.17176 | null |
2024-11-26 | OSDFace: One-Step Diffusion Model for Face Restoration | Jingkai Wang et.al. | 2411.17163 | link |
2024-11-26 | Motion Free B-frame Coding for Neural Video Compression | Van Thang Nguyen et.al. | 2411.17160 | null |
2024-11-26 | 4D Scaffold Gaussian Splatting for Memory Efficient Dynamic Scene Reconstruction | Woong Oh Cho et.al. | 2411.17044 | null |
2024-11-26 | TED-VITON: Transformer-Empowered Diffusion Models for Virtual Try-On | Zhenchen Wan et.al. | 2411.17017 | null |
2024-11-25 | G2SDF: Surface Reconstruction from Explicit Gaussians with Implicit SDFs | Kunyi Li et.al. | 2411.16898 | null |
2024-11-25 | Fully Automatic Deep Learning Pipeline for Whole Slide Image Quality Assessment | Falah Jabar et.al. | 2411.16885 | null |
2024-11-25 | LegoPET: Hierarchical Feature Guided Conditional Diffusion for PET Image Reconstruction | Yiran Sun et.al. | 2411.16629 | null |
2024-11-25 | Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric | Zhichao Zhang et.al. | 2411.16619 | null |
2024-11-25 | Coherence Based Sound Speed Aberration Correction – with clinical validation in obstetric ultrasound | Anders Emil Vrålstad et.al. | 2411.16551 | null |
2024-11-25 | Synthesising Handwritten Music with GANs: A Comprehensive Evaluation of CycleWGAN, ProGAN, and DCGAN | Elona Shatri et.al. | 2411.16405 | null |
2024-11-25 | Human-Calibrated Automated Testing and Validation of Generative Language Models | Agus Sudjianto et.al. | 2411.16391 | null |
2024-11-25 | Bounds for the maximum modulus of polynomial roots with nearly optimal worst-case overestimation | Prashant Batra et.al. | 2411.16385 | null |
2024-11-25 | Privacy-Preserving Federated Foundation Model for Generalist Ultrasound Artificial Intelligence | Yuncheng Jiang et.al. | 2411.16380 | null |
2024-11-25 | Sonic: Shifting Focus to Global Audio Perception in Portrait Animation | Xiaozhong Ji et.al. | 2411.16331 | null |
2024-11-25 | EPS: Efficient Patch Sampling for Video Overfitting in Deep Super-Resolution Model Training | Yiying Wei et.al. | 2411.16312 | null |
2024-11-25 | Weakly supervised image segmentation for defect-based grading of fresh produce | Manuel Knott et.al. | 2411.16219 | null |
2024-11-25 | VIRES: Video Instance Repainting with Sketch and Text Guidance | Shuchen Weng et.al. | 2411.16199 | null |
2024-11-25 | Image Generation Diversity Issues and How to Tame Them | Mischa Dombrowski et.al. | 2411.16171 | link |
2024-11-25 | ENCLIP: Ensembling and Clustering-Based Contrastive Language-Image Pretraining for Fashion Multimodal Search with Limited Data and Low-Quality Images | Prithviraj Purushottam Naik et.al. | 2411.16096 | null |
2024-11-25 | AI-Generated Image Quality Assessment Based on Task-Specific Prompt and Multi-Granularity Similarity | Jili Xia et.al. | 2411.16087 | null |
2024-11-24 | Distribution models of antennas in radio astronomy: Efficiency comparison of the golden spiral interferometry | Elio Quiroga Rodriguez et.al. | 2411.15904 | null |
2024-11-24 | A review on Machine Learning based User-Centric Multimedia Streaming Techniques | Monalisa Ghosh et.al. | 2411.15801 | null |
2024-11-24 | LTCF-Net: A Transformer-Enhanced Dual-Channel Fourier Framework for Low-Light Image Restoration | Gaojing Zhang et.al. | 2411.15740 | null |
2024-11-23 | SPA: Efficient User-Preference Alignment against Uncertainty in Medical Image Segmentation | Jiayuan Zhu et.al. | 2411.15513 | null |
2024-11-23 | Automatic Evaluation for Text-to-image Generation: Task-decomposed Framework, Distilled Training, and Meta-evaluation Benchmark | Rong-Cheng Tu et.al. | 2411.15488 | null |
2024-11-22 | HeadRouter: A Training-free Image Editing Framework for MM-DiTs by Adaptively Routing Attention Heads | Yu Xu et.al. | 2411.15034 | null |
2024-11-22 | FloAt: Flow Warping of Self-Attention for Clothing Animation Generation | Swasti Shreya Mishra et.al. | 2411.15028 | null |
2024-11-22 | Information Extraction from Heterogenous Documents without Ground Truth Labels using Synthetic Label Generation and Knowledge Distillation | Aniket Bhattacharyya et.al. | 2411.14957 | null |
2024-11-22 | Evaluating Vision Transformer Models for Visual Quality Control in Industrial Manufacturing | Miriam Alber et.al. | 2411.14953 | link |
2024-11-22 | Fast High-Quality Enhanced Imaging Algorithm for Layered Dielectric Targets Based on MMW MIMO-SAR System | Xu Chen et.al. | 2411.14837 | null |
2024-11-22 | BrightVAE: Luminosity Enhancement in Underexposed Endoscopic Images | Farzaneh Koohestani et.al. | 2411.14663 | null |
2024-11-22 | VQalAttent: a Transparent Speech Generation Pipeline based on Transformer-learned VQ-VAE Latent Space | Armani Rodriguez et.al. | 2411.14642 | null |
2024-11-21 | Unveiling the Hidden: A Comprehensive Evaluation of Underwater Image Enhancement and Its Impact on Object Detection | Ali Awad et.al. | 2411.14626 | null |
2024-11-21 | Optimal Transcoding Preset Selection for Live Video Streaming | Zahra Nabizadeh et.al. | 2411.14613 | null |
2024-11-21 | Roadmap on Advances in Visual and Physiological Optics | Jesús E. Gómez-Correa et.al. | 2411.14606 | null |
2024-11-21 | Night-to-Day Translation via Illumination Degradation Disentanglement | Guanzhou Lan et.al. | 2411.14504 | null |
2024-11-21 | Regional Attention for Shadow Removal | Hengxing Liu et.al. | 2411.14201 | link |
2024-11-21 | Image Compression Using Novel View Synthesis Priors | Luyuan Peng et.al. | 2411.13862 | null |
2024-11-21 | Detecting Human Artifacts from Text-to-Image Models | Kaihong Wang et.al. | 2411.13842 | link |
2024-11-21 | Robust Steganography with Boundary-Preserving Overflow Alleviation and Adaptive Error Correction | Yu Cheng et.al. | 2411.13819 | null |
2024-11-21 | Edge-Cloud Routing for Text-to-Image Model with Token-Level Multi-Metric Prediction | Zewei Xin et.al. | 2411.13787 | null |
2024-11-20 | What You See Is What Matters: A Novel Visual and Physics-Based Metric for Evaluating Video Generation Quality | Zihan Wang et.al. | 2411.13609 | null |
2024-11-20 | HF-Diff: High-Frequency Perceptual Loss and Distribution Matching for One-Step Diffusion-Based Image Super-Resolution | Shoaib Meraj Sami et.al. | 2411.13548 | null |
2024-11-20 | RTSR: A Real-Time Super-Resolution Model for AV1 Compressed Content | Yuxuan Jiang et.al. | 2411.13362 | null |
2024-11-20 | OceanLens: An Adaptive Backscatter and Edge Correction using Deep Learning Model for Enhanced Underwater Imaging | Rajini Makam et.al. | 2411.13230 | null |
2024-11-20 | ESARM: 3D Emotional Speech-to-Animation via Reward Model from Automatically-Ranked Demonstrations | Xulong Zhang et.al. | 2411.13089 | null |
2024-11-20 | LMM-driven Semantic Image-Text Coding for Ultra Low-bitrate Learned Image Compression | Shimon Murai et.al. | 2411.13033 | link |
2024-11-19 | HyperGAN-CLIP: A Unified Framework for Domain Adaptation, Image Synthesis and Manipulation | Abdul Basit Anees et.al. | 2411.12832 | link |
2024-11-19 | Mitigating Perception Bias: A Training-Free Approach to Enhance LMM for Image Quality Assessment | Siyi Pan et.al. | 2411.12791 | null |
2024-11-19 | Stochastic BIQA: Median Randomized Smoothing for Certified Blind Image Quality Assessment | Ekaterina Shumitskaya et.al. | 2411.12575 | null |
2024-11-19 | PR-ENDO: Physically Based Relightable Gaussian Splatting for Endoscopy | Joanna Kaleta et.al. | 2411.12510 | link |
2024-11-19 | A $\ell_2-\ell_p$ regulariser based model for Poisson noise removal using augmented Lagrangian method | Abdul Halim et.al. | 2411.12457 | null |
2024-11-19 | Frequency-Aware Guidance for Blind Image Restoration via Diffusion Models | Jun Xiao et.al. | 2411.12450 | null |
2024-11-19 | Acquire Precise and Comparable Fundus Image Quality Score: FTHNet and FQS Dataset | Zheng Gong et.al. | 2411.12273 | null |
2024-11-19 | Performance of Large Language Models in Technical MRI Question Answering: A Comparative Study | Alan B McMillan et.al. | 2411.12238 | null |
2024-11-19 | Tangential Randomization in Linear Bandits (TRAiL): Guaranteed Inference and Regret Bounds | Arda Güçlü et.al. | 2411.12154 | null |
2024-11-18 | FruitNinja: 3D Object Interior Texture Generation with Gaussian Splatting | Fangyu Wu et.al. | 2411.12089 | null |
2024-11-18 | Edge-Enhanced Dilated Residual Attention Network for Multimodal Medical Image Fusion | Meng Zhou et.al. | 2411.11799 | link |
2024-11-18 | Additional Tests for TV 3.0 | Eduardo Peixoto et.al. | 2411.11755 | null |
2024-11-18 | Towards Degradation-Robust Reconstruction in Generalizable NeRF | Chan Ho Park et.al. | 2411.11691 | null |
2024-11-18 | CLUE-MARK: Watermarking Diffusion Models using CLWE | Kareem Shehata et.al. | 2411.11434 | null |
2024-11-17 | BVI-CR: A Multi-View Human Dataset for Volumetric Video Compression | Ge Gao et.al. | 2411.11199 | null |
2024-11-17 | Enhanced Anime Image Generation Using USE-CMHSA-GAN | J. Lu et.al. | 2411.11179 | null |
2024-11-17 | Pitch-and-Spectrum-Aware Singing Quality Assessment with Bias Correction and Model Fusion | Yu-Fei Shi et.al. | 2411.11123 | null |
2024-11-17 | MolParser: End-to-end Visual Recognition of Molecule Structures in the Wild | Xi Fang et.al. | 2411.11098 | null |
2024-11-17 | Spectral Subspace Clustering for Attributed Graphs | Xiaoyang Lin et.al. | 2411.11074 | link |
2024-11-17 | Skeleton-Guided Spatial-Temporal Feature Learning for Video-Based Visible-Infrared Person Re-Identification | Wenjia Jiang et.al. | 2411.11069 | null |
2024-11-17 | Hyperspectral Imaging-Based Grain Quality Assessment With Limited Labelled Data | Priyabrata Karmakar et.al. | 2411.10924 | null |
2024-11-16 | HJ-Ky-0.1: an Evaluation Dataset for Kyrgyz Word Embeddings | Anton Alekseev et.al. | 2411.10724 | null |
2024-11-15 | M-VAR: Decoupled Scale-wise Autoregressive Modeling for High-Quality Image Generation | Sucheng Ren et.al. | 2411.10433 | link |
2024-11-15 | On the Foundation Model for Cardiac MRI Reconstruction | Chi Zhang et.al. | 2411.10403 | null |
2024-11-15 | Modification Takes Courage: Seamless Image Stitching via Reference-Driven Inpainting | Ziqi Xie et.al. | 2411.10309 | link |
2024-11-15 | The Unreasonable Effectiveness of Guidance for Diffusion Models | Tim Kaiser et.al. | 2411.10257 | null |
2024-11-15 | Block based Adaptive Compressive Sensing with Sampling Rate Control | Kosuke Iwama et.al. | 2411.10200 | null |
2024-11-15 | Visual question answering based evaluation metrics for text-to-image generation | Mizuki Miyamoto et.al. | 2411.10183 | null |
2024-11-15 | SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning | Zewen Chen et.al. | 2411.10161 | link |
2024-11-15 | Towards Multi-View Consistent Style Transfer with One-Step Diffusion via Vision Conditioning | Yushen Zuo et.al. | 2411.10130 | null |
2024-11-15 | EveGuard: Defeating Vibration-based Side-Channel Eavesdropping with Audio Adversarial Perturbations | Jung-Woo Chang et.al. | 2411.10034 | null |
2024-11-14 | Video Denoising in Fluorescence Guided Surgery | Trevor Seets et.al. | 2411.09798 | null |
2024-11-14 | Research evaluation with ChatGPT: Is it age, country, length, or field biased? | Mike Thelwall et.al. | 2411.09768 | null |
2024-11-14 | Evaluating the Predictive Capacity of ChatGPT for Academic Peer Review Outcomes Across Multiple Platforms | Mike Thelwall et.al. | 2411.09763 | null |
2024-11-14 | MFTIQ: Multi-Flow Tracker with Independent Matching Quality Estimation | Jonas Serych et.al. | 2411.09551 | link |
2024-11-14 | GAN-Based Architecture for Low-dose Computed Tomography Imaging Denoising | Yunuo Wang et.al. | 2411.09512 | null |
2024-11-14 | Iterative tomographic reconstruction with TV prior for low-dose CBCT dental imaging | Louise Friot-Giroux et.al. | 2411.09306 | null |
2024-11-14 | LLV-FSR: Exploiting Large Language-Vision Prior for Face Super-resolution | Chenyang Wang et.al. | 2411.09293 | null |
2024-11-14 | LES-Talker: Fine-Grained Emotion Editing for Talking Head Generation in Linear Emotion Space | Guanwen Feng et.al. | 2411.09268 | null |
2024-11-14 | JoyVASA: Portrait and Animal Image Animation with Diffusion-Based Audio-Driven Facial Dynamics and Head Motion Generation | Xuyang Cao et.al. | 2411.09209 | link |
2024-11-14 | Orthogonal Linear Array based Product Beamforming for Real Time Underwater 3D Acoustical Imaging | Mimisha M Menakath et.al. | 2411.09197 | null |
2024-11-14 | Advancing Diffusion Models: Alias-Free Resampling and Enhanced Rotational Equivariance | Md Fahim Anjum et.al. | 2411.09174 | null |
2024-11-13 | Scale Contrastive Learning with Selective Attentions for Blind Image Quality Assessment | Zihao Huang et.al. | 2411.09007 | null |
2024-11-13 | Causal Explanations for Image Classifiers | Hana Chockler et.al. | 2411.08875 | link |
2024-11-13 | A novel imaging setup for hybrid radiotherapy tailored PET/MR in patients with head and neck cancer | R. M. Winter et.al. | 2411.08783 | null |
2024-11-13 | Robust Divergence Learning for Missing-Modality Segmentation | Runze Cheng et.al. | 2411.08305 | null |
2024-11-13 | Numerical Analysis of Lensless Imaging with Active Metasurfaces and Single-Pixel Detectors | Julie Belleville et.al. | 2411.08282 | null |
2024-11-12 | DuoLift-GAN:Reconstructing CT from Single-view and Biplanar X-Rays with Generative Adversarial Networks | Zhaoxi Zhang et.al. | 2411.07941 | null |
2024-11-12 | Learning Disentangled Representations for Perceptual Point Cloud Quality Assessment via Mutual Information Minimization | Ziyu Shan et.al. | 2411.07936 | null |
2024-11-12 | CT-Mamba: A Hybrid Convolutional State Space Model for Low-Dose CT Denoising | Linxuan Li et.al. | 2411.07930 | null |
2024-11-12 | Joint multi-dimensional dynamic attention and transformer for general image restoration | Huan Zhang et.al. | 2411.07893 | link |
2024-11-12 | No-Reference Point Cloud Quality Assessment via Graph Convolutional Network | Wu Chen et.al. | 2411.07728 | null |
2024-11-12 | SegQC: a segmentation network-based framework for multi-metric segmentation quality control and segmentation error detection in volumetric medical images | Bella Specktor-Fadida et.al. | 2411.07601 | null |
2024-11-12 | IR image databases generation under target intrinsic thermal variability constraints | Jerome Gilles et.al. | 2411.07577 | null |
2024-11-12 | Multi-task Feature Enhancement Network for No-Reference Image Quality Assessment | Li Yu et.al. | 2411.07556 | null |
2024-11-12 | A Novel Automatic Real-time Motion Tracking Method for Magnetic Resonance Imaging-guided Radiotherapy: Leveraging the Enhanced Tracking-Learning-Detection Framework with Automatic Segmentation | Shengqi Chen et.al. | 2411.07503 | null |
2024-11-12 | An Exploration of Parallel Imaging System for Very-low Field (50mT) MRI Scanner | Lei Yang et.al. | 2411.07489 | null |
2024-11-11 | Evaluating Detection Thresholds: The Impact of False Positives and Negatives on Super-Resolution Ultrasound Localization Microscopy | Sepideh K. Gharamaleki et.al. | 2411.07426 | null |
2024-11-11 | Exploring Variational Autoencoders for Medical Image Generation: A Comprehensive Study | Khadija Rais et.al. | 2411.07348 | null |
2024-11-11 | Artificial Intelligence-Informed Handheld Breast Ultrasound for Screening: A Systematic Review of Diagnostic Test Accuracy | Arianna Bunnell et.al. | 2411.07322 | null |
2024-11-11 | GPU-Accelerated Inverse Lithography Towards High Quality Curvy Mask Generation | Haoyu Yang et.al. | 2411.07311 | null |
2024-11-11 | A Hierarchical Compression Technique for 3D Gaussian Splatting Compression | He Huang et.al. | 2411.06976 | null |
2024-11-11 | Multi-scale Frequency Enhancement Network for Blind Image Deblurring | Yawen Xiang et.al. | 2411.06893 | null |
2024-11-11 | Wavehax: Aliasing-Free Neural Waveform Synthesis Based on 2D Convolution and Harmonic Prior for Reliable Complex Spectrogram Estimation | Reo Yoneyama et.al. | 2411.06807 | null |
2024-11-11 | Machine vision-aware quality metrics for compressed image and video assessment | Mikhail Dremin et.al. | 2411.06776 | null |
2024-11-11 | Loss-tolerant neural video codec aware congestion control for real time video communication | Zhengxu Xia et.al. | 2411.06742 | null |
2024-11-11 | 360-Degree Video Super Resolution and Quality Enhancement Challenge: Methods and Results | Ahmed Telili et.al. | 2411.06738 | null |
2024-11-11 | Accelerating Low-field MRI: Compressed Sensing and AI for fast noise-robust imaging | Efrat Shimron et.al. | 2411.06704 | link |
2024-11-10 | CASC: Condition-Aware Semantic Communication with Latent Diffusion Models | Weixuan Chen et.al. | 2411.06552 | null |
2024-11-08 | A Modular Conditional Diffusion Framework for Image Reconstruction | Magauiya Zhussip et.al. | 2411.05993 | null |
2024-11-08 | Fine-Grained Reward Optimization for Machine Translation using Error Severity Mappings | Miguel Moura Ramos et.al. | 2411.05986 | null |
2024-11-08 | Dictionary Learning with Convolutional Structure for Seismic Data Denoising and Interpolation | Murad Almadani et.al. | 2411.05956 | null |
2024-11-08 | Alternative Learning Paradigms for Image Quality Transfer | Ahmed Karam Eldaly et.al. | 2411.05885 | null |
2024-11-08 | Benchmarking 3D multi-coil NC-PDNet MRI reconstruction | Asma Tanabene et.al. | 2411.05883 | null |
2024-11-08 | Evaluating Large Language Model Capability in Vietnamese Fact-Checking Data Generation | Long Truong To et.al. | 2411.05641 | null |
2024-11-08 | DeepArUco++: Improved detection of square fiducial markers in challenging lighting conditions | Rafael Berral-Soler et.al. | 2411.05552 | link |
2024-11-08 | Improving image synthesis with diffusion-negative sampling | Alakh Desai et.al. | 2411.05473 | null |
2024-11-08 | RED: Residual Estimation Diffusion for Low-Dose PET Sinogram Reconstruction | Xingyu Ai et.al. | 2411.05354 | null |
2024-11-08 | Enhancing Depth Image Estimation for Underwater Robots by Combining Image Processing and Machine Learning | Quang Truong Nguyen et.al. | 2411.05344 | null |
2024-11-08 | A Quality-Centric Framework for Generic Deepfake Detection | Wentang Song et.al. | 2411.05335 | null |
2024-11-08 | Adaptive Whole-Body PET Image Denoising Using 3D Diffusion Models with ControlNet | Boxiao Yu et.al. | 2411.05302 | null |
2024-11-07 | Quantum Imaging and Metrology with Undetected squeezed Photons: Noise Canceling and Noise Based Imaging | S. Samimi et.al. | 2411.05175 | null |
2024-11-08 | SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models | Muyang Li et.al. | 2411.05007 | link |
2024-11-07 | Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models | Weixin Liang et.al. | 2411.04996 | null |
2024-11-07 | SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation | Koichi Namekata et.al. | 2411.04989 | null |
2024-11-07 | Uncovering Hidden Subspaces in Video Diffusion Models Using Re-Identification | Mischa Dombrowski et.al. | 2411.04956 | null |
2024-11-07 | MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views | Yuedong Chen et.al. | 2411.04924 | link |
2024-11-07 | Differentiable Gaussian Representation for Incomplete CT Reconstruction | Shaokai Wu et.al. | 2411.04844 | null |
2024-11-07 | Controlling Human Shape and Pose in Text-to-Image Diffusion Models via Domain Adaptation | Benito Buchheim et.al. | 2411.04724 | null |
2024-11-06 | Multi-Reward as Condition for Instruction-based Image Editing | Xin Gu et.al. | 2411.04713 | null |
2024-11-06 | SEE-DPO: Self Entropy Enhanced Direct Preference Optimization | Shivanshu Shekhar et.al. | 2411.04712 | null |
2024-11-07 | Generative Semantic Communications with Foundation Models: Perception-Error Analysis and Semantic-Aware Power Allocation | Chunmei Xu et.al. | 2411.04575 | null |
2024-11-07 | Bayesian Calibration of Win Rate Estimation with LLM Evaluators | Yicheng Gao et.al. | 2411.04424 | link |
2024-11-07 | A Pre-training Framework that Encodes Noise Information for Speech Quality Assessment | Subrina Sultana et.al. | 2411.04379 | null |
2024-11-06 | X-ray Single-Pixel Imaging with MPGD-based detectors | M. Simões et.al. | 2411.03907 | null |
2024-11-06 | VQA $^2$ :Visual Question Answering for Video Quality Assessment | Ziheng Jia et.al. | 2411.03795 | null |
2024-11-06 | MOS-Bench: Benchmarking Generalization Abilities of Subjective Speech Quality Assessment Models | Wen-Chin Huang et.al. | 2411.03715 | link |
2024-11-06 | Evaluating Eye Tracking Signal Quality with Real-time Gaze Interaction Simulation | Mehedi Hasan Raju et.al. | 2411.03708 | null |
2024-11-06 | Investigation of Inward-Outward Ring Permanent Magnet Array for Portable Magnetic Resonance Imaging (MRI) | Ting-Ou Liang et.al. | 2411.03249 | null |
2024-11-05 | The Impact of Medicaid Expansion on Medicare Quality Measures | Hala Algrain et.al. | 2411.03140 | null |
2024-11-05 | Investigating the Applicability of a Snapshot Computed Tomography Imaging Spectrometer for the Prediction of Brix and pH of Grapes | Mads Svanborg Peters et.al. | 2411.03114 | null |
2024-11-05 | Advances in Photoacoustic Imaging Reconstruction and Quantitative Analysis for Biomedical Applications | Lei Wang et.al. | 2411.02843 | null |
2024-11-04 | Interaction Design with Generative AI: An Empirical Study of Emerging Strategies Across the Four Phases of Design | Marie Muehlhaus et.al. | 2411.02662 | null |
2024-11-04 | Euclid: High-precision imaging astrometry and photometry from Early Release Observations. I. Internal kinematics of NGC 6397 by combining Euclid and Gaia data | M. Libralato et.al. | 2411.02487 | null |
2024-11-02 | Cross-D Conv: Cross-Dimensional Transferable Knowledge Base via Fourier Shifting Operation | Mehmet Can Yavuz et.al. | 2411.02441 | null |
2024-11-04 | Physically Based Neural Bidirectional Reflectance Distribution Function | Chenliang Zhou et.al. | 2411.02347 | null |
2024-11-04 | Diffusion-based Generative Multicasting with Intent-aware Semantic Decomposition | Xinkai Liu et.al. | 2411.02334 | null |
2024-11-03 | Degradation-Aware Residual-Conditioned Optimal Transport for Unified Image Restoration | Xiaole Tang et.al. | 2411.01656 | link |
2024-11-03 | Optical Flow Representation Alignment Mamba Diffusion Model for Medical Video Generation | Zhenbin Wang et.al. | 2411.01647 | null |
2024-11-03 | TPOT: Topology Preserving Optimal Transport in Retinal Fundus Image Enhancement | Xuanzhao Dong et.al. | 2411.01403 | null |
2024-11-02 | Interacting Large Language Model Agents. Interpretable Models and Social Learning | Adit Jain et.al. | 2411.01271 | null |
2024-11-02 | The impact of MRI image quality on statistical and predictive analysis on voxel based morphology | Felix Hoffstaedter et.al. | 2411.01268 | link |
2024-11-02 | Enhancing Diabetic Retinopathy Detection with CNN-Based Models: A Comparative Study of UNET and Stacked UNET Architectures | Ameya Uppina et.al. | 2411.01251 | null |
2024-11-02 | Real-Time Spatio-Temporal Reconstruction of Dynamic Endoscopic Scenes with 4D Gaussian Splatting | Fengze Li et.al. | 2411.01218 | null |
2024-11-01 | Evaluation Metric for Quality Control and Generative Models in Histopathology Images | Pranav Jeevan et.al. | 2411.01034 | null |
2024-11-01 | Re-thinking Richardson-Lucy without Iteration Cutoffs: Physically Motivated Bayesian Deconvolution | Zachary H. Hendrix et.al. | 2411.00991 | null |
2024-11-01 | Inter-Feature-Map Differential Coding of Surveillance Video | Kei Iino et.al. | 2411.00984 | null |
2024-11-01 | Scalable AI Framework for Defect Detection in Metal Additive Manufacturing | Duy Nhat Phan et.al. | 2411.00960 | null |
2024-11-01 | Intensity Field Decomposition for Tissue-Guided Neural Tomography | Meng-Xun Li et.al. | 2411.00900 | null |
2024-11-01 | CityGaussianV2: Efficient and Geometrically Accurate Reconstruction for Large-Scale Scenes | Yang Liu et.al. | 2411.00771 | null |
2024-11-01 | Face Anonymization Made Simple | Han-Wei Kung et.al. | 2411.00762 | link |
2024-11-01 | Demystifying the use of Compression in Virtual Production | Anil Kokaram et.al. | 2411.00547 | null |
2024-11-01 | MV-Adapter: Enhancing Underwater Instance Segmentation via Adaptive Channel Attention | Lianjun Liu et.al. | 2411.00472 | null |
2024-10-31 | IO Transformer: Evaluating SwinV2-Based Reward Models for Computer Vision | Maxwell Meyer et.al. | 2411.00252 | null |
2024-10-31 | Denoising study of Fluoroscopic Images in real time tumor tracking System based on Statistical model of noise | Yongxuan Yan et.al. | 2411.00199 | null |
2024-10-31 | Enhancing Motion in Text-to-Video Generation with Decomposed Encoding and Conditioning | Penghui Ruan et.al. | 2410.24219 | link |
2024-10-31 | AIDOVECL: AI-generated Dataset of Outpainted Vehicles for Eye-level Classification and Localization | Amir Kazemi et.al. | 2410.24116 | null |
2024-10-31 | Parameter choices in HaarPSI for IQA with medical images | Clemens Karner et.al. | 2410.24098 | link |
2024-10-31 | Advanced Predictive Quality Assessment for Ultrasonic Additive Manufacturing with Deep Learning Model | Lokendra Poudel et.al. | 2410.24055 | null |
2024-10-31 | Image Synthesis with Class-Aware Semantic Diffusion Models for Surgical Scene Segmentation | Yihang Zhou et.al. | 2410.23962 | null |
2024-10-29 | Temporal and Spatial Super Resolution with Latent Diffusion Model in Medical MRI images | Vishal Dubey et.al. | 2410.23898 | null |
2024-10-31 | Cycle-Constrained Adversarial Denoising Convolutional Network for PET Image Denoising: Multi-Dimensional Validation on Large Datasets with Reader Study and Real Low-Dose Data | Yucun Hou et.al. | 2410.23628 | null |
2024-10-31 | LBurst: Learning-Based Robotic Burst Feature Extraction for 3D Reconstruction in Low Light | Ahalya Ravendran et.al. | 2410.23522 | null |
2024-10-30 | Plug-and-play superiorization | Jon Henshaw et.al. | 2410.23401 | null |
2024-10-30 | Redundant Cross-Correlation for Drift Correction in SEM Nanoparticle Imaging | Iago Bischoff Montenegro et.al. | 2410.23390 | link |
2024-10-30 | Variable Resolution Sampling and Deep Learning Image Recovery for Accelerated Multi-Spectral MRI Near Metal Implants | Azadeh Sharafi et.al. | 2410.23329 | null |
2024-10-30 | AdaptiveISP: Learning an Adaptive Image Signal Processor for Object Detection | Yujin Wang et.al. | 2410.22939 | null |
2024-10-30 | Prune and Repaint: Content-Aware Image Retargeting for any Ratio | Feihong Shen et.al. | 2410.22865 | link |
2024-10-30 | Latent Diffusion, Implicit Amplification: Efficient Continuous-Scale Super-Resolution for Remote Sensing Images | Hanlin Wu et.al. | 2410.22830 | null |
2024-10-30 | Diffusion Beats Autoregressive: An Evaluation of Compositional Generation in Text-to-Image Models | Arash Marioriyad et.al. | 2410.22775 | null |
2024-10-30 | st-DTPM: Spatial-Temporal Guided Diffusion Transformer Probabilistic Model for Delayed Scan PET Image Prediction | Ran Hong et.al. | 2410.22732 | null |
2024-10-30 | FlowDCN: Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution | Shuai Wang et.al. | 2410.22655 | null |
2024-10-31 | Consistency Diffusion Bridge Models | Guande He et.al. | 2410.22637 | null |
2024-10-29 | Deep Priors for Video Quality Prediction | Siddharath Narayan Shakya et.al. | 2410.22566 | null |
2024-10-29 | Enhancing Code Annotation Reliability: Generative AI’s Role in Comment Quality Assessment Models | Seetharam Killivalavan et.al. | 2410.22323 | null |
2024-10-29 | Multimodal Semantic Communication for Generative Audio-Driven Video Conferencing | Haonan Tong et.al. | 2410.22112 | null |
2024-10-29 | Data Generation for Hardware-Friendly Post-Training Quantization | Lior Dikstein et.al. | 2410.22110 | null |
2024-10-29 | Adapting Diffusion Models for Improved Prompt Compliance and Controllable Image Synthesis | Deepak Sridhar et.al. | 2410.21638 | null |
2024-10-28 | Exploring the Design Space of Diffusion Bridge Models via Stochasticity Control | Shaorong Zhang et.al. | 2410.21553 | null |
2024-10-28 | SpeechQE: Estimating the Quality of Direct Speech Translation | HyoJung Han et.al. | 2410.21485 | link |
2024-10-28 | Kandinsky 3: Text-to-Image Synthesis for Multifunctional Generative Framework | Vladimir Arkhipkin et.al. | 2410.21061 | link |
2024-10-28 | A Simple Yet Effective Corpus Construction Framework for Indonesian Grammatical Error Correction | Nankai Lin et.al. | 2410.20838 | link |
2024-10-28 | FreqMark: Invisible Image Watermarking via Frequency Based Optimization in Latent Space | Yiyang Guo et.al. | 2410.20824 | null |
2024-10-28 | Grid4D: 4D Decomposed Hash Encoding for High-fidelity Dynamic Gaussian Splatting | Jiawei Xu et.al. | 2410.20815 | null |
2024-10-28 | LoDAvatar: Hierarchical Embedding and Adaptive Levels of Detail with Gaussian Splatting for Enhanced Human Avatars | Xiaonuo Dongye et.al. | 2410.20789 | null |
2024-10-28 | CompGS: Unleashing 2D Compositionality for Compositional Text-to-3D via Dynamically Optimizing 3D Gaussians | Chongjian Ge et.al. | 2410.20723 | null |
2024-10-28 | ODGS: 3D Scene Reconstruction from Omnidirectional Images with 3D Gaussian Splattings | Suyoung Lee et.al. | 2410.20686 | null |
2024-10-27 | Normal-GS: 3D Gaussian Splatting with Normal-Involved Rendering | Meng Wei et.al. | 2410.20593 | null |
2024-10-27 | Sebica: Lightweight Spatial and Efficient Bidirectional Channel Attention Super Resolution Network | Chongxiao Liu et.al. | 2410.20546 | link |
2024-10-27 | Enhancing Community Vision Screening – AI Driven Retinal Photography for Early Disease Detection and Patient Trust | Xiaofeng Lei et.al. | 2410.20309 | null |
2024-10-27 | GUMBEL-NERF: Representing Unseen Objects as Part-Compositional Neural Radiance Fields | Yusuke Sekikawa et.al. | 2410.20306 | null |
2024-10-26 | OAR-Weighted Dice Score: A spatially aware, radiosensitivity aware metric for target structure contour quality assessment | Lucas McCullum et.al. | 2410.20243 | null |
2024-10-26 | Cross-Platform Neural Video Coding: A Case Study | Ruhan Conceição et.al. | 2410.20145 | null |
2024-10-26 | Super-resolved virtual staining of label-free tissue using diffusion models | Yijie Zhang et.al. | 2410.20073 | null |
2024-10-25 | The Galaxy Zoo Catalogs for the Galaxy And Mass Assembly (GAMA) Survey | Benne W. Holwerda et.al. | 2410.19985 | null |
2024-10-25 | FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality | Zhengyao Lv et.al. | 2410.19355 | null |
2024-10-25 | Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusion | Emiel Hoogeboom et.al. | 2410.19324 | null |
2024-10-24 | Optimising image capture for low-light widefield quantitative fluorescence microscopy | Zane Peterkovic et.al. | 2410.19210 | null |
2024-10-24 | Sort-free Gaussian Splatting via Weighted Sum Rendering | Qiqi Hou et.al. | 2410.18931 | null |
2024-10-24 | SafeBench: A Safety Evaluation Framework for Multimodal Large Language Models | Zonghao Ying et.al. | 2410.18927 | null |
2024-10-24 | Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances | Shilin Lu et.al. | 2410.18775 | link |
2024-10-24 | Advancements in Image Resolution: Super-Resolution Algorithm for Enhanced EOS-06 OCM-3 Data | Ankur Garg et.al. | 2410.18690 | null |
2024-10-24 | ODDN: Addressing Unpaired Data Challenges in Open-World Deepfake Detection on Online Social Networks | Renshuai Tao et.al. | 2410.18687 | null |
2024-10-24 | Knowledge Distillation Using Frontier Open-source LLMs: Generalizability and the Role of Synthetic Data | Anup Shirgaonkar et.al. | 2410.18588 | null |
2024-10-24 | ToolFlow: Boosting LLM Tool-Calling Through Natural and Coherent Dialogue Synthesis | Zezhong Wang et.al. | 2410.18447 | null |
2024-10-24 | FreCaS: Efficient Higher-Resolution Image Generation via Frequency-aware Cascaded Sampling | Zhengqiang Zhang et.al. | 2410.18410 | link |
2024-10-23 | Neural Cover Selection for Image Steganography | Karl Chahine et.al. | 2410.18216 | link |
2024-10-23 | In-Pixel Foreground and Contrast Enhancement Circuits with Customizable Mapping | Md Rahatul Islam Udoy et.al. | 2410.18052 | null |
2024-10-23 | Scalable Ranked Preference Optimization for Text-to-Image Generation | Shyamgopal Karthik et.al. | 2410.18013 | null |
2024-10-23 | Together We Can: Multilingual Automatic Post-Editing for Low-Resource Languages | Sourabh Deoghare et.al. | 2410.17973 | null |
2024-10-23 | Non-intrusive Speech Quality Assessment with Diffusion Models Trained on Clean Speech | Danilo de Oliveira et.al. | 2410.17834 | null |
2024-10-23 | TopoQA: a topological deep learning-based approach for protein complex structure interface quality assessment | Bingqing Han et.al. | 2410.17815 | null |
2024-10-23 | An Intelligent Agentic System for Complex Image Restoration Problems | Kaiwen Zhu et.al. | 2410.17809 | link |
2024-10-24 | Testing Deep Learning Recommender Systems Models on Synthetic GAN-Generated Datasets | Jesús Bobadilla et.al. | 2410.17651 | null |
2024-10-25 | Comprehensive Evaluation of Matrix Factorization Models for Collaborative Filtering Recommender Systems | Jesús Bobadilla et.al. | 2410.17644 | null |
2024-10-23 | Bilateral Hippocampi Segmentation in Low Field MRIs Using Mutual Feature Learning via Dual-Views | Himashi Peiris et.al. | 2410.17502 | link |
2024-10-21 | MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors | Honghua Chen et.al. | 2410.16272 | null |
2024-10-21 | Multispectral Texture Synthesis using RGB Convolutional Neural Networks | Sélim Ollivier et.al. | 2410.16019 | null |
2024-10-22 | Wireless Link Quality Estimation Using LSTM Model | Yuki Kanto et.al. | 2410.15357 | null |
2024-10-19 | A Survey on All-in-One Image Restoration: Taxonomy, Evaluation and Future Trends | Junjun Jiang et.al. | 2410.15067 | link |
2024-10-18 | DRACO: Differentiable Reconstruction for Arbitrary CBCT Orbits | Chengze Ye et.al. | 2410.14900 | link |
2024-10-18 | Dynamic Negative Guidance of Diffusion Models | Felix Koulischer et.al. | 2410.14398 | null |
2024-10-18 | Gaia Data Release 3: spectroscopic binary-star orbital solutions and the SB1 processing chain | E. Gosset et.al. | 2410.14372 | null |
2024-10-18 | 2D-3D Deformable Image Registration of Histology Slide and Micro-CT with ML-based Initialization | Junan Chen et.al. | 2410.14343 | null |
2024-10-18 | Advanced Underwater Image Quality Enhancement via Hybrid Super-Resolution Convolutional Neural Networks and Multi-Scale Retinex-Based Defogging Techniques | Yugandhar Reddy Gogireddy et.al. | 2410.14285 | null |
2024-10-18 | Takin-ADA: Emotion Controllable Audio-Driven Animation with Canonical and Landmark Loss Optimization | Bin Lin et.al. | 2410.14283 | null |
2024-10-18 | Combining Hough Transform and Deep Learning Approaches to Reconstruct ECG Signals From Printouts | Felix Krones et.al. | 2410.14185 | null |
2024-10-18 | Unlabeled Action Quality Assessment Based on Multi-dimensional Adaptive Constrained Dynamic Time Warping | Renguang Chen et.al. | 2410.14161 | null |
2024-10-17 | Generating Signed Language Instructions in Large-Scale Dialogue Systems | Mert İnan et.al. | 2410.14026 | null |
2024-10-17 | Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens | Lijie Fan et.al. | 2410.13863 | null |
2024-10-15 | Comparison of Image Preprocessing Techniques for Vehicle License Plate Recognition Using OCR: Performance and Accuracy Evaluation | Renato Augusto Tavares et.al. | 2410.13622 | null |
2024-10-17 | L3DG: Latent 3D Gaussian Diffusion | Barbara Roessle et.al. | 2410.13530 | null |
2024-10-17 | Enhancing Crowdsourced Audio for Text-to-Speech Models | José Giraldo et.al. | 2410.13357 | null |
2024-10-17 | Active inference and deep generative modeling for cognitive ultrasound | Ruud JG van Sloun et.al. | 2410.13310 | null |
2024-10-17 | Latent Image and Video Resolution Prediction using Convolutional Neural Networks | Rittwika Kansabanik et.al. | 2410.13227 | null |
2024-10-17 | Anchored Alignment for Self-Explanations Enhancement | Luis Felipe Villa-Arenas et.al. | 2410.13216 | null |
2024-10-17 | Using RLHF to align speech enhancement approaches to mean-opinion quality scores | Anurag Kumar et.al. | 2410.13182 | null |
2024-10-16 | Super-resolving Real-world Image Illumination Enhancement: A New Dataset and A Conditional Diffusion Model | Yang Liu et.al. | 2410.12961 | null |
2024-10-16 | Embedding an Ethical Mind: Aligning Text-to-Image Synthesis via Lightweight Value Optimization | Xingqi Wang et.al. | 2410.12700 | link |
2024-10-16 | SWIM: An Attention-Only Model for Speech Quality Assessment Under Subjective Variance | Imran E Kibria et.al. | 2410.12675 | null |
2024-10-16 | MambaPainter: Neural Stroke-Based Rendering in a Single Step | Tomoya Sawada et.al. | 2410.12524 | link |
2024-10-16 | Conditional Outcome Equivalence: A Quantile Alternative to CATE | Josh Givens et.al. | 2410.12454 | link |
2024-10-16 | Triplet: Triangle Patchlet for Mesh-Based Inverse Rendering and Scene Parameters Approximation | Jiajie Yang et.al. | 2410.12414 | link |
2024-10-14 | Learnable Optimization-Based Algorithms for Low-Dose CT Reconstruction | Daisy Chen et.al. | 2410.11903 | null |
2024-10-15 | Generative Image Steganography Based on Point Cloud | Zhong Yangjie et.al. | 2410.11673 | null |
2024-10-15 | Fast Local Neural Regression for Low-Cost, Path Traced Lambertian Global Illumination | Arturo Salmi et.al. | 2410.11625 | null |
2024-10-15 | Rician Denoising Diffusion Probabilistic Models For Sodium Breast MRI Enhancement | Shuaiyu Yuan et.al. | 2410.11511 | null |
2024-10-15 | Visual-Geometric Collaborative Guidance for Affordance Learning | Hongchen Luo et.al. | 2410.11363 | link |
2024-10-15 | Evolutionary Retrofitting | Mathurin Videau et.al. | 2410.11330 | null |
2024-10-14 | Watching the Watchers: Exposing Gender Disparities in Machine Translation Quality Estimation | Emmanouil Zaranis et.al. | 2410.10995 | null |
2024-10-14 | LVD-2M: A Long-take Video Dataset with Temporally Dense Captions | Tianwei Xiong et.al. | 2410.10816 | link |
2024-10-14 | Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention | Dejia Xu et.al. | 2410.10774 | null |
2024-10-14 | LISAC: Learned Coded Waveform Design for ISAC with OFDM | Chenghong Bian et.al. | 2410.10711 | null |
2024-10-14 | A Novel No-Reference Image Quality Metric For Assessing Sharpness In Satellite Imagery | Lucas Gonzalo Antonel et.al. | 2410.10488 | null |
2024-10-14 | Two-Stage Approach for Brain MR Image Synthesis: 2D Image Synthesis and 3D Refinement | Jihoon Cho et.al. | 2410.10269 | null |
2024-10-14 | Saliency Guided Optimization of Diffusion Latents | Xiwen Wang et.al. | 2410.10257 | null |
2024-10-14 | QE-EBM: Using Quality Estimators as Energy Loss for Machine Translation | Gahyun Yoo et.al. | 2410.10228 | null |
2024-10-14 | Automated Filtering of Human Feedback Data for Aligning Text-to-Image Diffusion Models | Yongjin Yang et.al. | 2410.10166 | null |
2024-10-14 | StegaINR4MIH: steganography by implicit neural representation for multi-image hiding | Weina Dong et.al. | 2410.10117 | link |
2024-10-13 | Crowd IQ – Aggregating Opinions to Boost Performance | Michal Kosinski et.al. | 2410.10004 | null |
2024-10-13 | Combining Generative and Geometry Priors for Wide-Angle Portrait Correction | Lan Yao et.al. | 2410.09911 | link |
2024-10-13 | Two-Stage Human Verification using HandCAPTCHA and Anti-Spoofed Finger Biometrics with Feature Selection | Asish Bera et.al. | 2410.09866 | null |
2024-10-12 | Preserving Old Memories in Vivid Detail: Human-Interactive Photo Restoration Framework | Seung-Yeon Back et.al. | 2410.09529 | null |
2024-10-12 | Fine-grained subjective visual quality assessment for high-fidelity compressed images | Michela Testolina et.al. | 2410.09501 | null |
2024-10-12 | Enhancing Single Image to 3D Generation using Gaussian Splatting and Hybrid Diffusion Priors | Hritam Basak et.al. | 2410.09467 | null |
2024-10-11 | TD-Paint: Faster Diffusion Inpainting Through Time Aware Pixel Conditioning | Tsiry Mayet et.al. | 2410.09306 | null |
2024-10-11 | SceneCraft: Layout-Guided 3D Scene Generation | Xiuyu Yang et.al. | 2410.09049 | link |
2024-10-11 | Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars | Xuan Huang et.al. | 2410.08840 | link |
2024-10-11 | Towards virtual painting recolouring using Vision Transformer on X-Ray Fluorescence datacubes | Alessandro Bombini et.al. | 2410.08826 | null |
2024-10-11 | A Theoretical Framework for AI-driven data quality monitoring in high-volume data environments | Nikhil Bangad et.al. | 2410.08576 | null |
2024-10-11 | Context-Aware Full Body Anonymization using Text-to-Image Diffusion Models | Pascl Zwick et.al. | 2410.08551 | link |
2024-10-11 | Quality Prediction of AI Generated Images and Videos: Emerging Trends and Opportunities | Abhijay Ghildyal et.al. | 2410.08534 | null |
2024-10-10 | Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content | Qiuheng Wang et.al. | 2410.08260 | null |
2024-10-10 | Exploring ASR-Based Wav2Vec2 for Automated Speech Disorder Assessment: Insights and Analysis | Tuan Nguyen et.al. | 2410.08250 | null |
2024-10-10 | ZeroComp: Zero-shot Object Compositing from Image Intrinsics via Diffusion | Zitian Zhang et.al. | 2410.08168 | null |
2024-10-10 | Efficient Perspective-Correct 3D Gaussian Splatting Using Hybrid Transparency | Florian Hahlbohm et.al. | 2410.08129 | null |
2024-10-10 | Medical Image Quality Assessment based on Probability of Necessity and Sufficiency | Boyu Chen et.al. | 2410.08118 | null |
2024-10-10 | High-redshift LBG selection from broadband and wide photometric surveys using a Random Forest algorithm | C. Payerne et.al. | 2410.08062 | null |
2024-10-10 | Modeling User Preferences with Automatic Metrics: Creating a High-Quality Preference Dataset for Machine Translation | Sweta Agrawal et.al. | 2410.07779 | null |
2024-10-10 | Synthesizing Multi-Class Surgical Datasets with Anatomy-Aware Diffusion Models | Danush Kumar Venkatesh et.al. | 2410.07753 | link |
2024-10-10 | Multi-Facet Counterfactual Learning for Content Quality Evaluation | Jiasheng Zheng et.al. | 2410.07693 | null |
2024-10-10 | DPL: Cross-quality DeepFake Detection via Dual Progressive Learning | Dongliang Zhang et.al. | 2410.07633 | null |
2024-10-10 | Rank Aggregation in Crowdsourcing for Listwise Annotations | Wenshui Luo et.al. | 2410.07538 | null |
2024-10-10 | A 3D-Printed Table for Hybrid X-ray CT and Optical Imaging of a Live Mouse | Wenxuan Xue et.al. | 2410.07517 | null |
2024-10-09 | An undetectable watermark for generative image models | Sam Gunn et.al. | 2410.07369 | link |
2024-10-09 | Secure Video Quality Assessment Resisting Adversarial Attacks | Ao-Xiang Zhang et.al. | 2410.06866 | null |
2024-10-09 | Diff-FMT: Diffusion Models for Fluorescence Molecular Tomography | Qianqian Xue et.al. | 2410.06757 | null |
2024-10-09 | MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes | Zhenhui Ye et.al. | 2410.06734 | null |
2024-10-09 | Perceptual Quality Assessment of Octree-RAHT Encoded 3D Point Clouds | Dongshuai Duan et.al. | 2410.06729 | link |
2024-10-09 | Perceptual Quality Assessment of Trisoup-Lifting Encoded 3D Point Clouds | Juncheng Long et.al. | 2410.06689 | link |
2024-10-09 | SCOREQ: Speech Quality Assessment with Contrastive Regression | Alessandro Ragano et.al. | 2410.06675 | link |
2024-10-09 | InstantIR: Blind Image Restoration with Instant Generative Reference | Jen-Yuan Huang et.al. | 2410.06551 | null |
2024-10-08 | Are Large Language Models State-of-the-art Quality Estimators for Machine Translation of User-generated Content? | Shenbin Qian et.al. | 2410.06338 | link |
2024-10-08 | Automated quality assessment using appearance-based simulations and hippocampus segmentation on low-field paediatric brain MR images | Vaanathi Sundaresan et.al. | 2410.06161 | link |
2024-10-08 | Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach | Sha Guo et.al. | 2410.06149 | null |
2024-10-08 | AP-LDM: Attentive and Progressive Latent Diffusion Model for Training-Free High-Resolution Image Generation | Boyuan Cao et.al. | 2410.06055 | link |
2024-10-08 | Is the MMI Criterion Necessary for Interpretability? Degenerating Non-causal Features to Plain Noise for Self-Rationalization | Wei Liu et.al. | 2410.06003 | link |
2024-10-08 | Integrating Online Learning and Connectivity Maintenance for Communication-Aware Multi-Robot Coordination | Yupeng Yang et.al. | 2410.05798 | link |
2024-10-08 | T2V-Turbo-v2: Enhancing Video Generation Model Post-Training through Data, Reward, and Conditional Guidance Design | Jiachen Li et.al. | 2410.05677 | null |
2024-10-08 | Holistic Unlearning Benchmark: A Multi-Faceted Evaluation for Text-to-Image Diffusion Model Unlearning | Saemi Moon et.al. | 2410.05664 | null |
2024-10-08 | Rethinking Reward Model Evaluation: Are We Barking up the Wrong Tree? | Xueru Wen et.al. | 2410.05584 | null |
2024-10-07 | Image Watermarks are Removable Using Controllable Regeneration from Clean Noise | Yepeng Liu et.al. | 2410.05470 | null |
2024-10-07 | SharpSLAM: 3D Object-Oriented Visual SLAM with Deblurring for Agile Drones | Denis Davletshin et.al. | 2410.05405 | null |
2024-10-07 | Towards a Modern and Lightweight Rendering Engine for Dynamic Robotic Simulations | Christopher John Allison et.al. | 2410.05095 | null |
2024-10-07 | Real-time cardiac cine MRI – A comparison of a diffusion probabilistic model with alternative state-of-the-art image reconstruction techniques for undersampled spiral acquisitions | Oliver Schad et.al. | 2410.04843 | null |
2024-10-07 | Learning Efficient and Effective Trajectories for Differential Equation-based Image Restoration | Zhiyu Zhu et.al. | 2410.04811 | link |
2024-10-07 | Transforming Color: A Novel Image Colorization Method | Hamza Shafiq et.al. | 2410.04799 | null |
2024-10-07 | CAR: Controllable Autoregressive Modeling for Visual Generation | Ziyu Yao et.al. | 2410.04671 | link |
2024-10-07 | Federated Learning Nodes Can Reconstruct Peers’ Image Data | Ethan Wilson et.al. | 2410.04661 | null |
2024-10-06 | Towards Unsupervised Blind Face Restoration using Diffusion Prior | Tianshu Kuai et.al. | 2410.04618 | null |
2024-10-06 | How Does the Disclosure of AI Assistance Affect the Perceptions of Writing? | Zhuoyan Li et.al. | 2410.04545 | null |
2024-10-06 | VideoGuide: Improving Video Diffusion Models without Training Through a Teacher’s Guide | Dohun Lee et.al. | 2410.04364 | null |
2024-10-05 | Persona Knowledge-Aligned Prompt Tuning Method for Online Debate | Chunkit Chan et.al. | 2410.04239 | link |
2024-10-05 | AIM 2024 Challenge on Video Super-Resolution Quality Assessment: Methods and Results | Ivan Molodetskikh et.al. | 2410.04225 | null |
2024-10-05 | Deep Transfer Learning Based Peer Review Aggregation and Meta-review Generation for Scientific Articles | Md. Tarek Hasan et.al. | 2410.04202 | null |
2024-10-05 | Overcoming False Illusions in Real-World Face Restoration with Multi-Modal Guided Diffusion Model | Keda Tao et.al. | 2410.04161 | null |
2024-10-05 | Can the Variation of Model Weights be used as a Criterion for Self-Paced Multilingual NMT? | Àlex R. Atrio et.al. | 2410.04147 | null |
2024-10-05 | Beyond Imperfections: A Conditional Inpainting Approach for End-to-End Artifact Removal in VTON and Pose Transfer | Aref Tabatabaei et.al. | 2410.04052 | null |
2024-10-04 | LANTERN: Accelerating Visual Autoregressive Models with Relaxed Speculative Decoding | Doohyuk Jang et.al. | 2410.03355 | null |
2024-10-04 | CLOVE: Travelling Salesman’s approach to hyperbolic embeddings of complex networks with communities | Sámuel G. Balogh et.al. | 2410.03270 | null |
2024-10-04 | Parallel Corpus Augmentation using Masked Language Models | Vibhuti Kumari et.al. | 2410.03194 | null |
2024-10-04 | ECHOPulse: ECG controlled echocardio-grams video generation | Yiwei Li et.al. | 2410.03143 | link |
2024-10-03 | Diffusion-based Extreme Image Compression with Compressed Feature Initialization | Zhiyuan Li et.al. | 2410.02640 | link |
2024-10-03 | An Improved Variational Method for Image Denoising | Jing-En Huang et.al. | 2410.02587 | null |
2024-10-03 | Combining Pre- and Post-Demosaicking Noise Removal for RAW Video | Marco Sánchez-Beeckman et.al. | 2410.02572 | null |
2024-10-03 | Dog-IQA: Standard-guided Zero-shot MLLM for Mix-grained Image Quality Assessment | Kai Liu et.al. | 2410.02505 | link |
2024-10-03 | Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models | Seyedmorteza Sadat et.al. | 2410.02416 | null |
2024-10-03 | Morphological evaluation of subwords vocabulary used by BETO language model | Óscar García-Sierra et.al. | 2410.02283 | null |
2024-10-03 | SC-CDM: Enhancing Quality of Image Semantic Communication with a Compact Diffusion Model | Kexin Zhang et.al. | 2410.02121 | null |
2024-10-02 | DisEnvisioner: Disentangled and Enriched Visual Prompt for Customized Image Generation | Jing He et.al. | 2410.02067 | null |
2024-10-02 | Impact of White-Box Adversarial Attacks on Convolutional Neural Networks | Rakesh Podder et.al. | 2410.02043 | null |
2024-10-02 | Social Media Authentication and Combating Deepfakes using Semi-fragile Invisible Image Watermarking | Aakash Varma Nadimpalli et.al. | 2410.01906 | null |
2024-10-02 | Enhancing LLM Fine-tuning for Text-to-SQLs by SQL Quality Measurement | Shouvon Sarker et.al. | 2410.01869 | null |
2024-10-02 | ComfyGen: Prompt-Adaptive Workflows for Text-to-Image Generation | Rinon Gal et.al. | 2410.01731 | null |
2024-10-04 | HarmoniCa: Harmonizing Training and Inference for Better Feature Cache in Diffusion Transformer Acceleration | Yushi Huang et.al. | 2410.01723 | null |
2024-10-02 | Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding | Yao Teng et.al. | 2410.01699 | link |
2024-10-02 | SAFE: Semantic Adaptive Feature Extraction with Rate Control for 6G Wireless Communications | Yuna Yan et.al. | 2410.01597 | null |
2024-10-02 | Frequency-Dependent F-Numbers Suppress Grating Lobes and Improve the Lateral Resolution in Line-by-Line Scanning | Martin F. Schiffner et.al. | 2410.01593 | null |
2024-10-02 | Imaging foundation model for universal enhancement of non-ideal measurement CT | Yuxin Liu et.al. | 2410.01591 | link |
2024-10-02 | HARMONI at ELT: tolerance analysis and expected as-build imaging performance of the infrared spectrograph | Eduard Muslimov et.al. | 2410.01581 | null |
2024-10-02 | Adaptive Radiofrequency Shimming in MRI using Reconfigurable Dielectric Materials | Paulina Šiurytė et.al. | 2410.01501 | null |
2024-10-02 | Quo Vadis RankList-based System in Face Recognition? | Xinyi Zhang et.al. | 2410.01498 | null |
2024-10-02 | Design of a custom wideband camera for MISTRAL imager-spectrograph | Eduard Muslimov et.al. | 2410.01414 | null |
2024-10-02 | CSIM: A Copula-based similarity index sensitive to local changes for Image quality assessment | Safouane El Ghazouali et.al. | 2410.01411 | link |
2024-10-01 | Generating Seamless Virtual Immunohistochemical Whole Slide Images with Content and Color Consistency | Sitong Liu et.al. | 2410.01072 | null |
2024-10-01 | LaDTalk: Latent Denoising for Synthesizing Talking Head Videos with High Frequency Details | Jian Yang et.al. | 2410.00990 | null |
2024-10-01 | Energy-Quality-aware Variable Framerate Pareto-Front for Adaptive Video Streaming | Prajit T Rajendran et.al. | 2410.00849 | null |
2024-10-01 | Maximum entropy and quantized metric models for absolute category ratings | Dietmar Saupe et.al. | 2410.00817 | null |
2024-10-01 | Basis function compression for field probe monitoring | Paul Dubovan et.al. | 2410.00754 | null |
2024-10-01 | Development of the normalization method for the first large field-of-view plastic-based PET Modular scanner | A. Coussat et.al. | 2410.00669 | null |
2024-10-01 | Contribution of soundscape appropriateness to soundscape quality assessment in space: a mediating variable affecting acoustic comfort | Xinhao Yang et.al. | 2410.00667 | null |
2024-10-01 | AutoTM 2.0: Automatic Topic Modeling Framework for Documents Analysis | Maria Khodorchenko et.al. | 2410.00655 | null |
2024-10-01 | Dynamic and Scalable Data Preparation for Object-Centric Process Mining | Lien Bosmans et.al. | 2410.00596 | null |
2024-09-30 | UIR-LoRA: Achieving Universal Image Restoration through Multiple Low-Rank Adaptation | Cheng Zhang et.al. | 2409.20197 | link |
2024-09-30 | Segmenting Wood Rot using Computer Vision Models | Roland Kammerbauer et.al. | 2409.20137 | null |
2024-09-30 | Machine Learning in Industrial Quality Control of Glass Bottle Prints | Maximilian Bundscherer et.al. | 2409.20132 | null |
2024-09-30 | Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs | Zicheng Zhang et.al. | 2409.20063 | null |
2024-09-30 | Is Preference Alignment Always the Best Option to Enhance LLM-Based Translation? An Empirical Analysis | Hippolyte Gisserot-Boukhlef et.al. | 2409.20059 | null |
2024-10-01 | UniSumEval: Towards Unified, Fine-Grained, Multi-Dimensional Summarization Evaluation for LLMs | Yuho Lee et.al. | 2409.19898 | link |
2024-09-29 | OrganiQ: Mitigating Classical Resource Bottlenecks of Quantum Generative Adversarial Networks on NISQ-Era Machines | Daniel Silver et.al. | 2409.19823 | null |
2024-09-29 | SemiDDM-Weather: A Semi-supervised Learning Framework for All-in-one Adverse Weather Removal | Fang Long et.al. | 2409.19679 | link |
2024-09-29 | Effective Diffusion Transformer Architecture for Image Super-Resolution | Kun Cheng et.al. | 2409.19589 | link |
2024-09-29 | High Quality Human Image Animation using Regional Supervision and Motion Blur Condition | Zhongcong Xu et.al. | 2409.19580 | null |
2024-09-27 | A comprehensive review and new taxonomy on superpixel segmentation | I. B. Barcelos et.al. | 2409.19179 | link |
2024-09-27 | Multimodal Pragmatic Jailbreak on Text-to-image Models | Tong Liu et.al. | 2409.19149 | null |
2024-09-27 | ReviveDiff: A Universal Diffusion Model for Restoring Images in Adverse Weather Conditions | Wenfeng Huang et.al. | 2409.18932 | null |
2024-09-27 | Unsupervised Low-light Image Enhancement with Lookup Tables and Diffusion Priors | Yunlong Lin et.al. | 2409.18899 | null |
2024-09-27 | Effectiveness of learning-based image codecs on fingerprint storage | Daniele Mari et.al. | 2409.18730 | link |
2024-09-27 | Decoding Complexity-Rate-Quality Pareto-Front for Adaptive VVC Streaming | Angeliki Katsenou et.al. | 2409.18713 | null |
2024-09-27 | Align $^2$ LLaVA: Cascaded Human and Large Language Model Preference Alignment for Multi-modal Instruction Curation | Hongzhe Huang et.al. | 2409.18541 | link |
2024-09-27 | Underwater Image Enhancement with Physical-based Denoising Diffusion Implicit Models | Nguyen Gia Bach et.al. | 2409.18476 | link |
2024-09-27 | GenesisTex2: Stable, Consistent and High-Quality Text-to-Texture Generation | Jiawei Lu et.al. | 2409.18401 | null |
2024-09-27 | SinoSynth: A Physics-based Domain Randomization Approach for Generalizable CBCT Image Enhancement | Yunkui Pang et.al. | 2409.18355 | link |
2024-09-26 | FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner | Wenliang Zhao et.al. | 2409.18128 | link |
2024-09-26 | Low Photon Number Non-Invasive Imaging Through Time-Varying Diffusers | Adrian Makowski et.al. | 2409.18072 | null |
2024-09-26 | LightAvatar: Efficient Head Avatar as Dynamic Neural Light Field | Huan Wang et.al. | 2409.18057 | link |
2024-09-26 | MARS: Multi-radio Architecture with Radio Selection using Decision Trees for emerging mesoscale CPS/IoT applications | Jothi Prasanna Shanmuga Sundaram et.al. | 2409.18043 | null |
2024-09-26 | PhoCoLens: Photorealistic and Consistent Reconstruction in Lensless Imaging | Xin Cai et.al. | 2409.17996 | null |
2024-09-26 | Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation | Qihan Huang et.al. | 2409.17920 | link |
2024-09-26 | Cross-lingual Human-Preference Alignment for Neural Machine Translation with Direct Quality Optimization | Kaden Uhlig et.al. | 2409.17673 | null |
2024-09-26 | FlowMAC: Conditional Flow Matching for Audio Coding at Low Bit Rates | Nicola Pia et.al. | 2409.17635 | null |
2024-09-26 | Pixel-Space Post-Training of Latent Diffusion Models | Christina Zhang et.al. | 2409.17565 | null |
2024-09-26 | Study of Subjective and Objective Quality in Super-Resolution Enhanced Broadcast Images on a Novel SR-IQA Dataset | Yongrok Kim et.al. | 2409.17451 | null |
2024-09-25 | DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion | Yukun Huang et.al. | 2409.17145 | link |
2024-09-25 | Text2CAD: Generating Sequential CAD Models from Beginner-to-Expert Level Text Prompts | Mohammad Sadil Khan et.al. | 2409.17106 | null |
2024-09-25 | Language-oriented Semantic Communication for Image Transmission with Fine-Tuned Diffusion Model | Xinfeng Wei et.al. | 2409.17104 | null |
2024-09-25 | The effect of image quality on galaxy merger identification with deep learning | Robert W. Bickley et.al. | 2409.17081 | null |
2024-09-25 | Degradation-Guided One-Step Image Super-Resolution with Diffusion Priors | Aiping Zhang et.al. | 2409.17058 | link |
2024-09-25 | MaViLS, a Benchmark Dataset for Video-to-Slide Alignment, Assessing Baseline Accuracy with a Multimodal Alignment Algorithm Leveraging Speech, OCR, and Visual Features | Katharina Anderer et.al. | 2409.16765 | link |
2024-09-25 | Pix2Next: Leveraging Vision Foundation Models for RGB to NIR Image Translation | Youngwan Jin et.al. | 2409.16706 | null |
2024-09-25 | In which fields can ChatGPT detect journal article quality? An evaluation of REF2021 results | Mike Thelwall et.al. | 2409.16695 | null |
2024-09-25 | Morphological-consistent Diffusion Network for Ultrasound Coronal Image Enhancement | Yihao Zhou et.al. | 2409.16661 | null |
2024-09-25 | Pre-trained Language Models Return Distinguishable Probability Distributions to Unfaithfully Hallucinated Texts | Taehun Cha et.al. | 2409.16658 | link |
2024-09-25 | Enabling Auditory Large Language Models for Automatic Speech Quality Evaluation | Siyin Wang et.al. | 2409.16644 | null |
2024-09-25 | DeformStream: Deformation-based Adaptive Volumetric Video Streaming | Boyan Li et.al. | 2409.16615 | null |
2024-09-25 | Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models | Deepak Sridhar et.al. | 2409.16535 | link |
2024-09-24 | Low Latency Point Cloud Rendering with Learned Splatting | Yueyu Hu et.al. | 2409.16504 | link |
2024-09-24 | A Unified Hallucination Mitigation Framework for Large Vision-Language Models | Yue Chang et.al. | 2409.16494 | link |
2024-09-24 | AIM 2024 Challenge on UHD Blind Photo Quality Assessment | Vlad Hosu et.al. | 2409.16271 | null |
2024-09-26 | Enhanced Unsupervised Image-to-Image Translation Using Contrastive Learning and Histogram of Oriented Gradients | Wanchen Zhao et.al. | 2409.16042 | null |
2024-09-24 | Deep chroma compression of tone-mapped images | Xenios Milidonis et.al. | 2409.16032 | link |
2024-09-24 | VascX Models: Model Ensembles for Retinal Vascular Analysis from Color Fundus Images | Jose Vargas Quiros et.al. | 2409.16016 | link |
2024-09-24 | Semantics-Controlled Gaussian Splatting for Outdoor Scene Reconstruction and Rendering in Virtual Reality | Hannah Schieber et.al. | 2409.15959 | null |
2024-09-24 | Unsupervised dMRI Artifact Detection via Angular Resolution Enhancement and Cycle Consistency Learning | Sheng Chen et.al. | 2409.15883 | null |
2024-09-25 | Ring Artifacts Removal Based on Implicit Neural Representation of Sinogram Data | Ligen Shi et.al. | 2409.15731 | null |
2024-09-23 | Blind Localization of Early Room Reflections with Arbitrary Microphone Array | Yogev Hadadi et.al. | 2409.15484 | null |
2024-09-23 | Simplifying Triangle Meshes in the Wild | Hsueh-Ti Derek Liu et.al. | 2409.15458 | null |
2024-09-23 | MIMAFace: Face Animation via Motion-Identity Modulated Appearance Feature Learning | Yue Han et.al. | 2409.15179 | null |
2024-09-23 | Advancing Video Quality Assessment for AIGC | Xinli Yue et.al. | 2409.14888 | null |
2024-09-23 | Revisiting Video Quality Assessment from the Perspective of Generalization | Xinli Yue et.al. | 2409.14847 | link |
2024-09-23 | AIM 2024 Challenge on Video Saliency Prediction: Methods and Results | Andrey Moskalenko et.al. | 2409.14827 | link |
2024-09-23 | HiFi-Glot: Neural Formant Synthesis with Differentiable Resonant Filters | Lauri Juvela et.al. | 2409.14823 | null |
2024-09-22 | Robust Audio-Visual Speech Enhancement: Correcting Misassignments in Complex Environments with Advanced Post-Processing | Wenze Ren et.al. | 2409.14554 | null |
2024-09-22 | Improved direction of arrival estimations with a wearable microphone array for dynamic environments by reliability weighting | Daniel A. Mitchell et.al. | 2409.14346 | null |
2024-09-22 | MQM-APE: Toward High-Quality Error Annotation Predictors with Automatic Post-Editing in LLM Translation Evaluators | Qingyu Lu et.al. | 2409.14335 | null |
2024-09-22 | Quantitative and Qualitative Evaluation of NLM and Wavelet Methods in Image Enhancement | Cameron Khanpour et.al. | 2409.14334 | null |
2024-09-21 | JVID: Joint Video-Image Diffusion for Visual-Quality and Temporal-Consistency in Video Generation | Hadrien Reynaud et.al. | 2409.14149 | null |
2024-09-21 | N-Version Assessment and Enhancement of Generative AI | Marcus Kessel et.al. | 2409.14071 | null |
2024-09-18 | An Efficient Projection-Based Next-best-view Planning Framework for Reconstruction of Unknown Objects | Zhizhou Jia et.al. | 2409.12096 | null |
2024-09-18 | Dense-TSNet: Dense Connected Two-Stage Structure for Ultra-Lightweight Speech Enhancement | Zizhen Lin et.al. | 2409.11725 | null |
2024-09-18 | DAF-Net: A Dual-Branch Feature Decomposition Fusion Network with Domain Adaptive for Infrared and Visible Image Fusion | Jian Xu et.al. | 2409.11642 | link |
2024-09-17 | Noise-aware Dynamic Image Denoising and Positron Range Correction for Rubidium-82 Cardiac PET Imaging via Self-supervision | Huidong Xie et.al. | 2409.11543 | null |
2024-09-17 | Online 4D Ultrasound-Guided Robotic Tracking Enables 3D Ultrasound Localisation Microscopy with Large Tissue Displacements | Jipeng Yan et.al. | 2409.11391 | null |
2024-09-17 | Ultrasound Image Enhancement with the Variance of Diffusion Models | Yuxin Zhang et.al. | 2409.11380 | link |
2024-09-17 | Uncertainty and Prediction Quality Estimation for Semantic Segmentation via Graph Neural Networks | Edgar Heinert et.al. | 2409.11373 | null |
2024-09-17 | Edge-based Denoising Image Compression | Ryugo Morita et.al. | 2409.10978 | null |
2024-09-17 | CUNSB-RFIE: Context-aware Unpaired Neural Schrödinger Bridge in Retinal Fundus Image Enhancement | Xuanzhao Dong et.al. | 2409.10966 | link |
2024-09-17 | Towards Effective User Attribution for Latent Diffusion Models via Watermark-Informed Blending | Yongyang Pan et.al. | 2409.10958 | null |
2024-09-17 | Neural Fields for Adaptive Photoacoustic Computed Tomography | Tianao Li et.al. | 2409.10876 | null |
2024-09-16 | Investigating Training Objectives for Generative Speech Enhancement | Julius Richter et.al. | 2409.10753 | link |
2024-09-16 | Taming Diffusion Models for Image Restoration: A Review | Ziwei Luo et.al. | 2409.10353 | null |
2024-09-16 | FGR-Net:Interpretable fundus imagegradeability classification based on deepreconstruction learning | Saif Khalid et.al. | 2409.10246 | null |
2024-09-16 | RF-GML: Reference-Free Generative Machine Listener | Arijit Biswas et.al. | 2409.10210 | null |
2024-09-16 | Towards Explainable Automated Data Quality Enhancement without Domain Knowledge | Djibril Sarr et.al. | 2409.10139 | null |
2024-09-16 | 2S-ODIS: Two-Stage Omni-Directional Image Synthesis by Geometric Distortion Correction | Atsuya Nakata et.al. | 2409.09969 | link |
2024-09-15 | A Global Perspective on the Past, Present, and Future of Video Streaming over Starlink | Liz Izhikevich et.al. | 2409.09846 | null |
2024-09-15 | Underwater Image Enhancement via Dehazing and Color Restoration | Chengqin Wu et.al. | 2409.09779 | null |
2024-09-15 | High Definition Map Mapping and Update: A General Overview and Future Directions | Benny Wijaya et.al. | 2409.09726 | null |
2024-09-15 | Superconducting and low temperature RF Coils for Ultra-Low-Field MRI: A Study on SNR Performance | Aditya A Bhosale et.al. | 2409.09608 | null |
2024-09-14 | Estimating Neural Orientation Distribution Fields on High Resolution Diffusion MRI Scans | Mohammed Munzer Dwedari et.al. | 2409.09387 | link |
2024-09-13 | Emerging Reliance Behaviors in Human-AI Text Generation: Hallucinations, Data Quality Assessment, and Cognitive Forcing Functions | Zahra Ashktorab et.al. | 2409.08937 | null |
2024-09-13 | Confocal Raman Microscopy with Adaptive Optics | J. D. Munoz-Bolanos et.al. | 2409.08725 | null |
2024-09-13 | Joint image reconstruction and segmentation of real-time cardiac MRI in free-breathing using a model based on disentangled representation learning | Tobias Wech et.al. | 2409.08619 | null |
2024-09-13 | DiffFAS: Face Anti-Spoofing via Generative Diffusion Models | Xinxu Ge et.al. | 2409.08572 | link |
2024-09-13 | CasDyF-Net: Image Dehazing via Cascaded Dynamic Filters | Wang Yinglong et.al. | 2409.08510 | link |
2024-09-12 | OpenACE: An Open Benchmark for Evaluating Audio Coding Performance | Jozef Coldenhoff et.al. | 2409.08374 | link |
2024-09-12 | Expansive Supervision for Neural Radiance Field | Weixiang Zhang et.al. | 2409.08056 | null |
2024-09-12 | OCTAMamba: A State-Space Model Approach for Precision OCTA Vasculature Segmentation | Shun Zou et.al. | 2409.08000 | link |
2024-09-14 | Exploring Kolmogorov-Arnold networks for realistic image sharpness assessment | Shaode Yu et.al. | 2409.07762 | null |
2024-09-11 | Foundation Models Boost Low-Level Perceptual Similarity Metrics | Abhijay Ghildyal et.al. | 2409.07650 | null |
2024-09-11 | Machine Learning and Constraint Programming for Efficient Healthcare Scheduling | Aymen Ben Said et.al. | 2409.07547 | null |
2024-09-11 | FreeEnhance: Tuning-Free Image Enhancement via Content-Consistent Noising-and-Denoising Process | Yang Luo et.al. | 2409.07451 | null |
2024-09-11 | EMOdiffhead: Continuously Emotional Control in Talking Head Generation via Diffusion | Jian Zhang et.al. | 2409.07255 | null |
2024-09-12 | 3DGCQA: A Quality Assessment Database for 3D AI-Generated Contents | Yingjie Zhou et.al. | 2409.07236 | link |
2024-09-11 | Phantom-based gradient waveform measurements with compensated variable-prephasing: Description and application to EPI at 7T | Hannah Scholten et.al. | 2409.07203 | null |
2024-09-11 | Attention Down-Sampling Transformer, Relative Ranking and Self-Consistency for Blind Image Quality Assessment | Mohammed Alsaafin et.al. | 2409.07115 | link |
2024-09-11 | CPSample: Classifier Protected Sampling for Guarding Training Data During Diffusion | Joshua Kazdan et.al. | 2409.07025 | null |
2024-09-11 | AdvLogo: Adversarial Patch Attack against Object Detectors based on Diffusion Models | Boming Miao et.al. | 2409.07002 | null |
2024-09-10 | ExIQA: Explainable Image Quality Assessment Using Distortion Attributes | Sepehr Kazemi Ranjbar et.al. | 2409.06853 | null |
2024-09-10 | Universal End-to-End Neural Network for Lossy Image Compression | Bouzid Arezki et.al. | 2409.06586 | null |
2024-09-10 | Three-dimensional generative adversarial networks for turbulent flow estimation from wall measurements | Antonio Cuéllar et.al. | 2409.06548 | null |
2024-09-11 | AMNS: Attention-Weighted Selective Mask and Noise Label Suppression for Text-to-Image Person Retrieval | Runqing Zhang et.al. | 2409.06385 | null |
2024-09-10 | Multi-Weather Image Restoration via Histogram-Based Transformer Feature Enhancement | Yang Wen et.al. | 2409.06334 | null |
2024-09-10 | DeWinder: Single-Channel Wind Noise Reduction using Ultrasound Sensing | Kuang Yuan et.al. | 2409.06137 | null |
2024-09-09 | Enhancing Cross-Modality Synthesis: Subvolume Merging for MRI-to-CT Conversion | Fuxin Fan et.al. | 2409.05982 | null |
2024-09-09 | SynMorph: Generating Synthetic Face Morphing Dataset with Mated Samples | Haoyu Zhang et.al. | 2409.05595 | null |
2024-09-09 | Efficient Quality Estimation of True Random Bit-streams | Cesare Caratozzolo et.al. | 2409.05543 | null |
2024-09-09 | Exploring Rich Subjective Quality Information for Image Quality Assessment in the Wild | Xiongkuo Min et.al. | 2409.05540 | null |
2024-09-09 | A Taxonomy of Miscompressions: Preparing Image Forensics for Neural Compression | Nora Hofer et.al. | 2409.05490 | null |
2024-09-09 | Boosting CLIP Adaptation for Image Quality Assessment via Meta-Prompt Learning and Gradient Regularization | Xudong Li et.al. | 2409.05381 | null |
2024-09-09 | PersonaTalk: Bring Attention to Your Persona in Visual Dubbing | Longhao Zhang et.al. | 2409.05379 | null |
2024-09-09 | BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec | Detai Xin et.al. | 2409.05377 | link |
2024-09-09 | Adaptive Offloading and Enhancement for Low-Light Video Analytics on Mobile Devices | Yuanyi He et.al. | 2409.05297 | null |
2024-09-08 | Ultron: Enabling Temporal Geometry Compression of 3D Mesh Sequences using Temporal Correspondence and Mesh Deformation | Haichao Zhu et.al. | 2409.05151 | null |
2024-09-07 | Plug-and-Hide: Provable and Adjustable Diffusion Generative Steganography | Jiahao Zhu et.al. | 2409.04878 | null |
2024-09-07 | Metadata augmented deep neural networks for wild animal classification | Aslak Tøn et.al. | 2409.04825 | link |
2024-09-11 | Fisheye-GS: Lightweight and Extensible Gaussian Splatting Module for Fisheye Cameras | Zimu Liao et.al. | 2409.04751 | link |
2024-09-06 | Whole Heart Perfusion with High-Multiband Simultaneous Multislice Imaging via Linear Phase Modulated Extended Field of View (SMILE) | Shen Zhao et.al. | 2409.04353 | link |
2024-09-06 | Design and Characterization of MRI-compatible Plastic Ultrasonic Motor | Zhanyue Zhao et.al. | 2409.04006 | null |
2024-09-06 | Bi-modality Images Transfer with a Discrete Process Matching Method | Zhe Xiong et.al. | 2409.03977 | null |
2024-09-03 | Applications and Advances of Artificial Intelligence in Music Generation:A Review | Yanxu Chen et.al. | 2409.03715 | null |
2024-09-05 | Enabling Practical and Privacy-Preserving Image Processing | Chao Wang et.al. | 2409.03568 | null |
2024-09-05 | Use of triplet loss for facial restoration in low-resolution images | Sebastian Pulgar et.al. | 2409.03530 | null |
2024-09-05 | Improving Uncertainty-Error Correspondence in Deep Bayesian Medical Image Segmentation | Prerak Mody et.al. | 2409.03470 | link |
2024-09-05 | Multiple weather images restoration using the task transformer and adaptive mixup strategy | Yang Wen et.al. | 2409.03249 | null |
2024-09-05 | Perceptual-Distortion Balanced Image Super-Resolution is a Multi-Objective Optimization Problem | Qiwen Zhu et.al. | 2409.03179 | link |
2024-09-05 | Large Étendue 3D Holographic Display with Content-adpative Dynamic Fourier Modulation | Brian Chao et.al. | 2409.03143 | null |
2024-09-04 | Incorporating dense metric depth into neural 3D representations for view synthesis and relighting | Arkadeep Narayan Chaudhury et.al. | 2409.03061 | null |
2024-09-04 | Rate-Adaptive Generative Semantic Communication Using Conditional Diffusion Models | Pujing Yang et.al. | 2409.02597 | null |
2024-09-04 | Coral Model Generation from Single Images for Virtual Reality Applications | Jie Fu et.al. | 2409.02376 | null |
2024-09-04 | Image Registration with Averaging Network and Edge-Based Loss for Low-SNR Cardiac MRI | Xuan Lei et.al. | 2409.02348 | null |
2024-09-03 | Coaching a Robotic Sonographer: Learning Robotic Ultrasound with Sparse Expert’s Feedback | Deepak Raina et.al. | 2409.02337 | null |
2024-09-03 | Unveiling Deep Shadows: A Survey on Image and Video Shadow Detection, Removal, and Generation in the Era of Deep Learning | Xiaowei Hu et.al. | 2409.02108 | link |
2024-09-03 | AllWeatherNet:Unified Image enhancement for autonomous driving under adverse weather and lowlight-conditions | Chenghao Qian et.al. | 2409.02045 | link |
2024-09-03 | Map-Assisted Remote-Sensing Image Compression at Extremely Low Bitrates | Yixuan Ye et.al. | 2409.01935 | link |
2024-09-03 | UWStereo: A Large Synthetic Dataset for Underwater Stereo Matching | Qingxuan Lv et.al. | 2409.01782 | null |
2024-09-03 | Boron Isotope Effects on Raman Scattering in Bulk BN, BP, and BAs: A Density-Functional Theory Study | Nima Ghafari Cherati et.al. | 2409.01671 | null |
2024-09-03 | GaussianPU: A Hybrid 2D-3D Upsampling Framework for Enhancing Color Point Clouds via 3D Gaussian Splatting | Zixuan Guo et.al. | 2409.01581 | null |
2024-09-03 | Learning Task-Specific Sampling Strategy for Sparse-View CT Reconstruction | Liutao Yang et.al. | 2409.01544 | null |
2024-09-03 | Long-Range Biometric Identification in Real World Scenarios: A Comprehensive Evaluation Framework Based on Missions | Deniz Aykac et.al. | 2409.01540 | null |
2024-09-02 | Real-Time Multi-Scene Visibility Enhancement for Promoting Navigational Safety of Vessels Under Complex Weather Conditions | Ryan Wen Liu et.al. | 2409.01500 | link |
2024-09-02 | Spectron: Target Speaker Extraction using Conditional Transformer with Adversarial Refinement | Tathagata Bandyopadhyay et.al. | 2409.01352 | link |
2024-09-02 | A Roadmap to Holographic Focused Ultrasound Approaches to Generate Thermal Patterns | Ceren Cengiz et.al. | 2409.01323 | null |
2024-09-02 | Investigation of the spatial resolution of PET imaging system measuring polarization-correlated Compton events | Ana Marija Kožuljević et.al. | 2409.01238 | null |
2024-09-02 | MobileIQA: Exploiting Mobile-level Diverse Opinion Network For No-Reference Image Quality Assessment Using Knowledge Distillation | Zewen Chen et.al. | 2409.01212 | link |
2024-09-02 | Generating Synthetic Satellite Imagery for Rare Objects: An Empirical Comparison of Models and Metrics | Tuong Vy Nguyen et.al. | 2409.01138 | null |
2024-09-02 | Rapid GPU-Based Pangenome Graph Layout | Jiajie Li et.al. | 2409.00876 | null |
2024-09-01 | An Optimized Binning and Probabilistic Slice Sharing Algorithm for Motion Correction in Abdominal DW-MRI | Michelle Su et.al. | 2409.00798 | null |
2024-08-30 | Subspace Diffusion Posterior Sampling for Travel-Time Tomography | Xiang Cao et.al. | 2408.17333 | null |
2024-08-30 | Image-Perfect Imperfections: Safety, Bias, and Authenticity in the Shadow of Text-To-Image Model Evolution | Yixin Wu et.al. | 2408.17285 | null |
2024-08-30 | LAR-IQA: A Lightweight, Accurate, and Robust No-Reference Image Quality Assessment Model | Nasim Jamshidi Avanaki et.al. | 2408.17057 | link |
2024-08-30 | Efficient Camera Exposure Control for Visual Odometry via Deep Reinforcement Learning | Shuyang Zhang et.al. | 2408.17005 | link |
2024-08-29 | Legacy Learning Using Few-Shot Font Generation Models for Automatic Text Design in Metaverse Content: Cases Studies in Korean and Chinese | Younghwi Kim et.al. | 2408.16900 | null |
2024-08-29 | The Continuous Electron Beam Accelerator Facility at 12 GeV | P. A. Adderley et.al. | 2408.16880 | null |
2024-08-29 | MSLIQA: Enhancing Learning Representations for Image Quality Assessment through Multi-Scale Learning | Nasim Jamshidi Avanaki et.al. | 2408.16879 | null |
2024-09-04 | Auto-resolving atomic structure at van der Waal interfaces using a generative model | Wenqiang Huang et.al. | 2408.16802 | link |
2024-09-02 | RLCP: A Reinforcement Learning-based Copyright Protection Method for Text-to-Image Diffusion Model | Zhuan Shi et.al. | 2408.16634 | null |
2024-09-02 | A Deep-Learning-Based Label-free No-Reference Image Quality Assessment Metric: Application in Sodium MRI Denoising | Shuaiyu Yuan et.al. | 2408.16481 | null |
2024-08-29 | LMT-GP: Combined Latent Mean-Teacher and Gaussian Process for Semi-supervised Low-light Image Enhancement | Ye Yu et.al. | 2408.16235 | link |
2024-08-28 | TEDRA: Text-based Editing of Dynamic and Photoreal Actors | Basavaraj Sunagad et.al. | 2408.15995 | null |
2024-08-28 | Segmentation-guided Layer-wise Image Vectorization with Gradient Fills | Hengyu Zhou et.al. | 2408.15741 | link |
2024-08-28 | Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas | Fabio Quattrini et.al. | 2408.15660 | link |
2024-08-28 | Avoiding Generative Model Writer’s Block With Embedding Nudging | Ali Zand et.al. | 2408.15450 | null |
2024-09-02 | Pitfalls and Outlooks in Using COMET | Vilém Zouhar et.al. | 2408.15366 | link |
2024-08-27 | Histo-Diffusion: A Diffusion Super-Resolution Method for Digital Pathology with Comprehensive Quality Assessment | Xuan Xu et.al. | 2408.15218 | null |
2024-08-27 | CLIP-AGIQA: Boosting the Performance of AI-Generated Image Quality Assessment with CLIP | Zhenchen Tang et.al. | 2408.15098 | null |
2024-08-27 | Towards Real-world Event-guided Low-light Video Enhancement and Deblurring | Taewoo Kim et.al. | 2408.14916 | link |
2024-08-27 | Alfie: Democratising RGBA Image Generation With No $$$ | Fabio Quattrini et.al. | 2408.14826 | link |
2024-08-27 | Sequential-Scanning Dual-Energy CT Imaging Using High Temporal Resolution Image Reconstruction and Error-Compensated Material Basis Image Generation | Qiaoxin Li et.al. | 2408.14754 | null |
2024-08-26 | Gallery-Aware Uncertainty Estimation For Open-Set Face Recognition | Leonid Erlygin et.al. | 2408.14229 | null |
2024-08-27 | SwiftBrush v2: Make Your One-step Diffusion Model Better Than Its Teacher | Trung Dao et.al. | 2408.14176 | link |
2024-08-27 | Improving Water Quality Time-Series Prediction in Hong Kong using Sentinel-2 MSI Data and Google Earth Engine Cloud Computing | Rohin Sood et.al. | 2408.14010 | null |
2024-08-26 | LMM-VQA: Advancing Video Quality Assessment with Large Multimodal Models | Qihang Ge et.al. | 2408.14008 | null |
2024-08-25 | Draw Like an Artist: Complex Scene Generation with Diffusion Model via Composition, Painting, and Retouching | Minghao Liu et.al. | 2408.13858 | null |
2024-08-25 | Guardians of the Machine Translation Meta-Evaluation: Sentinel Metrics Fall In! | Stefano Perrella et.al. | 2408.13831 | link |
2024-08-24 | G3DST: Generalizing 3D Style Transfer with Neural Radiance Fields across Scenes and Styles | Adil Meric et.al. | 2408.13508 | null |
2024-08-23 | ReCon: Reconfiguring Analog Rydberg Atom Quantum Computers for Quantum Generative Adversarial Networks | Nicholas S. DiBrita et.al. | 2408.13389 | link |
2024-08-23 | Re-evaluation of Face Anti-spoofing Algorithm in Post COVID-19 Era Using Mask Based Occlusion Attack | Vaibhav Sundharam et.al. | 2408.13251 | null |
2024-08-23 | ResSR: A Residual Approach to Super-Resolving Multispectral Images | Haley Duba-Sullivan et.al. | 2408.13225 | null |
2024-08-23 | A density ratio framework for evaluating the utility of synthetic data | Thom Benjamin Volker et.al. | 2408.13167 | null |
2024-08-23 | When Diffusion MRI Meets Diffusion Model: A Novel Deep Generative Model for Diffusion MRI Generation | Xi Zhu et.al. | 2408.12897 | null |
2024-08-22 | Variable Stars in M31 Stellar Clusters from the Panchromatic Hubble Andromeda Treasury | Richard Smith et.al. | 2408.12765 | null |
2024-08-22 | Visual Verity in AI-Generated Imagery: Computational Metrics and Human-Centric Analysis | Memoona Aziz et.al. | 2408.12762 | null |
2024-08-22 | Unlocking Intrinsic Fairness in Stable Diffusion | Eunji Kim et.al. | 2408.12692 | null |
2024-08-22 | Developing vocal system impaired patient-aimed voice quality assessment approach using ASR representation-included multiple features | Shaoxiang Dang et.al. | 2408.12279 | null |
2024-08-21 | MBSS-T1: Model-Based Self-Supervised Motion Correction for Robust Cardiac T1 Mapping | Eyal Hanania et.al. | 2408.11992 | null |
2024-08-21 | AIM 2024 Challenge on Compressed Video Quality Assessment: Methods and Results | Maksim Smirnov et.al. | 2408.11982 | link |
2024-08-21 | Estimating Contribution Quality in Online Deliberations Using a Large Language Model | Lodewijk Gelauff et.al. | 2408.11936 | null |
2024-08-21 | FRAP: Faithful and Realistic Text-to-Image Generation with Adaptive Prompt Weighting | Liyao Jiang et.al. | 2408.11706 | null |
2024-08-21 | Interpretable Long-term Action Quality Assessment | Xu Dong et.al. | 2408.11687 | link |
2024-08-21 | E-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video Editing Quality Assessment | Shangkun Sun et.al. | 2408.11481 | link |
2024-08-21 | Fairness measures for biometric quality assessment | André Dörsch et.al. | 2408.11392 | null |
2024-08-21 | Gender Bias Evaluation in Text-to-image Generation: A Survey | Yankun Wu et.al. | 2408.11358 | null |
2024-08-21 | Image Score: Learning and Evaluating Human Preferences for Mercari Search | Chingis Oinar et.al. | 2408.11349 | null |
2024-08-21 | High-quality imaging of large areas through path-difference ptychography | Jizhe Cui et.al. | 2408.11332 | null |
2024-08-21 | Optimizing Transmit Field Inhomogeneity of Parallel RF Transmit Design in 7T MRI using Deep Learning | Zhengyi Lu et.al. | 2408.11323 | null |
2024-08-21 | Transfer Learning and the Early Estimation of Single-Photon Source Quality using Machine Learning Methods | David Jacob Kedziora et.al. | 2408.11322 | link |
2024-08-20 | Compress Guidance in Conditional Diffusion Sampling | Anh-Dung Dinh et.al. | 2408.11194 | null |
2024-08-20 | Prompt-Guided Image-Adaptive Neural Implicit Lookup Tables for Interpretable Image Enhancement | Satoshi Kosugi et.al. | 2408.11055 | link |
2024-08-20 | Denoising Plane Wave Ultrasound Images Using Diffusion Probabilistic Models | Hojat Asgariandehkordi et.al. | 2408.10987 | null |
2024-08-20 | Influence of Medical Foreign Bodies on Dark-Field Chest Radiographs: First experiences | Lennard Kaster et.al. | 2408.10855 | null |
2024-08-19 | Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation | Liu He et.al. | 2408.10453 | null |
2024-08-19 | Perceptual Depth Quality Assessment of Stereoscopic Omnidirectional Images | Wei Zhou et.al. | 2408.10134 | null |
2024-08-19 | Sliced Maximal Information Coefficient: A Training-Free Approach for Image Quality Assessment Enhancement | Kang Xiao et.al. | 2408.09920 | link |
2024-08-19 | Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation | Yunxin Li et.al. | 2408.09787 | link |
2024-08-21 | Reconstruct Spine CT from Biplanar X-Rays via Diffusion Learning | Zhi Qiao et.al. | 2408.09731 | null |
2024-08-18 | FD2Talk: Towards Generalized Talking Head Generation with Facial Decoupled Diffusion Model | Ziyu Yao et.al. | 2408.09384 | null |
2024-08-17 | Optimal Strip Attitude Command of Earth Observation Satellite using Differential Dynamic Programming | Seungyeop Han et.al. | 2408.09244 | null |
2024-08-16 | Explore Cross-Codec Quality-Rate Convex Hulls Relation for Adaptive Streaming | Masoumeh Farhadi Nia et.al. | 2408.09044 | null |
2024-08-16 | Evaluating the Evaluator: Measuring LLMs’ Adherence to Task Evaluation Instructions | Bhuvanashree Murugadoss et.al. | 2408.08781 | null |
2024-08-16 | Speckle Noise Analysis for Synthetic Aperture Radar (SAR) Space Data | Sanjjushri Varshini R et.al. | 2408.08774 | null |
2024-08-16 | Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs | Jinming Liu et.al. | 2408.08575 | null |
2024-08-16 | Visual-Friendly Concept Protection via Selective Adversarial Perturbations | Xiaoyue Mi et.al. | 2408.08518 | link |
2024-08-16 | Achieving Complex Image Edits via Function Aggregation with Diffusion Models | Mohammadreza Samadi et.al. | 2408.08495 | null |
2024-08-15 | Level Up Your Tutorials: VLMs for Game Tutorials Quality Assessment | Daniele Rege Cambrin et.al. | 2408.08396 | link |
2024-08-15 | METR: Image Watermarking with Large Number of Unique Messages | Alexander Varlamov et.al. | 2408.08340 | link |
2024-08-15 | Accelerated Image-Aware Generative Diffusion Modeling | Tanmay Asthana et.al. | 2408.08306 | null |
2024-08-15 | Rethinking Medical Anomaly Detection in Brain MRI: An Image Quality Assessment Perspective | Zixuan Pan et.al. | 2408.08228 | link |
2024-08-15 | When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding | Pingping Zhang et.al. | 2408.08093 | null |
2024-08-15 | KGV: Integrating Large Language Models with Knowledge Graphs for Cyber Threat Intelligence Credibility Assessment | Zongzong Wu et.al. | 2408.08088 | null |
2024-08-15 | Conditional Brownian Bridge Diffusion Model for VHR SAR to Optical Image Translation | Seon-Hoon Kim et.al. | 2408.07947 | link |
2024-08-15 | MobileMEF: Fast and Efficient Method for Multi-Exposure Fusion | Lucas Nedel Kirsten et.al. | 2408.07932 | link |
2024-08-14 | New Curriculum, New Chance – Retrieval Augmented Generation for Lesson Planning in Ugandan Secondary Schools. Prototype Quality Evaluation | Simon Kloker et.al. | 2408.07542 | null |
2024-08-14 | Unsupervised Blind Joint Dereverberation and Room Acoustics Estimation with Diffusion Models | Jean-Marie Lemercier et.al. | 2408.07472 | null |
2024-08-14 | DPSNN: Spiking Neural Network for Low-Latency Streaming Speech Enhancement | Tao Sun et.al. | 2408.07388 | null |
2024-08-13 | Direction of Arrival Correction through Speech Quality Feedback | Caleb Rascon et.al. | 2408.07234 | link |
2024-08-13 | SeLoRA: Self-Expanding Low-Rank Adaptation of Latent Diffusion Model for Medical Image Synthesis | Yuchen Mao et.al. | 2408.07196 | null |
2024-08-13 | BVI-UGC: A Video Quality Database for User-Generated Content Transcoding | Zihao Qi et.al. | 2408.07171 | null |
2024-08-13 | Efficient Deep Model-Based Optoacoustic Image Reconstruction | Christoph Dehner et.al. | 2408.07109 | null |
2024-08-13 | Subjective and Objective Quality Assessment of Rendered Human Avatar Videos in Virtual Reality | Yu-Chih Chen et.al. | 2408.07041 | null |
2024-08-13 | Feature-Preserving Rate-Distortion Optimization in Image Coding for Machines | Samuel Fernández Menduiña et.al. | 2408.07028 | null |
2024-08-13 | Low-Bitwidth Floating Point Quantization for Efficient High-Quality Diffusion Models | Cheng Chen et.al. | 2408.06995 | null |
2024-08-13 | Evaluating Research Quality with Large Language Models: An Analysis of ChatGPT’s Effectiveness with Different Settings and Inputs | Mike Thelwall et.al. | 2408.06752 | null |
2024-08-13 | Hybrid SD: Edge-Cloud Collaborative Inference for Stable Diffusion Models | Chenqian Yan et.al. | 2408.06646 | null |
2024-08-13 | Potamoi: Accelerating Neural Rendering via a Unified Streaming Architecture | Yu Feng et.al. | 2408.06608 | null |
2024-08-13 | HDRGS: High Dynamic Range Gaussian Splatting | Jiahao Wu et.al. | 2408.06543 | link |
2024-08-12 | FoVNet: Configurable Field-of-View Speech Enhancement with Low Computation and Distortion for Smart Glasses | Zhongweiyang Xu et.al. | 2408.06468 | null |
2024-08-12 | Palantir: Towards Efficient Super Resolution for Ultra-high-definition Live Streaming | Xinqi Jin et.al. | 2408.06152 | link |
2024-08-12 | A-BDD: Leveraging Data Augmentations for Safe Autonomous Driving in Adverse Weather and Lighting | Felix Assion et.al. | 2408.06071 | null |
2024-08-12 | DiagESC: Dialogue Synthesis for Integrating Depression Diagnosis into Emotional Support Conversation | Seungyeon Seo et.al. | 2408.06044 | link |
2024-08-12 | A Sharpness Based Loss Function for Removing Out-of-Focus Blur | Uditangshu Aurangabadkar et.al. | 2408.06014 | link |
2024-08-12 | A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models | Taehong Moon et.al. | 2408.05927 | link |
2024-08-12 | Creating Arabic LLM Prompts at Scale | Abdelrahman El-Sheikh et.al. | 2408.05882 | null |
2024-08-11 | LaWa: Using Latent Space for In-Generation Image Watermarking | Ahmad Rezaei et.al. | 2408.05868 | null |
2024-08-14 | Iterative Improvement of an Additively Regularized Topic Model | Alex Gorbulev et.al. | 2408.05840 | null |
2024-08-11 | SSL: A Self-similarity Loss for Improving Generative Image Super-resolution | Du Chen et.al. | 2408.05713 | link |
2024-08-11 | Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators | Yifan Pu et.al. | 2408.05710 | link |
2024-08-11 | Evaluating BM3D and NBNet: A Comprehensive Study of Image Denoising Across Multiple Datasets | Ghazal Kaviani et.al. | 2408.05697 | null |
2024-08-09 | CBCT scatter correction with dual-layer flat-panel detector | Xin Zhang et.al. | 2408.04943 | null |
2024-08-09 | Self-augmented Gaussian Splatting with Structure-aware Masks for Sparse-view 3D Reconstruction | Lingbei Meng et.al. | 2408.04831 | null |
2024-08-08 | DaedalusData: Exploration, Knowledge Externalization and Labeling of Particles in Medical Manufacturing – A Design Study | Alexander Wyss et.al. | 2408.04749 | null |
2024-08-08 | Sampling for View Synthesis: From Local Light Field Fusion to Neural Radiance Fields and Beyond | Ravi Ramamoorthi et.al. | 2408.04586 | null |
2024-08-11 | Synchronous Multi-modal Semantic Communication System with Packet-level Coding | Yun Tian et.al. | 2408.04535 | null |
2024-08-08 | Robustness investigation of quality measures for the assessment of machine learning models | Thomas Most et.al. | 2408.04391 | null |
2024-08-08 | SG-JND: Semantic-Guided Just Noticeable Distortion Predictor For Image Compression | Linhan Cao et.al. | 2408.04273 | null |
2024-08-08 | LLDif: Diffusion Models for Low-light Emotion Recognition | Zhifeng Wang et.al. | 2408.04235 | null |
2024-08-07 | Performance and Non-adversarial Robustness of the Segment Anything Model 2 in Surgical Video Segmentation | Yiqing Shen et.al. | 2408.04098 | null |
2024-08-07 | Machine Learning-Based Reward-Driven Tuning of Scanning Probe Microscopy: Towards Fully Automated Microscopy | Yu Liu et.al. | 2408.04055 | null |
2024-08-07 | Global-Local Progressive Integration Network for Blind Image Quality Assessment | Xiaoqi Wang et.al. | 2408.03885 | null |
2024-08-07 | Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields | Joo Chan Lee et.al. | 2408.03822 | null |
2024-08-07 | Soft-Hard Attention U-Net Model and Benchmark Dataset for Multiscale Image Shadow Removal | Eirini Cholopoulou et.al. | 2408.03734 | null |
2024-08-07 | Monitoring of Hermit Crabs Using drone-captured imagery and Deep Learning based Super-Resolution Reconstruction and Improved YOLOv8 | Fan Zhao et.al. | 2408.03559 | null |
2024-08-07 | D2Styler: Advancing Arbitrary Style Transfer with Discrete Diffusion Methods | Onkar Susladkar et.al. | 2408.03558 | link |
2024-08-07 | PRTGS: Precomputed Radiance Transfer of Gaussian Splats for Real-Time High-Quality Relighting | Yijia Guo et.al. | 2408.03538 | null |
2024-08-06 | Image Quality Transfer of Diffusion MRI Guided By High-Resolution Structural MRI | Alp G. Cicimen et.al. | 2408.03216 | null |
2024-08-06 | Iterative CT Reconstruction via Latent Variable Optimization of Shallow Diffusion Models | Sho Ozaki et.al. | 2408.03156 | null |
2024-08-05 | VidGen-1M: A Large-Scale Dataset for Text-to-video Generation | Zhiyu Tan et.al. | 2408.02629 | null |
2024-08-05 | Cascading Refinement Video Denoising with Uncertainty Adaptivity | Xinyuan Yu et.al. | 2408.02284 | null |
2024-08-04 | PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance | Aoming Liu et.al. | 2408.02157 | null |
2024-08-06 | RICA2: Rubric-Informed, Calibrated Assessment of Actions | Abrar Majeedi et.al. | 2408.02138 | link |
2024-08-04 | View-consistent Object Removal in Radiance Fields | Yiren Lu et.al. | 2408.02100 | null |
2024-08-04 | Constructing Per-Shot Bitrate Ladders using Visual Information Fidelity | Krishna Srikar Durbha et.al. | 2408.01932 | null |
2024-08-03 | Landmark-guided Diffusion Model for High-fidelity and Temporally Coherent Talking Head Generation | Jintao Tan et.al. | 2408.01732 | null |
2024-08-03 | JambaTalk: Speech-Driven 3D Talking Head Generation Based on Hybrid Transformer-Mamba Model | Farzaneh Jafari et.al. | 2408.01627 | null |
2024-08-02 | Guardians of Image Quality: Benchmarking Defenses Against Adversarial Attacks on Image Quality Metrics | Alexander Gushchin et.al. | 2408.01541 | link |
2024-08-02 | Underwater Object Detection Enhancement via Channel Stabilization | Muhammad Ali et.al. | 2408.01293 | link |
2024-08-02 | Wave-Mamba: Wavelet State Space Model for Ultra-High-Definition Low-Light Image Enhancement | Wenbin Zou et.al. | 2408.01276 | link |
2024-08-02 | Reality Fusion: Robust Real-time Immersive Mobile Robot Teleoperation with Volumetric Visual Data Fusion | Ke Li et.al. | 2408.01225 | link |
2024-08-02 | Validation of an Analysability Model in Hybrid Quantum Software | Díaz-Muñoz Ana et.al. | 2408.01105 | null |
2024-08-06 | FBSDiff: Plug-and-Play Frequency Band Substitution of Diffusion Features for Highly Controllable Text-Driven Image Translation | Xiang Gao et.al. | 2408.00998 | link |
2024-08-01 | SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement | Mark Boss et.al. | 2408.00653 | null |
2024-08-01 | Regional quality estimation for echocardiography using deep learning | Gilles Van De Vyver et.al. | 2408.00591 | link |
2024-08-01 | Image Super-Resolution with Taylor Expansion Approximation and Large Field Reception | Jiancong Feng et.al. | 2408.00470 | null |
2024-08-01 | RDP: Ranked Differential Privacy for Facial Feature Protection in Multiscale Sparsified Subspace | Lu Ou et.al. | 2408.00294 | null |
2024-07-31 | Generative Diffusion Model for Seismic Imaging Improvement of Sparsely Acquired Data and Uncertainty Quantification | Xingchen Shi et.al. | 2407.21683 | null |
2024-07-31 | Benchmarking AIGC Video Quality Assessment: A Dataset and Unified Model | Zhichao Zhang et.al. | 2407.21408 | null |
2024-07-31 | An all-sky catalogue of stellar reddening values | E. Paunzen et.al. | 2407.21373 | null |
2024-07-31 | ESIQA: Perceptual Quality Assessment of Vision-Pro-based Egocentric Spatial Images | Xilei Zhu et.al. | 2407.21363 | null |
2024-08-01 | Outlier Detection in Large Radiological Datasets using UMAP | Mohammad Tariqul Islam et.al. | 2407.21263 | link |
2024-07-30 | MP-You: A Web-based MPI Simulation Tool | The-Vinh Tran-Luu et.al. | 2407.21155 | null |
2024-07-30 | Simultaneous Multi-Slice Diffusion Imaging using Navigator-free Multishot Spiral Acquisition | Yuancheng Jiang et.al. | 2407.20904 | null |
2024-07-30 | Highly Efficient No-reference 4K Video Quality Assessment with Full-Pixel Covering Sampling and Training Strategy | Xiaoheng Tan et.al. | 2407.20766 | null |
2024-07-30 | Questionnaires for Everyone: Streamlining Cross-Cultural Questionnaire Adaptation with GPT-Based Translation Quality Evaluation | Otso Haavisto et.al. | 2407.20608 | link |
2024-07-29 | Mean Opinion Score as a New Metric for User-Evaluation of XAI Methods | Hyeon Yu et.al. | 2407.20427 | null |
2024-07-29 | Sun Off, Lights On: Photorealistic Monocular Nighttime Simulation for Robust Semantic Perception | Konstantinos Tzevelekakis et.al. | 2407.20336 | null |
2024-07-29 | DDAP: Dual-Domain Anti-Personalization against Text-to-Image Diffusion Models | Jing Yang et.al. | 2407.20141 | null |
2024-07-29 | HeadsetOff: Enabling Photorealistic Video Conferencing on Economical VR Headsets | Yili Jin et.al. | 2407.19988 | null |
2024-07-29 | Noise-Resilient Unsupervised Graph Representation Learning via Multi-Hop Feature Quality Estimation | Shiyuan Li et.al. | 2407.19944 | null |
2024-07-29 | FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention | Yu Lu et.al. | 2407.19918 | null |
2024-07-29 | ALEN: A Dual-Approach for Uniform and Non-Uniform Low-Light Image Enhancement | Ezequiel Perez-Zarate et.al. | 2407.19708 | link |
2024-07-29 | UNQA: Unified No-Reference Quality Assessment for Audio, Image, Video, and Audio-Visual Content | Yuqin Cao et.al. | 2407.19704 | null |
2024-07-29 | Semi-Supervised Teacher-Reference-Student Architecture for Action Quality Assessment | Wulian Yun et.al. | 2407.19675 | null |
2024-07-28 | X-Fake: Juggling Utility Evaluation and Explanation of Simulated SAR Images | Zhongling Huang et.al. | 2407.19436 | null |
2024-07-27 | Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network | Gang Pan et.al. | 2407.19271 | null |
2024-07-27 | Towards Clean-Label Backdoor Attacks in the Physical World | Thinh Dao et.al. | 2407.19203 | null |
2024-07-26 | Regularized Multi-Decoder Ensemble for an Error-Aware Scene Representation Network | Tianyu Xiong et.al. | 2407.19082 | null |
2024-07-26 | Correcting for objective sample refractive index mismatch in extended field of view selective plane illumination microscopy | Steven J. Sheppard et.al. | 2407.18862 | null |
2024-07-25 | Joint RGB-Spectral Decomposition Model Guided Image Enhancement in Mobile Photography | Kailai Zhou et.al. | 2407.17996 | link |
2024-07-29 | Invariance of deep image quality metrics to affine transformations | Nuria Alabau-Bosque et.al. | 2407.17927 | link |
2024-07-25 | Artificial Immunofluorescence in a Flash: Rapid Synthetic Imaging from Brightfield Through Residual Diffusion | Xiaodan Xing et.al. | 2407.17882 | null |
2024-07-24 | Final Alignment and Image Quality Test for the Acquisition and Guiding System of SOXS | J. A. Araiza-Duran et.al. | 2407.17382 | null |
2024-07-24 | SOXS NIR: Optomechanical integration and alignment, optical performance verification before full instrument assembly | M. Genoni et.al. | 2407.17244 | null |
2024-07-24 | Q-Ground: Image Quality Grounding with Large Multi-modality Models | Chaofeng Chen et.al. | 2407.17035 | link |
2024-07-24 | 3DAttGAN: A 3D Attention-based Generative Adversarial Network for Joint Space-Time Video Super-Resolution | Congrui Fu et.al. | 2407.16965 | link |
2024-07-24 | SAR to Optical Image Translation with Color Supervised Diffusion Model | Xinyu Bai et.al. | 2407.16921 | null |
2024-07-23 | QPT V2: Masked Image Modeling Advances Visual Scoring | Qizhi Xie et.al. | 2407.16541 | link |
2024-07-23 | ToDER: Towards Colonoscopy Depth Estimation and Reconstruction with Geometry Constraint Adaptation | Zhenhua Wu et.al. | 2407.16508 | null |
2024-07-23 | On Differentially Private 3D Medical Image Synthesis with Controllable Latent Diffusion Models | Deniz Daum et.al. | 2407.16405 | link |
2024-07-23 | Improving multidimensional projection quality with user-specific metrics and optimal scaling | Maniru Ibrahim et.al. | 2407.16328 | null |
2024-07-23 | A new visual quality metric for Evaluating the performance of multidimensional projections | Maniru Ibrahim et.al. | 2407.16309 | null |
2024-07-23 | Integrating Meshes and 3D Gaussians for Indoor Scene Reconstruction with SAM Mask Guidance | Jiyeop Kim et.al. | 2407.16173 | null |
2024-07-23 | Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos | Jiahe Liu et.al. | 2407.16124 | link |
2024-07-22 | Enhancing Cell Instance Segmentation in Scanning Electron Microscopy Images via a Deep Contour Closing Operator | Florian Robert et.al. | 2407.15817 | null |
2024-07-22 | SS-SFR: Synthetic Scenes Spatial Frequency Response on Virtual KITTI and Degraded Automotive Simulations for Object Detection | Daniel Jakab et.al. | 2407.15646 | null |
2024-07-22 | Experimenting with Adaptive Bitrate Algorithms for Virtual Reality Streaming over Wi-Fi | Ferran Maura et.al. | 2407.15614 | link |
2024-07-22 | SpotDiffusion: A Fast Approach For Seamless Panorama Generation Over Time | Stanislav Frolov et.al. | 2407.15507 | link |
2024-07-22 | Enhancement of 3D Gaussian Splatting using Raw Mesh for Photorealistic Recreation of Architectures | Ruizhe Wang et.al. | 2407.15435 | null |
2024-07-21 | Assessing Sample Quality via the Latent Space of Generative Models | Jingyi Xu et.al. | 2407.15171 | link |
2024-07-20 | Non-Reference Quality Assessment for Medical Imaging: Application to Synthetic Brain MRIs | Karl Van Eeden Risager et.al. | 2407.14994 | null |
2024-07-20 | Deep Learning CT Image Restoration using System Blur and Noise Models | Yijie Yuan et.al. | 2407.14983 | null |
2024-07-20 | GreenStableYolo: Optimizing Inference Time and Image Quality of Text-to-Image Generation | Jingzhi Gong et.al. | 2407.14982 | link |
2024-07-20 | Dual High-Order Total Variation Model for Underwater Image Restoration | Yuemei Li et.al. | 2407.14868 | link |
2024-07-20 | CBCTLiTS: A Synthetic, Paired CBCT/CT Dataset For Segmentation And Style Transfer | Maximilian E. Tschuchnig et.al. | 2407.14853 | null |
2024-07-20 | Realistic Surgical Image Dataset Generation Based On 3D Gaussian Splatting | Tianle Zeng et.al. | 2407.14846 | null |
2024-07-20 | Difflare: Removing Image Lens Flare with Latent Diffusion Model | Tianwen Zhou et.al. | 2407.14746 | link |
2024-07-20 | Polarimetric compressed sensing with hollow, self-assembled diffractive films | Ji Feng et.al. | 2407.14722 | null |
2024-07-19 | A Minibatch Alternating Projections Algorithm for Robust and Efficient Magnitude Least-Squares RF Pulse Design in MRI | Jonathan B. Martin et.al. | 2407.14696 | link |
2024-07-19 | A Benchmark for Gaussian Splatting Compression and Quality Assessment Study | Qi Yang et.al. | 2407.14197 | link |
2024-07-19 | Shape and Style GAN-based Multispectral Data Augmentation for Crop/Weed Segmentation in Precision Farming | Mulham Fawakherji et.al. | 2407.14119 | null |
2024-07-19 | DirectL: Efficient Radiance Fields Rendering for 3D Light Field Displays | Zongyuan Yang et.al. | 2407.14053 | null |
2024-07-19 | Personalized Privacy Protection Mask Against Unauthorized Facial Recognition | Ka-Ho Chow et.al. | 2407.13975 | link |
2024-07-18 | Streetscapes: Large-scale Consistent Street View Generation Using Autoregressive Video Diffusion | Boyang Deng et.al. | 2407.13759 | null |
2024-07-18 | A Novel Freeform Slicer IFU for the Magellan InfraRed Multi-Object Spectrograph (MIRMOS) | Maren Cosens et.al. | 2407.13747 | null |
2024-07-18 | HazeCLIP: Towards Language Guided Real-World Image Dehazing | Ruiyi Wang et.al. | 2407.13719 | link |
2024-07-18 | Removing cloud shadows from ground-based solar imagery | Amal Chaoui et.al. | 2407.13379 | null |
2024-07-18 | Any Image Restoration with Efficient Automatic Degradation Adaptation | Bin Ren et.al. | 2407.13372 | link |
2024-07-18 | Heterogeneous Clinical Trial Outcomes via Multi-Output Gaussian Processes | Owen Thomas et.al. | 2407.13283 | null |
2024-07-18 | Research on Image Super-Resolution Reconstruction Mechanism based on Convolutional Neural Network | Hao Yan et.al. | 2407.13211 | null |
2024-07-18 | Learned HDR Image Compression for Perceptually Optimal Storage and Display | Peibei Cao et.al. | 2407.13179 | null |
2024-07-18 | Image Inpainting Models are Effective Tools for Instruction-guided Image Editing | Xuan Ju et.al. | 2407.13139 | null |
2024-07-18 | Enhanced Denoising of OCT Images Using Residual U-Net: A Cross-Modality Approach on PSOCT and ASOCT for Clinical Diagnostics | Akkidas Noel Prakasha et.al. | 2407.13090 | null |
2024-07-17 | Hallucination Index: An Image Quality Metric for Generative Reconstruction Models | Matthew Tivnan et.al. | 2407.12780 | null |
2024-07-17 | CoSIGN: Few-Step Guidance of ConSIstency Model to Solve General INverse Problems | Jiankun Zhao et.al. | 2407.12676 | link |
2024-07-17 | High Frequency Matters: Uncertainty Guided Image Compression with Wavelet Diffusion | Juan Song et.al. | 2407.12538 | link |
2024-07-17 | Fast Context-Based Low-Light Image Enhancement via Neural Implicit Representations | Tomáš Chobola et.al. | 2407.12511 | link |
2024-07-17 | Enhancing Film Grain Coding in VVC: Improving Encoding Quality and Efficiency | Vignesh V Menon et.al. | 2407.12465 | null |
2024-07-17 | Voltage-Controlled Magnetoelectric Devices for Neuromorphic Diffusion Process | Yang Cheng et.al. | 2407.12261 | null |
2024-07-16 | Semantic Communication for the Internet of Sounds: Architecture, Design Principles, and Challenges | Chengsi Liang et.al. | 2407.12203 | null |
2024-07-16 | Neural Passage Quality Estimation for Static Pruning | Xuejun Chang et.al. | 2407.12170 | link |
2024-07-16 | MVG-Splatting: Multi-View Guided Gaussian Splatting with Adaptive Quantile-Based Geometric Consistency Densification | Zhuoxiao Li et.al. | 2407.11840 | null |
2024-07-16 | LoFTI: Localization and Factuality Transfer to Indian Locales | Sona Elza Simon et.al. | 2407.11833 | link |
2024-07-16 | XEdgeAI: A Human-centered Industrial Inspection Framework with Data-centric Explainable Edge AI Approach | Truong Thanh Hung Nguyen et.al. | 2407.11771 | link |
2024-07-16 | ITI-IQA: a Toolbox for Heterogeneous Univariate and Multivariate Missing Data Imputation Quality Assessment | Pedro Pons-Suñer et.al. | 2407.11767 | null |
2024-07-16 | Magnetogram-to-Magnetogram: Generative Forecasting of Solar Evolution | Francesco Pio Ramunno et.al. | 2407.11659 | link |
2024-07-16 | ReLaX-VQA: Residual Fragment and Layer Stack Extraction for Enhancing Video Quality Assessment | Xinyi Wang et.al. | 2407.11496 | link |
2024-07-16 | Cover-separable Fixed Neural Network Steganography via Deep Generative Models | Guobiao Li et.al. | 2407.11405 | link |
2024-07-16 | Ev-GS: Event-based Gaussian splatting for Efficient and Accurate Radiance Field Rendering | Jingqian Wu et.al. | 2407.11343 | null |
2024-07-15 | UFQA: Utility guided Fingerphoto Quality Assessment | Amol S. Joshi et.al. | 2407.11141 | null |
2024-07-15 | Foundational Autoraters: Taming Large Language Models for Better Automatic Evaluation | Tu Vu et.al. | 2407.10817 | null |
2024-07-15 | Melon Fruit Detection and Quality Assessment Using Generative AI-Based Image Data Augmentation | Seungri Yoon et.al. | 2407.10413 | null |
2024-07-15 | Exploring the Impact of Moire Pattern on Deepfake Detectors | Razaib Tariq et.al. | 2407.10399 | null |
2024-07-14 | Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models | Qinyu Yang et.al. | 2407.10285 | link |
2024-07-14 | Low Sensitivity Hopsets | Vikrant Ashvinkumar et.al. | 2407.10249 | null |
2024-07-14 | A Novel Approach to Ultrasound Beamforming using Synthetic Transmit Aperture with Low Complexity and High SNR for Medical Imaging | Thenmozhi Elango et.al. | 2407.10242 | null |
2024-07-13 | Asynchronous Feedback Network for Perceptual Point Cloud Quality Assessment | Yujie Zhang et.al. | 2407.09806 | null |
2024-07-12 | Quantum-dot-based Kitaev chains: Majorana quality measures and scaling with increasing chain length | Viktor Svensson et.al. | 2407.09211 | null |
2024-07-12 | HPC: Hierarchical Progressive Coding Framework for Volumetric Video | Zihan Zheng et.al. | 2407.09026 | null |
2024-07-12 | Task-driven single-image super-resolution reconstruction of document scans | Maciej Zyrek et.al. | 2407.08993 | null |
2024-07-12 | LightenDiffusion: Unsupervised Low-Light Image Enhancement with Latent-Retinex Diffusion Models | Hai Jiang et.al. | 2407.08939 | link |
2024-07-12 | 15M Multimodal Facial Image-Text Dataset | Dawei Dai et.al. | 2407.08515 | null |
2024-07-11 | Imitation Learning for Robotic Assisted Ultrasound Examination of Deep Venous Thrombosis using Kernelized Movement Primitives | Diego Dall’Alba et.al. | 2407.08506 | null |
2024-07-11 | E2VIDiff: Perceptual Events-to-Video Reconstruction using Diffusion Priors | Jinxiu Liang et.al. | 2407.08231 | null |
2024-07-11 | Explicit_NeRF_QA: A Quality Assessment Database for Explicit NeRF Model Compression | Yuke Xing et.al. | 2407.08165 | null |
2024-07-10 | Coherent and Multi-modality Image Inpainting via Latent Space Optimization | Lingzhi Pan et.al. | 2407.08019 | link |
2024-07-10 | Intensity-sensitive quality assessment of extended sources in astronomical images | X. Li et.al. | 2407.07863 | link |
2024-07-12 | Towards Adaptive Pseudo-label Learning for Semi-Supervised Temporal Action Localization | Feixiang Zhou et.al. | 2407.07673 | null |
2024-07-10 | Video In-context Learning | Wentao Zhang et.al. | 2407.07356 | null |
2024-07-10 | Pairwise Distance Distillation for Unsupervised Real-World Image Super-Resolution | Yuehan Zhang et.al. | 2407.07302 | link |
2024-07-09 | HAMIL-QA: Hierarchical Approach to Multiple Instance Learning for Atrial LGE MRI Quality Assessment | K M Arefeen Sultan et.al. | 2407.07254 | null |
2024-07-09 | Scaling Up Personalized Aesthetic Assessment via Task Vector Customization | Jooyeol Yun et.al. | 2407.07176 | link |
2024-07-09 | Microsoft Cloud-based Digitization Workflow with Rich Metadata Acquisition for Cultural Heritage Objects | Krzysztof Kutt et.al. | 2407.06972 | null |
2024-07-09 | CoLA: Conditional Dropout and Language-driven Robust Dual-modal Salient Object Detection | Shuang Hao et.al. | 2407.06780 | link |
2024-07-09 | Masked Video and Body-worn IMU Autoencoder for Egocentric Action Recognition | Mingfang Zhang et.al. | 2407.06628 | null |
2024-07-09 | Sparse-DeRF: Deblurred Neural Radiance Fields from Sparse View | Dogyoon Lee et.al. | 2407.06613 | null |
2024-07-09 | Low-dose, high-resolution CT of infant-sized lungs via propagation-based phase contrast | James A. Pollock et.al. | 2407.06527 | null |
2024-07-08 | MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions | Xuan Ju et.al. | 2407.06358 | null |
2024-07-08 | Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision | Orr Zohar et.al. | 2407.06189 | link |
2024-07-08 | PanDORA: Casual HDR Radiance Acquisition for Indoor Scenes | Mohammad Reza Karimi Dastjerdi et.al. | 2407.06150 | null |
2024-07-08 | Accelerating Diffusion for SAR-to-Optical Image Translation via Adversarial Consistency Distillation | Xinyu Bai et.al. | 2407.06095 | null |
2024-07-08 | Pan-denoising: Guided Hyperspectral Image Denoising via Weighted Represent Coefficient Total Variation | Shuang Xu et.al. | 2407.06064 | link |
2024-07-08 | MobilePortrait: Real-Time One-Shot Neural Head Avatars on Mobile Devices | Jianwen Jiang et.al. | 2407.05712 | null |
2024-07-09 | PCAC-GAN:ASparse-Tensor-Based Generative Adversarial Network for 3D Point Cloud Attribute Compression | Xiaolong Mao et.al. | 2407.05677 | null |
2024-07-08 | GSBIQA: Green Saliency-guided Blind Image Quality Assessment Method | Zhanxuan Mei et.al. | 2407.05590 | null |
2024-07-08 | Audio-driven High-resolution Seamless Talking Head Video Editing via StyleGAN | Jiacheng Su et.al. | 2407.05577 | null |
2024-07-06 | Panopticon: a telescope for our times | Will Saunders et.al. | 2407.05103 | null |
2024-07-06 | CLIPVQA:Video Quality Assessment via CLIP | Fengchuang Xing et.al. | 2407.04928 | link |
2024-07-06 | OmChat: A Recipe to Train Multimodal Language Models with Strong Long Context and Video Understanding | Tiancheng Zhao et.al. | 2407.04923 | null |
2024-07-05 | MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation? | Zhaorun Chen et.al. | 2407.04842 | link |
2024-07-05 | Resource-Efficient Speech Quality Prediction through Quantization Aware Training and Binary Activation Maps | Mattias Nilsson et.al. | 2407.04578 | link |
2024-07-05 | Rethinking Image Compression on the Web with Generative AI | Shayan Ali Hassan et.al. | 2407.04542 | null |
2024-07-05 | Optimizing the image correction pipeline for pedestrian detection in the thermal-infrared domain | Christophe Karam et.al. | 2407.04484 | null |
2024-07-05 | Unsupervised Video Summarization via Reinforcement Learning and a Trained Evaluator | Mehryar Abbasi et.al. | 2407.04258 | null |
2024-07-05 | HCS-TNAS: Hybrid Constraint-driven Semi-supervised Transformer-NAS for Ultrasound Image Segmentation | Renqi Chen et.al. | 2407.04203 | null |
2024-07-04 | Performance of Medical Image Fusion in High-level Analysis Tasks: A Mutual Enhancement Framework for Unaligned PAT and MRI Image Fusion | Yutian Zhong et.al. | 2407.03992 | link |
2024-07-04 | DSMix: Distortion-Induced Sensitivity Map Based Pre-training for No-Reference Image Quality Assessment | Jinsong Shi et.al. | 2407.03886 | link |
2024-07-04 | Perception-Guided Quality Metric of 3D Point Clouds Using Hybrid Strategy | Yujie Zhang et.al. | 2407.03885 | link |
2024-07-04 | DiffRetouch: Using Diffusion to Retouch on the Shoulder of Experts | Zheng-Peng Duan et.al. | 2407.03757 | null |
2024-07-04 | Adaptive sampling strategy for tolerance analysis of freeform optical surfaces based on critical ray aiming | Rundong Fan et.al. | 2407.03688 | null |
2024-07-04 | Pathological Semantics-Preserving Learning for H&E-to-IHC Virtual Staining | Fuqiang Chen et.al. | 2407.03655 | link |
2024-07-04 | Diff-Restorer: Unleashing Visual Prompts for Diffusion-based Universal Image Restoration | Yuhong Zhang et.al. | 2407.03636 | null |
2024-07-04 | Orthogonal Constrained Minimization with Tensor $\ell_{2,p}$ Regularization for HSI Denoising and Destriping | Xiaoxia Liu et.al. | 2407.03605 | null |
2024-07-03 | Semantic-Aware Power Allocation for Generative Semantic Communications with Foundation Models | Chunmei Xu et.al. | 2407.03050 | null |
2024-07-03 | Single Image Rolling Shutter Removal with Diffusion Models | Zhanglei Yang et.al. | 2407.02906 | null |
2024-07-03 | FedPot: A Quality-Aware Collaborative and Incentivized Honeypot-Based Detector for Smart Grid Networks | Abdullatif Albaseer et.al. | 2407.02845 | null |
2024-07-03 | Data Overfitting for On-Device Super-Resolution with Dynamic Algorithm and Compiler Co-Design | Gen Li et.al. | 2407.02813 | link |
2024-07-03 | SF-GNN: Self Filter for Message Lossless Propagation in Deep Graph Neural Network | Yushan Zhu et.al. | 2407.02762 | null |
2024-07-03 | MentalAgora: A Gateway to Advanced Personalized Care in Mental Health through Multi-Agent Debating and Attribute Control | Yeonji Lee et.al. | 2407.02736 | null |
2024-07-02 | Meta 3D Gen | Raphael Bensadoun et.al. | 2407.02599 | null |
2024-07-02 | Off-Grid Ultrasound Imaging by Stochastic Optimization | Vincent van de Schaft et.al. | 2407.02285 | link |
2024-07-02 | SwiftDiffusion: Efficient Diffusion Model Serving with Add-on Modules | Suyi Li et.al. | 2407.02031 | null |
2024-07-01 | Free-text Rationale Generation under Readability Level Control | Yi-Sheng Hsu et.al. | 2407.01384 | null |
2024-07-01 | GaussianStego: A Generalizable Stenography Pipeline for Generative 3D Gaussians Splatting | Chenxin Li et.al. | 2407.01301 | null |
2024-07-01 | Optical turbulence vertical distribution at the Peak Terskol Observatory and Mt. Kurapdag | A. Y. Shikhovtsev et.al. | 2407.00960 | null |
2024-07-01 | Diffusion Transformer Model With Compact Prior for Low-dose PET Reconstruction | Bin Huang et.al. | 2407.00944 | link |
2024-06-30 | A Comparative Study of Quality Evaluation Methods for Text Summarization | Huyen Nguyen et.al. | 2407.00747 | null |
2024-06-30 | DCI: An Accurate Quality Assessment Criteria for Protein Complex Structure Models | Wenda Wang et.al. | 2407.00560 | null |
2024-06-29 | Dynamic Optimization of Video Streaming Quality Using Network Digital Twin Technology | Zurh Farus et.al. | 2407.00513 | null |
2024-07-02 | RTGS: Enabling Real-Time Gaussian Splatting on Mobile Devices Using Efficiency-Guided Pruning and Foveated Rendering | Weikai Lin et.al. | 2407.00435 | link |
2024-06-29 | Benchmark Evaluation of Image Fusion algorithms for Smartphone Camera Capture | Lucas N. Kirsten et.al. | 2407.00301 | null |
2024-06-28 | PathGen-1.6M: 1.6 Million Pathology Image-text Pairs Generation through Multi-agent Collaboration | Yuxuan Sun et.al. | 2407.00203 | null |
2024-06-28 | Quantitative Methods in Research Evaluation Citation Indicators, Altmetrics, and Artificial Intelligence | Mike Thelwall et.al. | 2407.00135 | null |
2024-06-28 | MR-zero meets FLASH – Controlling the transient signal decay in gradient- and rf-spoiled gradient echo sequences | Simon Weinmüller et.al. | 2406.19877 | null |
2024-06-28 | Deep Fusion Model for Brain Tumor Classification Using Fine-Grained Gradient Preservation | Niful Islam et.al. | 2406.19690 | null |
2024-06-28 | UltraGelBot: Autonomous Gel Dispenser for Robotic Ultrasound | Deepak Raina et.al. | 2406.19678 | null |
2024-06-28 | PopAlign: Population-Level Alignment for Fair Text-to-Image Generation | Shufan Li et.al. | 2406.19668 | link |
2024-06-27 | Robustness Testing of Black-Box Models Against CT Degradation Through Test-Time Augmentation | Jack Highton et.al. | 2406.19557 | null |
2024-06-27 | Lightweight Predictive 3D Gaussian Splats | Junli Cao et.al. | 2406.19434 | link |
2024-06-27 | Looking 3D: Anomaly Detection with 2D-3D Alignment | Ankan Bhunia et.al. | 2406.19393 | link |
2024-06-27 | AI Data Readiness Inspector (AIDRIN) for Quantitative Assessment of Data Readiness for AI | Kaveen Hiniduma et.al. | 2406.19256 | null |
2024-06-27 | Staggered Quantizers for Perfect Perceptual Quality: A Connection between Quantizers with Common Randomness and Without | Ruida Zhou et.al. | 2406.19248 | null |
2024-06-27 | Local Manifold Learning for No-Reference Image Quality Assessment | Timin Gao et.al. | 2406.19247 | null |
2024-06-27 | Complex-valued scatter compensation in nonlinear microscopy | Maximilian Sohmen et.al. | 2406.19031 | null |
2024-06-27 | Using diffusion model as constraint: Empower Image Restoration Network Training with Diffusion Model | Jiangtong Tan et.al. | 2406.19030 | link |
2024-06-26 | IDA-UIE: An Iterative Framework for Deep Network-based Degradation Aware Underwater Image Enhancement | Pranjali Singh et.al. | 2406.18628 | null |
2024-06-26 | On Scaling Up 3D Gaussian Splatting Training | Hexu Zhao et.al. | 2406.18533 | link |
2024-06-26 | MultiDiff: Consistent Novel View Synthesis from a Single Image | Norman Müller et.al. | 2406.18524 | null |
2024-06-26 | ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation | Shenghai Yuan et.al. | 2406.18522 | link |
2024-06-26 | MFDNet: Multi-Frequency Deflare Network for Efficient Nighttime Flare Removal | Yiguo Jiang et.al. | 2406.18079 | link |
2024-06-26 | Leveraging Pre-trained Models for FF-to-FFPE Histopathological Image Translation | Qilai Zhang et.al. | 2406.18054 | link |
2024-06-25 | Burst Image Super-Resolution with Base Frame Selection | Sanghyun Kim et.al. | 2406.17869 | null |
2024-06-25 | Sparse-view Signal-domain Photoacoustic Tomography Reconstruction Method Based on Neural Representation | Bowei Yao et.al. | 2406.17578 | null |
2024-06-25 | UHD-IQA Benchmark Database: Pushing the Boundaries of Blind Photo Quality Assessment | Vlad Hosu et.al. | 2406.17472 | null |
2024-06-25 | Leveraging LLMs for Dialogue Quality Measurement | Jinghan Jia et.al. | 2406.17304 | null |
2024-06-25 | HD snapshot diffractive spectral imaging and inferencing | Apratim Majumder et.al. | 2406.17302 | null |
2024-06-25 | Image-Guided Outdoor LiDAR Perception Quality Assessment for Autonomous Driving | Ce Zhang et.al. | 2406.17265 | null |
2024-06-25 | Disentangled Motion Modeling for Video Frame Interpolation | Jaihyun Lew et.al. | 2406.17256 | link |
2024-06-24 | Evaluating the Quality of Hallucination Benchmarks for Large Vision-Language Models | Bei Yan et.al. | 2406.17115 | link |
2024-06-24 | Fine-tuning Diffusion Models for Enhancing Face Quality in Text-to-image Generation | Zhenyi Liao et.al. | 2406.17100 | link |
2024-06-24 | Reducing the Memory Footprint of 3D Gaussian Splatting | Panagiotis Papantonakis et.al. | 2406.17074 | null |
2024-06-24 | 3D distortion-free, reduced field of view diffusion-prepared GRE at 3T | Sarah McElroy et.al. | 2406.16809 | null |
2024-06-24 | Beyond Thumbs Up/Down: Untangling Challenges of Fine-Grained Feedback for Text-to-Image Generation | Katherine M. Collins et.al. | 2406.16807 | null |
2024-06-24 | Vision-Language Consistency Guided Multi-modal Prompt Learning for Blind AI Generated Image Quality Assessment | Jun Fu et.al. | 2406.16641 | link |
2024-06-24 | DaLPSR: Leverage Degradation-Aligned Language Prompt for Real-World Image Super-Resolution | Aiwen Jiang et.al. | 2406.16477 | link |
2024-06-24 | Approximate DCT and Quantization Techniques for Energy-Constrained Image Sensors | Ming-Che Li et.al. | 2406.16358 | null |
2024-06-24 | Priorformer: A UGC-VQA Method with content and distortion priors | Yajing Pei et.al. | 2406.16297 | null |
2024-06-23 | Listen and Move: Improving GANs Coherency in Agnostic Sound-to-Video Generation | Rafael Redondo et.al. | 2406.16155 | null |
2024-06-23 | LGS: A Light-weight 4D Gaussian Splatting for Efficient Surgical Scene Reconstruction | Hengyu Liu et.al. | 2406.16073 | link |
2024-06-22 | Quality-guided Skin Tone Enhancement for Portrait Photography | Shiqi Gao et.al. | 2406.15848 | null |
2024-06-21 | Adaptive Self-Supervised Consistency-Guided Diffusion Model for Accelerated MRI Reconstruction | Mojtaba Safari et.al. | 2406.15656 | null |
2024-06-21 | Contrastive Entity Coreference and Disambiguation for Historical Texts | Abhishek Arora et.al. | 2406.15576 | null |
2024-06-21 | Masked Extended Attention for Zero-Shot Virtual Try-On In The Wild | Nadav Orzech et.al. | 2406.15331 | null |
2024-06-21 | Towards Robust Training Datasets for Machine Learning with Ontologies: A Case Study for Emergency Road Vehicle Detection | Lynn Vonderhaar et.al. | 2406.15268 | null |
2024-06-24 | VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation | Xuan He et.al. | 2406.15252 | null |
2024-06-21 | Unsupervised Bayesian Generation of Synthetic CT from CBCT Using Patient-Specific Score-Based Prior | Junbo Peng et.al. | 2406.15219 | null |
2024-06-21 | Benchmarking Retinal Blood Vessel Segmentation Models for Cross-Dataset and Cross-Disease Generalization | Jeremiah Fadugba et.al. | 2406.14994 | link |
2024-06-21 | Fair Text to Medical Image Diffusion Model with Subgroup Distribution Aligned Tuning | Xu Han et.al. | 2406.14847 | null |
2024-06-21 | Is this a bad table? A Closer Look at the Evaluation of Table Generation from Text | Pritika Ramu et.al. | 2406.14829 | null |
2024-06-20 | Holistic Evaluation for Interleaved Text-and-Image Generation | Minqian Liu et.al. | 2406.14643 | null |
2024-06-20 | A Fuzzy Logic-Based Quality Model For Identifying Microservices With Low Maintainability | Rahime Yilmaz et.al. | 2406.14489 | null |
2024-06-20 | Enhancing multivariate post-processed visibility predictions utilizing CAMS forecasts | Mária Lakatos et.al. | 2406.14159 | null |
2024-06-20 | EnTruth: Enhancing the Traceability of Unauthorized Dataset Usage in Text-to-image Diffusion Models with Minimal and Robust Alterations | Jie Ren et.al. | 2406.13933 | null |
2024-06-19 | IG-CFAT: An Improved GAN-Based Framework for Effectively Exploiting Transformers in Real-World Image Super-Resolution | Alireza Aghelan et.al. | 2406.13815 | link |
2024-06-19 | Convex-hull Estimation using XPSNR for Versatile Video Coding | Vignesh V Menon et.al. | 2406.13712 | null |
2024-06-19 | Assessing the 3D resolution of refocused correlation plenoptic images using a general-purpose image quality estimator | Gianlorenzo Massaro et.al. | 2406.13501 | null |
2024-06-19 | ALiiCE: Evaluating Positional Fine-grained Citation Generation | Yilong Xu et.al. | 2406.13375 | link |
2024-06-19 | AniFaceDiff: High-Fidelity Face Reenactment via Facial Parametric Conditioned Diffusion Models | Ken Chen et.al. | 2406.13272 | null |
2024-06-19 | New methods for ALMA angular-scale based observation scheduling, quality assessment, and beam shaping II: refinements | Dirk Petry et.al. | 2406.13199 | null |
2024-06-18 | NTIRE 2024 Challenge on Night Photography Rendering | Egor Ershov et.al. | 2406.13007 | null |
2024-06-18 | Pattern or Artifact? Interactively Exploring Embedding Quality with TRACE | Edith Heiter et.al. | 2406.12953 | link |
2024-06-18 | Automatic generation of insights from workers’ actions in industrial workflows with explainable Machine Learning | Francisco de Arriba-Pérez et.al. | 2406.12732 | null |
2024-06-18 | Learned Image Compression for HE-stained Histopathological Images via Stain Deconvolution | Maximilian Fischer et.al. | 2406.12623 | null |
2024-06-18 | Training Diffusion Models with Federated Learning | Matthijs de Goede et.al. | 2406.12575 | null |
2024-06-18 | Automated MRI Quality Assessment of Brain T1-weighted MRI in Clinical Data Warehouses: A Transfer Learning Approach Relying on Artefact Simulation | Sophie Loizillon et.al. | 2406.12448 | link |
2024-06-18 | AI-Assisted Human Evaluation of Machine Translation | Vilém Zouhar et.al. | 2406.12419 | link |
2024-06-18 | SDNIA-YOLO: A Robust Object Detection Model for Extreme Weather Conditions | Yuexiong Ding et.al. | 2406.12395 | null |
2024-06-17 | A Hierarchical 3D Gaussian Representation for Real-Time Rendering of Very Large Datasets | Bernhard Kerbl et.al. | 2406.12080 | null |
2024-06-17 | FinTruthQA: A Benchmark Dataset for Evaluating the Quality of Financial Information Disclosure | Ziyue Xu et.al. | 2406.12009 | link |
2024-06-17 | RetinaGS: Scalable Training for Dense Scene Rendering with Billion-Scale 3D Gaussians | Bingling Li et.al. | 2406.11836 | null |
2024-06-17 | Latent Denoising Diffusion GAN: Faster sampling, Higher image quality | Luan Thanh Trinh et.al. | 2406.11713 | link |
2024-06-17 | Multimodal Learning To Improve Segmentation With Intraoperative CBCT & Preoperative CT | Maximilian E. Tschuchnig et.al. | 2406.11650 | null |
2024-06-17 | Unveiling the Power of Source: Source-based Minimum Bayes Risk Decoding for Neural Machine Translation | Boxuan Lyu et.al. | 2406.11632 | null |
2024-06-17 | Compressed Skinning for Facial Blendshapes | Ladislav Kavan et.al. | 2406.11597 | null |
2024-06-17 | Energy Reduction Opportunities in HDR Video Encoding | Christian Herglotz et.al. | 2406.11492 | null |
2024-06-17 | A Dictionary Based Approach for Removing Out-of-Focus Blur | Uditangshu Aurangabadkar et.al. | 2406.11330 | link |
2024-06-17 | NLDF: Neural Light Dynamic Fields for Efficient 3D Talking Head Generation | Niu Guanchen et.al. | 2406.11259 | null |
2024-06-17 | Incentivizing Quality Text Generation via Statistical Contracts | Eden Saig et.al. | 2406.11118 | link |
2024-06-16 | Parameter Blending for Multi-Camera Harmonization for Automotive Surround View Systems | Yuzhuo Ren et.al. | 2406.11066 | null |
2024-06-16 | SingMOS: An extensive Open-Source Singing Voice Dataset for MOS Prediction | Yuxun Tang et.al. | 2406.10911 | null |
2024-06-15 | MDeRainNet: An Efficient Neural Network for Rain Streak Removal from Macro-pixel Images | Tao Yan et.al. | 2406.10652 | null |
2024-06-15 | Exploring the Impact of AI-generated Image Tools on Professional and Non-professional Users in the Art and Design Fields | Yuying Tang et.al. | 2406.10640 | null |
2024-06-15 | Full reference point cloud quality assessment using support vector regression | Ryosuke Watanabe et.al. | 2406.10520 | link |
2024-06-15 | CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation | Wei Chen et.al. | 2406.10462 | null |
2024-06-14 | Consistency-diversity-realism Pareto fronts of conditional image generative models | Pietro Astolfi et.al. | 2406.10429 | null |
2024-06-14 | PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian Splatting | Alex Hanson et.al. | 2406.10219 | link |
2024-06-14 | AlignNet: Learning dataset score alignment functions to enable better training of speech quality estimators | Jaden Pieper et.al. | 2406.10205 | null |
2024-06-14 | D-NPC: Dynamic Neural Point Clouds for Non-Rigid View Synthesis from Monocular Video | Moritz Kappel et.al. | 2406.10078 | null |
2024-06-14 | Vision Language Modeling of Content, Distortion and Appearance for Image Quality Assessment | Fei Zhou et.al. | 2406.09858 | null |
2024-06-14 | Full-reference Point Cloud Quality Assessment Using Spectral Graph Wavelets | Ryosuke Watanabe et.al. | 2406.09762 | null |
2024-06-14 | Compressed Video Quality Enhancement with Temporal Group Alignment and Fusion | Qiang Zhu et.al. | 2406.09693 | null |
2024-06-13 | DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided Transformer | Wei-Ting Chen et.al. | 2406.09622 | null |
2024-06-13 | Q-Mamba: On First Exploration of Vision Mamba for Image Quality Assessment | Fengbin Guan et.al. | 2406.09546 | null |
2024-06-13 | Modeling Ambient Scene Dynamics for Free-view Synthesis | Meng-Li Shih et.al. | 2406.09395 | null |
2024-06-14 | WonderWorld: Interactive 3D Scene Generation from a Single Image | Hong-Xing Yu et.al. | 2406.09394 | null |
2024-06-13 | LRM-Zero: Training Large Reconstruction Models with Synthesized Data | Desai Xie et.al. | 2406.09371 | link |
2024-06-13 | CMC-Bench: Towards a New Paradigm of Visual Signal Compression | Chunyi Li et.al. | 2406.09356 | link |
2024-06-13 | StableMaterials: Enhancing Diversity in Material Generation via Semi-Supervised Learning | Giuseppe Vecchio et.al. | 2406.09293 | null |
2024-06-13 | SR-CACO-2: A Dataset for Confocal Fluorescence Microscopy Image Super-Resolution | Soufiane Belharbi et.al. | 2406.09168 | link |
2024-06-13 | Adaptive Cooperative Streaming of Holographic Video Over Wireless Networks: A Proximal Policy Optimization Solution | Wanli Wen et.al. | 2406.08806 | null |
2024-06-13 | Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation | Mingwang Xu et.al. | 2406.08801 | null |
2024-06-13 | FouRA: Fourier Low Rank Adaptation | Shubhankar Borse et.al. | 2406.08798 | null |
2024-06-12 | Exploring Fact Memorization and Style Imitation in LLMs Using QLoRA: An Experimental Study and Quality Assessment Methods | Eugene Vyborov et.al. | 2406.08582 | null |
2024-06-12 | IMFL-AIGC: Incentive Mechanism Design for Federated Learning Empowered by Artificial Intelligence Generated Content | Guangjing Huang et.al. | 2406.08526 | null |
2024-06-12 | DDR: Exploiting Deep Degradation Response as Flexible Image Descriptor | Juncheng Wu et.al. | 2406.08377 | link |
2024-06-12 | WMAdapter: Adding WaterMark Control to Latent Diffusion Models | Hai Ci et.al. | 2406.08337 | null |
2024-06-12 | Guiding In-Context Learning of LLMs through Quality Estimation for Machine Translation | Javad Pourmostafa Roshan Sharami et.al. | 2406.07970 | link |
2024-06-12 | DemosaicFormer: Coarse-to-Fine Demosaicing Network for HybridEVS Camera | Senyan Xu et.al. | 2406.07951 | link |
2024-06-12 | Emotional Conversation: Empowering Talking Faces with Cohesive Expression, Gaze and Pose Generation | Jiadong Liang et.al. | 2406.07895 | null |
2024-06-11 | A PRISMA Driven Systematic Review of Publicly Available Datasets for Benchmark and Model Developments for Industrial Defect Detection | Can Akbas et.al. | 2406.07694 | null |
2024-06-11 | Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? | Xingyu Fu et.al. | 2406.07546 | null |
2024-06-11 | Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Kuan Heng Lin et.al. | 2406.07540 | null |
2024-06-11 | Trim 3D Gaussian Splatting for Accurate Geometry Representation | Lue Fan et.al. | 2406.07499 | null |
2024-06-11 | Textual Similarity as a Key Metric in Machine Translation Quality Estimation | Kun Sun et.al. | 2406.07440 | null |
2024-06-11 | Redefining Automotive Radar Imaging: A Domain-Informed 1D Deep Learning Approach for High-Resolution and Efficient Performance | Ruxin Zheng et.al. | 2406.07399 | null |
2024-06-11 | DiffCom: Channel Received Signal is a Natural Condition to Guide Diffusion Posterior Sampling | Sixian Wang et.al. | 2406.07390 | null |
2024-06-11 | Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment | Takuto Igarashi et.al. | 2406.07280 | null |
2024-06-11 | Accurate estimate of the ESPRESSO fiber-injection losses inferred from integrated field-stabilization images | Tobias M. Schmidt et.al. | 2406.07193 | null |
2024-06-11 | Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation | Yuanhao Zhai et.al. | 2406.06890 | link |
2024-06-11 | A Subjective Quality Evaluation of 3D Mesh with Dynamic Level of Detail in Virtual Reality | Duc Nguyen et.al. | 2406.06888 | null |
2024-06-09 | Latent Diffusion Model-Enabled Real-Time Semantic Communication Considering Semantic Ambiguities and Channel Noises | Jianhua Pei et.al. | 2406.06644 | null |
2024-06-10 | Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation | Peize Sun et.al. | 2406.06525 | link |
2024-06-10 | MVGamba: Unify 3D Content Generation as State Space Sequence Modeling | Xuanyu Yi et.al. | 2406.06367 | null |
2024-06-10 | Federated learning in food research | Zuzanna Fendor et.al. | 2406.06202 | null |
2024-06-10 | Generalized Nested Latent Variable Models for Lossy Coding applied to Wind Turbine Scenarios | Raül Pérez-Gonzalo et.al. | 2406.06165 | null |
2024-06-10 | JenGAN: Stacked Shifted Filters in GAN-Based Speech Synthesis | Hyunjae Cho et.al. | 2406.06111 | null |
2024-06-10 | GAIA: Rethinking Action Quality Assessment for AI-Generated Videos | Zijian Chen et.al. | 2406.06087 | link |
2024-06-10 | FRAG: Frequency Adapting Group for Diffusion Video Editing | Sunjae Yoon et.al. | 2406.06044 | link |
2024-06-12 | MLCM: Multistep Consistency Distillation of Latent Diffusion Model | Qingsong Xie et.al. | 2406.05768 | link |
2024-06-08 | Energy-Efficient Approximate Full Adders Applying Memristive Serial IMPLY Logic For Image Processing | Seyed Erfan Fatemieh et.al. | 2406.05525 | null |
2024-06-08 | Blurry-Consistency Segmentation Framework with Selective Stacking on Differential Interference Contrast 3D Breast Cancer Spheroid | Thanh-Huy Nguyen et.al. | 2406.05349 | null |
2024-06-08 | Deep convolutional demosaicking network for multispectral polarization filter array | Tomoharu Ishiuchi et.al. | 2406.05312 | null |
2024-06-08 | YouTube SFV+HDR Quality Dataset | Yilin Wang et.al. | 2406.05305 | null |
2024-06-07 | Spectral Codecs: Spectrogram-Based Audio Codecs for High Quality Speech Synthesis | Ryan Langman et.al. | 2406.05298 | null |
2024-06-07 | GANetic Loss for Generative Adversarial Networks with a Focus on Medical Applications | Shakhnaz Akhmedova et.al. | 2406.05023 | link |
2024-06-07 | Ada-VE: Training-Free Consistent Video Editing Using Adaptive Motion Prior | Tanvir Mahmud et.al. | 2406.04873 | link |
2024-06-07 | SMC++: Masked Learning of Unsupervised Video Semantic Compression | Yuan Tian et.al. | 2406.04765 | link |
2024-06-07 | The Active Optics System on the Vera C. Rubin Observatory: Optimal Control of Degeneracy Among the Large Number of Degrees of Freedom | Guillem Megias Homar et.al. | 2406.04656 | null |
2024-06-07 | GenzIQA: Generalized Image Quality Assessment using Prompt-Guided Latent Diffusion Models | Diptanu De et.al. | 2406.04654 | null |
2024-06-07 | StreamOptix: A Cross-layer Adaptive Video Delivery Scheme | Mufan Liu et.al. | 2406.04632 | link |
2024-06-07 | Attention Fusion Reverse Distillation for Multi-Lighting Image Anomaly Detection | Yiheng Zhang et.al. | 2406.04573 | null |
2024-06-06 | Improving Geo-diversity of Generated Images with Contextualized Vendi Score Guidance | Reyhane Askari Hemmat et.al. | 2406.04551 | null |
2024-06-06 | A Versatile Collage Visualization Technique | Zhenyu Wang et.al. | 2406.04008 | null |
2024-06-06 | JIGMARK: A Black-Box Approach for Enhancing Image Watermarks against Diffusion Model Edits | Minzhou Pan et.al. | 2406.03720 | link |
2024-06-06 | Superpoint Gaussian Splatting for Real-Time High-Fidelity Dynamic Scene Reconstruction | Diwen Wan et.al. | 2406.03697 | link |
2024-06-05 | Anatomy-based quality metric of diffusion-weighted MRI data for accurate derivation of muscle fiber orientation | Nadya Shusharina et.al. | 2406.03560 | null |
2024-06-05 | Globally and Locally Optimized Pannini Projection for High FoV Rendering of 360-degree Images | Falah Jabar et.al. | 2406.03282 | null |
2024-06-05 | FAPNet: An Effective Frequency Adaptive Point-based Eye Tracker | Xiaopeng Lin et.al. | 2406.03177 | null |
2024-06-05 | Dynamic 3D Gaussian Fields for Urban Areas | Tobias Fischer et.al. | 2406.03175 | null |
2024-06-05 | The new Herschel/PACS Point Source Catalogue | Gábor Marton et.al. | 2406.03116 | null |
2024-06-05 | A-Bench: Are LMMs Masters at Evaluating AI-generated Images? | Zicheng Zhang et.al. | 2406.03070 | link |
2024-06-05 | DifAttack++: Query-Efficient Black-Box Adversarial Attack via Hierarchical Disentangled Feature Space in Cross Domain | Jun Liu et.al. | 2406.03017 | link |
2024-06-05 | Efficient Minimum Bayes Risk Decoding using Low-Rank Matrix Completion Algorithms | Firas Trabelsi et.al. | 2406.02832 | null |
2024-06-04 | ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation | Tianchen Zhao et.al. | 2406.02540 | link |
2024-06-04 | Guiding a Diffusion Model with a Bad Version of Itself | Tero Karras et.al. | 2406.02507 | null |
2024-06-04 | Towards AI-Assisted Sustainable Adaptive Video Streaming Systems: Tutorial and Survey | Reza Farahani et.al. | 2406.02302 | null |
2024-06-04 | I4VGen: Image as Stepping Stone for Text-to-Video Generation | Xiefan Guo et.al. | 2406.02230 | null |
2024-06-04 | OTTAWA: Optimal TransporT Adaptive Word Aligner for Hallucination and Omission Translation Errors Detection | Chenyang Huang et.al. | 2406.01919 | null |
2024-06-04 | Rank-based No-reference Quality Assessment for Face Swapping | Xinghui Zhou et.al. | 2406.01884 | null |
2024-06-03 | Video Coding with Cross-Component Sample Offset | Han Gao et.al. | 2406.01795 | null |
2024-06-03 | DEFT: Efficient Finetuning of Conditional Diffusion Models by Learning the Generalised $h$ -transform | Alexander Denker et.al. | 2406.01781 | link |
2024-06-03 | Enhancing Dynamic CT Image Reconstruction with Neural Fields Through Explicit Motion Regularizers | Pablo Arratia et.al. | 2406.01299 | null |
2024-06-03 | Capsule Enhanced Variational AutoEncoder for Underwater Image Reconstruction | Rita Pucci et.al. | 2406.01294 | link |
2024-06-03 | Dimba: Transformer-Mamba Diffusion Models | Zhengcong Fei et.al. | 2406.01159 | null |
2024-06-03 | Visual Car Brand Classification by Implementing a Synthetic Image Dataset Creation Pipeline | Jan Lippemeier et.al. | 2406.01071 | null |
2024-06-03 | UniQA: Unified Vision-Language Pre-training for Image Quality and Aesthetic Assessment | Hantao Zhou et.al. | 2406.01069 | link |
2024-06-03 | CLIP-Guided Attribute Aware Pretraining for Generalizable Image Quality Assessment | Daekyu Kwon et.al. | 2406.01020 | null |
2024-06-02 | EchoNet-Synthetic: Privacy-preserving Video Generation for Safe Medical Data Sharing | Hadrien Reynaud et.al. | 2406.00808 | link |
2024-06-04 | Unsupervised Contrastive Analysis for Salient Pattern Detection using Conditional Diffusion Models | Cristiano Patrício et.al. | 2406.00772 | link |
2024-06-02 | W-Net: A Facial Feature-Guided Face Super-Resolution Network | Hao Liu et.al. | 2406.00676 | null |
2024-06-01 | Bilateral Guided Radiance Field Processing | Yuehao Wang et.al. | 2406.00448 | null |
2024-06-01 | Localize, Understand, Collaborate: Semantic-Aware Dragging via Intention Reasoner | Xing Cui et.al. | 2406.00432 | link |
2024-06-01 | Hybrid attention structure preserving network for reconstruction of under-sampled OCT images | Zezhao Guo et.al. | 2406.00279 | null |
2024-05-31 | Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis | Chaoyou Fu et.al. | 2405.21075 | null |
2024-05-31 | Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling | Jiatao Gu et.al. | 2405.21048 | null |
2024-05-31 | Tsang’s resolution enhancement method for imaging with focused illumination | Alexander Duplinskiy et.al. | 2405.20979 | null |
2024-05-31 | Fourier123: One Image to High-Quality 3D Object Generation with Hybrid Fourier Score Distillation | Shuzhou Yang et.al. | 2405.20669 | link |
2024-05-30 | An Automatic Question Usability Evaluation Toolkit | Steven Moore et.al. | 2405.20529 | link |
2024-05-30 | Can No-Reference Quality-Assessment Methods Serve as Perceptual Losses for Super-Resolution? | Egor Kashkarov et.al. | 2405.20392 | null |
2024-05-30 | CoSy: Evaluating Textual Explanations of Neurons | Laura Kopf et.al. | 2405.20331 | link |
2024-05-31 | NeRF View Synthesis: Subjective Quality Assessment and Objective Metrics Evaluation | Pedro Martin et.al. | 2405.20078 | null |
2024-05-30 | Promptus: Can Prompts Streaming Replace Video Streaming with Stable Diffusion | Jiangkai Wu et.al. | 2405.20032 | null |
2024-06-03 | DP-IQA: Utilizing Diffusion Prior for Blind Image Quality Assessment in the Wild | Honghao Fu et.al. | 2405.19996 | link |
2024-05-29 | CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learning | Yiping Wang et.al. | 2405.19547 | null |
2024-05-29 | A Full-duplex Speech Dialogue Scheme Based On Large Language Models | Peng Wang et.al. | 2405.19487 | null |
2024-05-29 | VisTA-SR: Improving the Accuracy and Resolution of Low-Cost Thermal Imaging Cameras for Agriculture | Heesup Yun et.al. | 2405.19413 | null |
2024-05-29 | Adaptive Image Quality Assessment via Teaching Large Multimodal Model to Compare | Hanwei Zhu et.al. | 2405.19298 | link |
2024-05-29 | A study on the adequacy of common IQA measures for medical images | Anna Breger et.al. | 2405.19224 | link |
2024-05-29 | A study of why we need to reassess full reference image quality assessment with medical images | Anna Breger et.al. | 2405.19097 | null |
2024-05-31 | Benchmarking and Improving Detail Image Caption | Hongyuan Dong et.al. | 2405.19092 | link |
2024-05-29 | Tuning-Free Alignment of Diffusion Models with Direct Noise Optimization | Zhiwei Tang et.al. | 2405.18881 | link |
2024-05-29 | Descriptive Image Quality Assessment in the Wild | Zhiyuan You et.al. | 2405.18842 | null |
2024-05-29 | Opinion-Unaware Blind Image Quality Assessment using Multi-Scale Deep Feature Statistics | Zhangkai Ni et.al. | 2405.18790 | link |
2024-05-28 | Are Image Distributions Indistinguishable to Humans Indistinguishable to Classifiers? | Zebin You et.al. | 2405.18029 | null |
2024-05-30 | Proof of Quality: A Costless Paradigm for Trustless Generative AI Model Inference on Blockchains | Zhenjie Zhang et.al. | 2405.17934 | null |
2024-05-30 | MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization | Tianchen Zhao et.al. | 2405.17873 | null |
2024-05-28 | PTM-VQA: Efficient Video Quality Assessment Leveraging Diverse PreTrained Models from the Wild | Kun Yuan et.al. | 2405.17765 | null |
2024-05-28 | AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval | Sihe Zhang et.al. | 2405.17718 | null |
2024-05-27 | Enhancing Global Sensitivity and Uncertainty Quantification in Medical Image Reconstruction with Monte Carlo Arbitrary-Masked Mamba | Jiahao Huang et.al. | 2405.17659 | null |
2024-05-27 | Partitioned Hankel-based Diffusion Models for Few-shot Low-dose CT Reconstruction | Wenhao Zhang et.al. | 2405.17167 | null |
2024-05-28 | F-3DGS: Factorized Coordinates and Representations for 3D Gaussian Splatting | Xiangyu Sun et.al. | 2405.17083 | null |
2024-05-29 | The Multi-Range Theory of Translation Quality Measurement: MQM scoring models and Statistical Quality Control | Arle Lommel et.al. | 2405.16969 | null |
2024-05-27 | EM Distillation for One-step Diffusion Models | Sirui Xie et.al. | 2405.16852 | null |
2024-05-27 | Controlling Rate, Distortion, and Realism: Towards a Single Comprehensive Neural Image Compression Model | Shoma Iwai et.al. | 2405.16817 | link |
2024-05-26 | Coil Reweighting to Suppress Motion Artifacts in Real-Time Exercise Cine Imaging | Chong Chen et.al. | 2405.16715 | null |
2024-05-26 | Deep learning improved autofocus for motion artifact reduction and its application in quantitative susceptibility mapping | Chao Li et.al. | 2405.16664 | null |
2024-05-26 | Looks Too Good To Be True: An Information-Theoretic Analysis of Hallucinations in Generative Restoration Models | Regev Cohen et.al. | 2405.16475 | null |
2024-05-25 | Enhancing Consistency-Based Image Generation via Adversarialy-Trained Classification and Energy-Based Discrimination | Shelly Golan et.al. | 2405.16260 | null |
2024-05-25 | Maintaining and Managing Road Quality:Using MLP and DNN | Makgotso Jacqueline Maotwana et.al. | 2405.16196 | null |
2024-05-25 | Accelerating Inference of Retrieval-Augmented Generation via Sparse Context Selection | Yun Zhu et.al. | 2405.16178 | null |
2024-05-24 | Diff-DTI: Fast Diffusion Tensor Imaging Using A Feature-Enhanced Joint Diffusion Model | Lang Zhang et.al. | 2405.15830 | null |
2024-05-24 | Erase to Enhance: Data-Efficient Machine Unlearning in MRI Reconstruction | Yuyang Xue et.al. | 2405.15517 | link |
2024-05-24 | Benchmarking Pre-trained Large Language Models’ Potential Across Urdu NLP tasks | Munief Hassan Tahir et.al. | 2405.15453 | null |
2024-05-24 | Fieldscale: Locality-Aware Field-based Adaptive Rescaling for Thermal Infrared Image | Hyeonjae Gil et.al. | 2405.15395 | link |
2024-05-24 | CPT-Interp: Continuous sPatial and Temporal Motion Modeling for 4D Medical Image Interpolation | Xia Li et.al. | 2405.15385 | null |
2024-05-24 | Seeing the World through an Antenna’s Eye: Reception Quality Visualization Using Incomplete Technical Signal Information | Leif Bergerhoff et.al. | 2405.15253 | null |
2024-05-24 | Improved Distribution Matching Distillation for Fast Image Synthesis | Tianwei Yin et.al. | 2405.14867 | link |
2024-05-23 | Physics-informed Score-based Diffusion Model for Limited-angle Reconstruction of Cardiac Computed Tomography | Shuo Han et.al. | 2405.14770 | null |
2024-05-23 | Convolutional Neural Network Model Observers Discount Signal-like Anatomical Structures During Search in Virtual Digital Breast Tomosynthesis Phantoms | Aditya Jonnalagadda et.al. | 2405.14720 | null |
2024-05-23 | OpFlowTalker: Realistic and Natural Talking Face Generation via Optical Flow Guidance | Shuheng Ge et.al. | 2405.14709 | null |
2024-05-24 | Autoregressive Image Diffusion: Generation of Image Sequence and Application in MRI | Guanxiong Luo et.al. | 2405.14327 | link |
2024-05-23 | Survey on Visual Signal Coding and Processing with Generative Models: Technologies, Standards and Optimization | Zhibo Chen et.al. | 2405.14221 | null |
2024-05-22 | Uncertainty-aware Evaluation of Auxiliary Anomalies with the Expected Anomaly Posterior | Lorenzo Perini et.al. | 2405.13699 | null |
2024-05-22 | Euclid: Early Release Observations – Programme overview and pipeline for compact- and diffuse-emission photometry | J. -C. Cuillandre et.al. | 2405.13496 | null |
2024-05-25 | Class-Conditional self-reward mechanism for improved Text-to-Image models | Safouane El Ghazouali et.al. | 2405.13473 | link |
2024-05-22 | Comparative Analysis of Hyperspectral Image Reconstruction Using Deep Learning for Agricultural and Biological Applications | Md. Toukir Ahmed et.al. | 2405.13331 | null |
2024-05-21 | Geometric Transformation Uncertainty for Improving 3D Fetal Brain Pose Prediction from Freehand 2D Ultrasound Videos | Jayroop Ramesh et.al. | 2405.13235 | link |
2024-05-24 | Computational Tradeoffs in Image Synthesis: Diffusion, Masked-Token, and Next-Token Prediction | Maciej Kilian et.al. | 2405.13218 | null |
2024-05-21 | NieR: Normal-Based Lighting Scene Rendering | Hongsheng Wang et.al. | 2405.13097 | null |
2024-05-21 | MOSS: Motion-based 3D Clothed Human Synthesis from Monocular Video | Hongsheng Wang et.al. | 2405.12806 | null |
2024-05-21 | Is Dataset Quality Still a Concern in Diagnosis Using Large Foundation Model? | Ziqin Lin et.al. | 2405.12584 | null |
2024-05-20 | Paired Conditional Generative Adversarial Network for Highly Accelerated Liver 4D MRI | Di Xu et.al. | 2405.12357 | null |
2024-05-20 | Deep learning-based hyperspectral image reconstruction for quality assessment of agro-product | Md. Toukir Ahmed et.al. | 2405.12313 | null |
2024-05-20 | GGAvatar: Geometric Adjustment of Gaussian Head Avatar | Xinyang Li et.al. | 2405.11993 | null |
2024-05-20 | On Efficient and Statistical Quality Estimation for Data Annotation | Jan-Christoph Klie et.al. | 2405.11919 | null |
2024-05-20 | ViViD: Video Virtual Try-on using Diffusion Models | Zixun Fang et.al. | 2405.11794 | null |
2024-05-19 | Solar image quality assessment: a proof of concept using Variance of Laplacian method and its application to optical atmospheric condition monitoring | Chu Wing So et.al. | 2405.11490 | null |
2024-05-18 | Sampling Strategies for Mitigating Bias in Face Synthesis Methods | Emmanouil Maragkoudakis et.al. | 2405.11320 | null |
2024-05-18 | Dreamer XL: Towards High-Resolution Text-to-3D Generation via Trajectory Score Matching | Xingyu Miao et.al. | 2405.11252 | link |
2024-05-18 | Testing the Performance of Face Recognition for People with Down Syndrome | Christian Rathgeb et.al. | 2405.11240 | null |
2024-05-21 | SPOR: A Comprehensive and Practical Evaluation Method for Compositional Generalization in Data-to-Text Generation | Ziyao Xu et.al. | 2405.10650 | link |
2024-05-17 | Simultaneous Deep Learning of Myocardium Segmentation and T2 Quantification for Acute Myocardial Infarction MRI | Yirong Zhou et.al. | 2405.10570 | null |
2024-05-17 | Enhancing Perception Quality in Remote Sensing Image Compression via Invertible Neural Network | Junhui Li et.al. | 2405.10518 | null |
2024-05-16 | Diversity-Aware Sign Language Production through a Pose Encoding Variational Autoencoder | Mohamed Ilyes Lakhal et.al. | 2405.10423 | null |
2024-05-16 | GS-Planner: A Gaussian-Splatting-based Planning Framework for Active High-Fidelity Reconstruction | Rui Jin et.al. | 2405.10142 | null |
2024-05-16 | Semantic Communication via Rate Distortion Perception Bottleneck | Zihe Zhao et.al. | 2405.09995 | null |
2024-05-16 | VirtualModel: Generating Object-ID-retentive Human-object Interaction Image by Diffusion Model for E-commerce Marketing | Binghui Chen et.al. | 2405.09985 | null |
2024-05-16 | NTIRE 2024 Restore Any Image Model (RAIM) in the Wild Challenge | Jie Liang et.al. | 2405.09923 | null |
2024-05-16 | DiffAM: Diffusion-based Adversarial Makeup Transfer for Facial Privacy Protection | Yuhao Sun et.al. | 2405.09882 | link |
2024-05-15 | Perception- and Fidelity-aware Reduced-Reference Super-Resolution Image Quality Assessment | Xinying Lin et.al. | 2405.09472 | null |
2024-05-16 | Global-Local Image Perceptual Score (GLIPS): Evaluating Photorealistic Quality of AI-Generated Images | Memoona Aziz et.al. | 2405.09426 | null |
2024-05-15 | Application of Gated Recurrent Units for CT Trajectory Optimization | Yuedong Yuan et.al. | 2405.09333 | null |
2024-05-21 | Deep Blur Multi-Model (DeepBlurMM) - a strategy to mitigate the impact of image blur on deep learning model performance in histopathology image analysis | Yujie Xiang et.al. | 2405.09298 | null |
2024-05-15 | Sensitivity Decouple Learning for Image Compression Artifacts Reduction | Li Ma et.al. | 2405.09291 | null |
2024-05-15 | Shacl4Bib: custom validation of library data | Péter Király et.al. | 2405.09177 | null |
2024-05-18 | Scalable Image Coding for Humans and Machines Using Feature Fusion Network | Takahiro Shindo et.al. | 2405.09152 | link |
2024-05-15 | RSHazeDiff: A Unified Fourier-aware Diffusion Model for Remote Sensing Image Dehazing | Jiamei Xiong et.al. | 2405.09083 | link |
2024-05-14 | Chemically peculiar stars on the pre-main sequence | L. Kueß et.al. | 2405.08946 | null |
2024-05-14 | Enhancing Blind Video Quality Assessment with Rich Quality-aware Features | Wei Sun et.al. | 2405.08745 | link |
2024-05-13 | The Lost Melody: Empirical Observations on Text-to-Video Generation From A Storytelling Perspective | Andrew Shin et.al. | 2405.08720 | null |
2024-05-14 | Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs | P. Mas-Buitrago et.al. | 2405.08703 | link |
2024-05-15 | RMT-BVQA: Recurrent Memory Transformer-based Blind Video Quality Assessment for Enhanced Video Content | Tianhao Peng et.al. | 2405.08621 | null |
2024-05-14 | Dual-Branch Network for Portrait Image Quality Assessment | Wei Sun et.al. | 2405.08555 | link |
2024-05-14 | WaterMamba: Visual State Space Model for Underwater Image Enhancement | Meisheng Guan et.al. | 2405.08419 | null |
2024-05-14 | Perivascular space Identification Nnunet for Generalised Usage (PINGU) | Benjamin Sinclair et.al. | 2405.08337 | link |
2024-05-14 | Progressive enhancement and restoration for mural images under low-light and defected conditions based on multi-receptive field strategy | Xiameng Wei et.al. | 2405.08245 | link |
2024-05-13 | Quality of Experience Optimization for Real-time XR Video Transmission with Energy Constraints | Guangjin Pan et.al. | 2405.07689 | null |
2024-05-15 | PRANK: a singular value based noise filtering approach | Francesco Trainotti et.al. | 2405.07578 | null |
2024-05-13 | Dehazing Remote Sensing and UAV Imagery: A Review of Deep Learning, Prior-based, and Hybrid Approaches | Gao Yu Lee et.al. | 2405.07520 | null |
2024-05-12 | Understanding and Evaluating Human Preferences for AI Generated Images with Instruction Tuning | Jiarui Wang et.al. | 2405.07346 | link |
2024-05-12 | PotatoGANs: Utilizing Generative Adversarial Networks, Instance Segmentation, and Explainable AI for Enhanced Potato Disease Identification and Classification | Mohammad Shafiul Alam et.al. | 2405.07332 | link |
2024-05-12 | Stable Signature is Unstable: Removing Image Watermark from Diffusion Models | Yuepeng Hu et.al. | 2405.07145 | null |
2024-05-11 | Large Language Model-aided Edge Learning in Distribution System State Estimation | Renyou Xie et.al. | 2405.06999 | null |
2024-05-15 | Generation of Granular-Balls for Clustering Based on the Principle of Justifiable Granularity | Zihang Jia et.al. | 2405.06904 | null |
2024-05-11 | FineParser: A Fine-grained Spatio-temporal Action Parser for Human-centric Action Quality Assessment | Jinglin Xu et.al. | 2405.06887 | link |
2024-05-10 | Multi-Object Tracking in the Dark | Xinzhe Wang et.al. | 2405.06600 | link |
2024-05-10 | Compression-Realized Deep Structural Network for Video Quality Enhancement | Hanchi Sun et.al. | 2405.06342 | null |
2024-05-09 | Perceptual Crack Detection for Rendered 3D Textured Meshes | Armin Shafiee Sarvestani et.al. | 2405.06143 | link |
2024-05-09 | Distilling Diffusion Models into Conditional GANs | Minguk Kang et.al. | 2405.05967 | null |
2024-05-09 | How Quality Affects Deep Neural Networks in Fine-Grained Image Classification | Joseph Smith et.al. | 2405.05742 | null |
2024-05-09 | LatentColorization: Latent Diffusion-Based Speaker Video Colorization | Rory Ward et.al. | 2405.05707 | null |
2024-05-09 | SwapTalk: Audio-Driven Talking Face Generation with One-Shot Customization in Latent Space | Zeren Zhang et.al. | 2405.05636 | null |
2024-05-09 | Array SAR 3D Sparse Imaging Based on Regularization by Denoising Under Few Observed Data | Yangyang Wang et.al. | 2405.05565 | null |
2024-05-08 | Imagine Flash: Accelerating Emu Diffusion Models with Backward Distillation | Jonas Kohler et.al. | 2405.05224 | null |
2024-05-08 | Bridging the Gap Between Saliency Prediction and Image Quality Assessment | Kirillov Alexey et.al. | 2405.04997 | link |
2024-05-07 | Remote Diffusion | Kunal Sunil Kasodekar et.al. | 2405.04717 | null |
2024-05-07 | Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications | Markus Hillemann et.al. | 2405.04345 | null |
2024-05-07 | Audio-Visual Speech Representation Expert for Enhanced Talking Face Video Generation and Evaluation | Dogucan Yaman et.al. | 2405.04327 | null |
2024-05-07 | Cross-IQA: Unsupervised Learning for Image Quality Assessment | Zhen Zhang et.al. | 2405.04311 | null |
2024-05-07 | Sora Detector: A Unified Hallucination Detection for Large Text-to-Video Models | Zhixuan Chu et.al. | 2405.04180 | link |
2024-05-07 | Bridging the Synthetic-to-Authentic Gap: Distortion-Guided Unsupervised Domain Adaptation for Blind Image Quality Assessment | Aobo Li et.al. | 2405.04167 | null |
2024-05-07 | Lossy Compression with Data, Perception, and Classification Constraints | Yuhan Wang et.al. | 2405.04144 | null |
2024-05-07 | Joint Estimation of Identity Verification and Relative Pose for Partial Fingerprints | Xiongjun Guan et.al. | 2405.03959 | link |
2024-05-06 | AI-Driven Frameworks for Enhancing Data Quality in Big Data Ecosystems: Error_Detection, Correction, and Metadata Integration | Widad Elouataoui et.al. | 2405.03870 | null |
2024-05-06 | Accelerated MR Cholangiopancreatography with Deep Learning-based Reconstruction | Jinho Kim et.al. | 2405.03732 | null |
2024-05-06 | All-in-One Deep Learning Framework for MR Image Reconstruction | Geunu Jeong et.al. | 2405.03684 | null |
2024-05-06 | An Image Quality Evaluation and Masking Algorithm Based On Pre-trained Deep Neural Networks | Peng Jia et.al. | 2405.03408 | null |
2024-05-06 | Retinexmamba: Retinex-based Mamba for Low-light Image Enhancement | Jiesong Bai et.al. | 2405.03349 | link |
2024-05-06 | Light-VQA+: A Video Quality Assessment Model for Exposure Correction with Vision-Language Guidance | Xunchu Zhou et.al. | 2405.03333 | link |
2024-05-06 | Multi-Modality Spatio-Temporal Forecasting via Self-Supervised Learning | Jiewen Deng et.al. | 2405.03255 | link |
2024-05-05 | Matten: Video Generation with Mamba-Attention | Yu Gao et.al. | 2405.03025 | null |
2024-05-05 | Design, analysis, and manufacturing of a glass-plastic hybrid minimalist aspheric panoramic annular lens | Shaohua Gao et.al. | 2405.02942 | null |
2024-05-05 | Residual-Conditioned Optimal Transport: Towards Structure-preserving Unpaired and Paired Image Restoration | Xiaole Tang et.al. | 2405.02843 | link |
2024-05-04 | Deep Image Restoration For Image Anti-Forensics | Eren Tahir et.al. | 2405.02751 | link |
2024-05-04 | DiffuseTrace: A Transparent and Flexible Watermarking Scheme for Latent Diffusion Model | Liangqi Lei et.al. | 2405.02696 | null |
2024-05-03 | On the test-time zero-shot generalization of vision-language models: Do we really need prompt learning? | Maxime Zanella et.al. | 2405.02266 | link |
2024-05-01 | Reference-Free Image Quality Metric for Degradation and Reconstruction Artifacts | Han Cui et.al. | 2405.02208 | null |
2024-05-03 | HoloGS: Instant Depth-based 3D Gaussian Splatting with Microsoft HoloLens 2 | Miriam Jäger et.al. | 2405.02005 | null |
2024-05-03 | Report on the AAPM Grand Challenge on deep generative modeling for learning medical image statistics | Rucha Deshpande et.al. | 2405.01822 | null |
2024-05-07 | Towards Inclusive Face Recognition Through Synthetic Ethnicity Alteration | Praveen Kumar Chandaliya et.al. | 2405.01273 | null |
2024-05-02 | Singular Value and Frame Decomposition-based Reconstruction for Atmospheric Tomography | Lukas Weissinger et.al. | 2405.01079 | null |
2024-05-01 | Brighteye: Glaucoma Screening with Color Fundus Photographs based on Vision Transformer | Hui Lin et.al. | 2405.00857 | link |
2024-05-01 | Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models | Xiaoshi Wu et.al. | 2405.00760 | null |
2024-05-01 | Adapting Pretrained Networks for Image Quality Assessment on High Dynamic Range Displays | Andrei Chubarau et.al. | 2405.00670 | link |
2024-05-01 | Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning | Yuxi Xie et.al. | 2405.00451 | link |
2024-04-30 | Fast MRI Reconstruction Using Deep Learning-based Compressed Sensing: A Systematic Review | Mojtaba Safari et.al. | 2405.00241 | link |
2024-04-30 | Charting the Path Forward: CT Image Quality Assessment – An In-Depth Review | Siyi Xun et.al. | 2405.00075 | null |
2024-04-30 | Beyond MOS: Subjective Image Quality Score Preprocessing Method Based on Perceptual Similarity | Lei Wang et.al. | 2404.19666 | null |
2024-04-30 | Perceptual Constancy Constrained Single Opinion Score Calibration for Image Quality Assessment | Lei Wang et.al. | 2404.19595 | null |
2024-04-30 | Causal Perception Inspired Representation Learning for Trustworthy Image Quality Assessment | Lei Wang et.al. | 2404.19567 | null |
2024-05-04 | Towards Real-world Video Face Restoration: A New Benchmark | Ziyan Chen et.al. | 2404.19500 | null |
2024-04-30 | NeRF-Insert: 3D Local Editing with Multimodal Control Signals | Benet Oriol Sabat et.al. | 2404.19204 | null |
2024-04-30 | Global Search Optics: Automatically Exploring Optimal Solutions to Compact Computational Imaging Systems | Yao Gao et.al. | 2404.19201 | null |
2024-04-30 | Advancing low-field MRI with a universal denoising imaging transformer: Towards fast and high-quality imaging | Zheren Zhu et.al. | 2404.19167 | link |
2024-04-29 | A Comprehensive Rubric for Annotating Pathological Speech | Mario Corrales-Astorgano et.al. | 2404.18851 | null |
2024-04-29 | Autonomous Quality and Hallucination Assessment for Virtual Tissue Staining and Digital Pathology | Luzhe Huang et.al. | 2404.18458 | null |
2024-04-29 | PKU-AIGIQA-4K: A Perceptual Quality Assessment Database for Both Text-to-Image and Image-to-Image AI-Generated Images | Jiquan Yuan et.al. | 2404.18409 | link |
2024-04-29 | G-Refine: A General Quality Refiner for Text-to-Image Generation | Chunyi Li et.al. | 2404.18343 | link |
2024-04-28 | An automated pipeline for computation and analysis of functional ventilation and perfusion lung MRI with matrix pencil decomposition: TrueLung | Orso Pusterla et.al. | 2404.18275 | null |
2024-04-28 | LMM-PCQA: Assisting Point Cloud Quality Assessment with LMM | Zicheng Zhang et.al. | 2404.18203 | link |
2024-04-28 | Assessing Image Quality Using a Simple Generative Representation | Simon Raviv et.al. | 2404.18178 | link |
2024-04-28 | fMRI Exploration of Visual Quality Assessment | Yiming Zhang et.al. | 2404.18162 | null |
2024-04-27 | Quality Estimation with $k$ -nearest Neighbors and Automatic Evaluation for Model-specific Quality Estimation | Tu Anh Dinh et.al. | 2404.18031 | null |
2024-04-27 | LpQcM: Adaptable Lesion-Quantification-Consistent Modulation for Deep Learning Low-Count PET Image Denoising | Menghua Xia et.al. | 2404.17994 | null |
2024-04-27 | From Optimization to Generalization: Fair Federated Learning against Quality Shift via Inter-Client Sharpness Matching | Nannan Wu et.al. | 2404.17805 | link |
2024-04-27 | Large Multi-modality Model Assisted AI-Generated Image Quality Assessment | Puyi Wang et.al. | 2404.17762 | link |
2024-04-27 | Segmentation Quality and Volumetric Accuracy in Medical Imaging | Zheyuan Zhang et.al. | 2404.17742 | null |
2024-04-27 | Diffusion-Aided Joint Source Channel Coding For High Realism Wireless Image Transmission | Mingyu Yang et.al. | 2404.17736 | link |
2024-04-26 | Attention-aware non-rigid image registration for accelerated MR imaging | Aya Ghoul et.al. | 2404.17621 | link |
2024-04-26 | Low Cost Machine Vision for Insect Classification | Danja Brandt et.al. | 2404.17488 | null |
2024-04-26 | S-IQA Image Quality Assessment With Compressive Sampling | Ronghua Liao et.al. | 2404.17170 | null |
2024-04-25 | ResVR: Joint Rescaling and Viewport Rendering of Omnidirectional Images | Weiqi Li et.al. | 2404.16825 | null |
2024-04-25 | NTIRE 2024 Quality Assessment of AI-Generated Content Challenge | Xiaohong Liu et.al. | 2404.16687 | null |
2024-04-25 | Multimodal Semantic-Aware Automatic Colorization with Diffusion Prior | Han Wang et.al. | 2404.16678 | null |
2024-04-25 | Application of RESNET50 Convolution Neural Network for the Extraction of Optical Parameters in Scattering Media | Bowen Deng et.al. | 2404.16647 | null |
2024-04-25 | COBRA – COnfidence score Based on shape Regression Analysis for method-independent quality assessment of object pose estimation from single images | Panagiotis Sapoutzoglou et.al. | 2404.16471 | link |
2024-04-25 | PAD: Patch-Agnostic Defense against Adversarial Patch Attacks | Lihua Jing et.al. | 2404.16452 | link |
2024-04-25 | Cross-sensor super-resolution of irregularly sampled Sentinel-2 time series | Aimi Okabayashi et.al. | 2404.16409 | link |
2024-04-24 | AIS 2024 Challenge on Video Quality Assessment of User-Generated Content: Methods and Results | Marcos V. Conde et.al. | 2404.16205 | link |
2024-04-24 | Quantitative Characterization of Retinal Features in Translated OCTA | Rashadul Hasan Badhon et.al. | 2404.16133 | null |
2024-04-24 | Assessment of the quality of a prediction | Roger Sewell et.al. | 2404.15764 | null |
2024-04-24 | A stochastic approach to estimate distribution grid state with confidence regions | Rasmus L. Olsen et.al. | 2404.15722 | null |
2024-04-24 | Deep Learning for Accelerated and Robust MRI Reconstruction: a Review | Reinhard Heckel et.al. | 2404.15692 | null |
2024-04-24 | Neural network-based recognition of multiple nanobubbles in graphene | Subin Kim et.al. | 2404.15658 | null |
2024-04-24 | PriorNet: A Novel Lightweight Network with Multidimensional Interactive Attention for Efficient Image Dehazing | Yutong Chen et.al. | 2404.15638 | null |
2024-04-24 | Direct Zernike Coefficient Prediction from Point Spread Functions and Extended Images using Deep Learning | Yong En Kok et.al. | 2404.15231 | null |
2024-04-23 | Adaptive Mixed-Scale Feature Fusion Network for Blind AI-Generated Image Quality Assessment | Tianwei Zhou et.al. | 2404.15163 | null |
2024-04-23 | Multi-Modal Prompt Learning on Blind Image Quality Assessment | Wensheng Pan et.al. | 2404.14949 | link |
2024-04-23 | Novel Topological Machine Learning Methodology for Stream-of-Quality Modeling in Smart Manufacturing | Jay Lee et.al. | 2404.14728 | null |
2024-04-22 | Tile-Weighted Rate-Distortion Optimized Packet Scheduling for 360 $^\circ$ VR Video Streaming | Haopeng Wang et.al. | 2404.14573 | null |
2024-04-25 | Adaptive Local Binary Pattern: A Novel Feature Descriptor for Enhanced Analysis of Kidney Abnormalities in CT Scan Images using ensemble based Machine Learning Approach | Tahmim Hossain et.al. | 2404.14560 | null |
2024-04-22 | Narrative Action Evaluation with Prompt-Guided Multimodal Interaction | Shiyi Zhang et.al. | 2404.14471 | link |
2024-04-22 | CrossScore: Towards Multi-View Image Evaluation and Scoring | Zirui Wang et.al. | 2404.14409 | null |
2024-04-22 | Experimental Validation of Ultrasound Beamforming with End-to-End Deep Learning for Single Plane Wave Imaging | Ryan A. L. Schoop et.al. | 2404.14188 | link |
2024-04-22 | Text in the Dark: Extremely Low-Light Text Image Enhancement | Che-Tsung Lin et.al. | 2404.14135 | null |
2024-04-22 | CRNet: A Detail-Preserving Network for Unified Image Restoration and Enhancement Task | Kangzhen Yang et.al. | 2404.14132 | link |
2024-04-22 | GaussianTalker: Speaker-specific Talking Head Synthesis via 3D Gaussian Splatting | Hongyun Yu et.al. | 2404.14037 | null |
2024-04-22 | CoFInAl: Enhancing Action Quality Assessment with Coarse-to-Fine Instruction Alignment | Kanglei Zhou et.al. | 2404.13999 | link |
2024-04-22 | SI-FID: Only One Objective Indicator for Evaluating Stitched Images | Xinrui Zhang et.al. | 2404.13905 | null |
2024-04-21 | ArtNeRF: A Stylized Neural Field for 3D-Aware Cartoonized Face Synthesis | Zichen Tang et.al. | 2404.13711 | link |
2024-04-21 | Exploring AIGC Video Quality: A Focus on Visual Harmony, Video-Text Consistency and Domain Distribution Gap | Bowen Qu et.al. | 2404.13573 | link |
2024-04-21 | Cell Phone Image-Based Persian Rice Detection and Classification Using Deep Learning Techniques | Mahmood Saeedi kelishami et.al. | 2404.13555 | null |
2024-04-20 | Joint Quality Assessment and Example-Guided Image Processing by Disentangling Picture Appearance from Content | Abhinau K. Venkataramanan et.al. | 2404.13484 | null |
2024-04-20 | Cut-FUNQUE: An Objective Quality Model for Compressed Tone-Mapped High Dynamic Range Videos | Abhinau K. Venkataramanan et.al. | 2404.13452 | null |
2024-04-20 | HybridFlow: Infusing Continuity into Masked Codebook for Extreme Low-Bitrate Image Compression | Lei Lu et.al. | 2404.13372 | null |
2024-04-20 | PCQA: A Strong Baseline for AIGC Quality Assessment Based on Prompt Condition | Xi Fang et.al. | 2404.13299 | null |
2024-04-20 | Beyond Score Changes: Adversarial Attack on No-Reference Image Quality Assessment from Two Perspectives | Chenxi Yang et.al. | 2404.13277 | null |
2024-04-19 | A New Multi-Picture Architecture for Learned Video Deinterlacing and Demosaicing with Parallel Deformable Convolution and Self-Attention Blocks | Ronglei Ji et.al. | 2404.13018 | link |
2024-04-19 | RadRotator: 3D Rotation of Radiographs with Diffusion Models | Pouria Rouzrokh et.al. | 2404.13000 | null |
2024-04-19 | Nuclei Instance Segmentation of Cryosectioned H&E Stained Histological Images using Triple U-Net Architecture | Zarif Ahmed et.al. | 2404.12986 | null |
2024-04-19 | FlyNeRF: NeRF-Based Aerial Mapping for High-Quality 3D Scene Reconstruction | Maria Dronova et.al. | 2404.12970 | null |
2024-04-19 | 3D Multi-frame Fusion for Video Stabilization | Zhan Peng et.al. | 2404.12887 | null |
2024-04-19 | ELEV-VISION-SAM: Integrated Vision Language and Foundation Model for Automated Estimation of Building Lowest Floor Elevation | Yu-Hsuan Ho et.al. | 2404.12606 | null |
2024-04-18 | Plane-wave compounding with adaptive joint coherence factor weighting | Nikunj Khetan et.al. | 2404.12533 | link |
2024-04-18 | Advancing Applications of Satellite Photogrammetry: Novel Approaches for Built-up Area Modeling and Natural Environment Monitoring using Stereo/Multi-view Satellite Image-derived 3D Data | Shengxi Gui et.al. | 2404.12487 | null |
2024-04-18 | On the Content Bias in Fréchet Video Distance | Songwei Ge et.al. | 2404.12391 | null |
2024-04-18 | Learning the Domain Specific Inverse NUFFT for Accelerated Spiral MRI using Diffusion Models | Trevor J. Chan et.al. | 2404.12361 | null |
2024-04-18 | GraFIQs: Face Image Quality Assessment Using Gradient Magnitudes | Jan Niklas Kolf et.al. | 2404.12203 | link |
2024-04-18 | Ethical-Lens: Curbing Malicious Usages of Open-Source Text-to-Image Models | Yuzhu Cai et.al. | 2404.12104 | null |
2024-04-18 | Seeing Motion at Nighttime with an Event Camera | Haoyue Liu et.al. | 2404.11884 | link |
2024-04-18 | Automated tomographic assessment of structural defects of freeze-dried pharmaceuticals | Patric Müller et.al. | 2404.11867 | null |
2024-04-18 | Multiphoton super-resolution imaging via virtual structured illumination | Sumin Lim et.al. | 2404.11849 | null |
2024-04-17 | Analysis of blurring due to short T2 decay at different resolutions in 23Na MRI | Olga Dergachyova et.al. | 2404.11774 | null |
2024-04-17 | CarcassFormer: An End-to-end Transformer-based Framework for Simultaneous Localization, Segmentation and Classification of Poultry Carcass Defect | Minh Tran et.al. | 2404.11429 | null |
2024-04-17 | Achromatic Full Stokes Polarimetry Metasurface for Full-color Polarization Imaging in the Visible | Yueqiang Hu et.al. | 2404.11415 | null |
2024-04-17 | Toward Understanding the Disagreement Problem in Neural Network Feature Attribution | Niklas Koenen et.al. | 2404.11330 | link |
2024-04-17 | NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment: Methods and Results | Xin Li et.al. | 2404.11313 | link |
2024-04-18 | Study on the static detection of ICF target based on muonic X-ray sphere encoded imaging | Dikai Li et.al. | 2404.11278 | null |
2024-04-17 | Optical Image-to-Image Translation Using Denoising Diffusion Models: Heterogeneous Change Detection as a Use Case | João Gabriel Vinholi et.al. | 2404.11243 | null |
2024-04-17 | ONOT: a High-Quality ICAO-compliant Synthetic Mugshot Dataset | Nicolò Di Domenico et.al. | 2404.11236 | null |
2024-04-17 | Deep Portrait Quality Assessment. A NTIRE 2024 Challenge Survey | Nicolas Chahine et.al. | 2404.11159 | link |
2024-04-17 | MaeFuse: Transferring Omni Features with Pretrained Masked Autoencoders for Infrared and Visible Image Fusion via Guided Training | Jiayang Li et.al. | 2404.11016 | null |
2024-04-16 | Efficient Conditional Diffusion Model with Probability Flow Sampling for Image Super-resolution | Yutao Yuan et.al. | 2404.10688 | link |
2024-04-16 | VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time | Sicheng Xu et.al. | 2404.10667 | null |
2024-04-16 | A Computer Vision-Based Quality Assessment Technique for the automatic control of consumables for analytical laboratories | Meriam Zribi et.al. | 2404.10454 | null |
2024-04-16 | OneActor: Consistent Character Generation via Cluster-Conditioned Guidance | Jiahao Wang et.al. | 2404.10267 | null |
2024-04-16 | Diffusion assisted image reconstruction in optoacoustic tomography | M. G. González et.al. | 2404.10239 | null |
2024-04-16 | Novel Method to Estimate Kinetic Microparameters from Dynamic Whole-Body Imaging in Regular-Axial Field-of-View PET Scanners | Kyung-Nam Lee et.al. | 2404.10197 | null |
2024-04-15 | Quality Assessment of Prompts Used in Code Generation | Mohammed Latif Siddiq et.al. | 2404.10155 | null |
2024-04-15 | ANCHOR: LLM-driven News Subject Conditioning for Text-to-Image Synthesis | Aashish Anantha Ramakrishnan et.al. | 2404.10141 | link |
2024-04-15 | Ti-Patch: Tiled Physical Adversarial Patch for no-reference video quality metrics | Victoria Leonenkova et.al. | 2404.09961 | link |
2024-04-15 | The Problem Of Image Super-Resolution, Denoising And Some Image Restoration Methods In Deep Learning Models | Ngoc-Giau Pham et.al. | 2404.09817 | null |
2024-04-15 | Language-Agnostic Modeling of Wikipedia Articles for Content Quality Assessment across Languages | Paramita Das et.al. | 2404.09764 | null |
2024-04-15 | Equipping Diffusion Models with Differentiable Spatial Entropy for Low-Light Image Enhancement | Wenyi Lian et.al. | 2404.09735 | link |
2024-04-15 | AI Competitions and Benchmarks: Dataset Development | Romain Egele et.al. | 2404.09703 | null |
2024-04-15 | Are Large Language Models Reliable Argument Quality Annotators? | Nailia Mirzakhmedova et.al. | 2404.09696 | link |
2024-04-15 | Real-world Instance-specific Image Goal Navigation for Service Robots: Bridging the Domain Gap with Contrastive Learning | Taichi Sakaguchi et.al. | 2404.09645 | null |
2024-04-15 | AI-KD: Towards Alignment Invariant Face Image Quality Assessment Using Knowledge Distillation | Žiga Babnik et.al. | 2404.09555 | link |
2024-04-15 | WiTUnet: A U-Shaped Architecture Integrating CNN and Transformer for Improved Feature Alignment and Local Information Fusion | Bin Wang et.al. | 2404.09533 | link |
2024-04-15 | MarsQE: Semantic-Informed Quality Enhancement for Compressed Martian Image | Chengfeng Liu et.al. | 2404.09433 | null |
2024-04-14 | Exploring Generative AI for Sim2Real in Driving Data Synthesis | Haonan Zhao et.al. | 2404.09111 | null |
2024-04-13 | A Parametric Rate-Distortion Model for Video Transcoding | Maedeh Jamali et.al. | 2404.09029 | null |
2024-04-13 | THQA: A Perceptual Quality Assessment Database for Talking Heads | Yingjie Zhou et.al. | 2404.09003 | link |
2024-04-13 | PNeRV: Enhancing Spatial Consistency via Pyramidal Neural Representation for Videos | Qi Zhao et.al. | 2404.08921 | null |
2024-04-12 | Multi-Branch Generative Models for Multichannel Imaging with an Application to PET/CT Joint Reconstruction | Noel Jeffrey Pinton et.al. | 2404.08748 | null |
2024-04-12 | Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation | Yanhao Zheng et.al. | 2404.08603 | link |
2024-04-12 | Self-Supervised k-Space Regularization for Motion-Resolved Abdominal MRI Using Neural Implicit k-Space Representation | Veronika Spieker et.al. | 2404.08350 | link |
2024-04-11 | Model-based Cleaning of the QUILT-1M Pathology Dataset for Text-Conditional Image Synthesis | Marc Aubreville et.al. | 2404.07676 | link |
2024-04-10 | GoodDrag: Towards Good Practices for Drag Editing with Diffusion Models | Zewei Zhang et.al. | 2404.07206 | null |
2024-04-10 | Adversarial purification for no-reference image-quality metrics: applicability study and new methods | Aleksandr Gushchin et.al. | 2404.06957 | null |
2024-04-10 | Perception-Oriented Video Frame Interpolation via Asymmetric Blending | Guangyang Wu et.al. | 2404.06692 | link |
2024-04-10 | CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs’ (Lack of) Multicultural Knowledge | Yu Ying Chiu et.al. | 2404.06664 | null |
2024-04-09 | Encoder-Quantization-Motion-based Video Quality Metrics | Yixu Chen et.al. | 2404.06620 | null |
2024-04-09 | Low-Cost Generation and Evaluation of Dictionary Example Sentences | Bill Cai et.al. | 2404.06224 | null |
2024-04-09 | Image and Video Compression using Generative Sparse Representation with Fidelity Controls | Wei Jiang et.al. | 2404.06076 | null |
2024-04-09 | Prompt-driven Universal Model for View-Agnostic Echocardiography Analysis | Sekeun Kim et.al. | 2404.05916 | null |
2024-04-06 | Study of the effect of Sharpness on Blind Video Quality Assessment | Anantha Prabhu et.al. | 2404.05764 | null |
2024-04-08 | A Training-Free Plug-and-Play Watermark Framework for Stable Diffusion | Guokai Zhang et.al. | 2404.05607 | null |
2024-04-08 | UniFL: Improve Stable Diffusion via Unified Feedback Learning | Jiacheng Zhang et.al. | 2404.05595 | null |
2024-04-08 | Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance | Dazhong Shen et.al. | 2404.05384 | link |
2024-04-08 | Mask-ControlNet: Higher-Quality Image Generation with An Additional Mask Prompt | Zhiqi Huang et.al. | 2404.05331 | null |
2024-04-08 | Progressive Alignment with VLM-LLM Feature to Augment Defect Classification for the ASE Dataset | Chih-Chung Hsu et.al. | 2404.05183 | null |
2024-04-08 | QMix: Quality-aware Learning with Mixed Noise for Robust Retinal Disease Diagnosis | Junlin Hou et.al. | 2404.05169 | null |
2024-04-07 | Data Conditioning for Subsurface Models with Single-Image Generative Adversarial Network (SinGAN) | Lei Liu et.al. | 2404.05068 | null |
2024-04-07 | LOGO: A Long-Form Video Dataset for Group Action Quality Assessment | Shiyi Zhang et.al. | 2404.05029 | link |
2024-04-07 | Dual-Scale Transformer for Large-Scale Single-Pixel Imaging | Gang Qu et.al. | 2404.05001 | link |
2024-04-07 | Correcting Diffusion-Based Perceptual Image Compression with Privileged End-to-End Decoder | Yiyang Ma et.al. | 2404.04916 | null |
2024-04-07 | CodecNeRF: Toward Fast Encoding and Decoding, Compact, and High-quality Novel-view Synthesis | Gyeongjin Kang et.al. | 2404.04913 | null |
2024-04-07 | CycleINR: Cycle Implicit Neural Representation for Arbitrary-Scale Volumetric Super-Resolution of Medical Data | Wei Fang et.al. | 2404.04878 | null |
2024-04-07 | Light the Night: A Multi-Condition Diffusion Framework for Unpaired Low-Light Enhancement in Autonomous Driving | Jinlong Li et.al. | 2404.04804 | null |
2024-04-06 | Convolutional Neural Network Transformer (CNNT) for Fluorescence Microscopy image Denoising with Improved Generalization and Fast Adaptation | Azaan Rehman et.al. | 2404.04726 | null |
2024-04-09 | Computation and Critical Transitions of Rate-Distortion-Perception Functions With Wasserstein Barycenter | Chunhui Chen et.al. | 2404.04681 | null |
2024-04-06 | FastHDRNet: A new efficient method for SDR-to-HDR Translation | Siyuan Tian et.al. | 2404.04483 | null |
2024-04-06 | RoNet: Rotation-oriented Continuous Image Translation | Yi Li et.al. | 2404.04474 | null |
2024-04-05 | Physics-Inspired Synthesized Underwater Image Dataset | Reina Kaneko et.al. | 2404.03998 | null |
2024-04-05 | Towards introspective loop closure in 4D radar SLAM | Maximilian Hilger et.al. | 2404.03940 | null |
2024-04-07 | RaFE: Generative Radiance Fields Restoration | Zhongkai Wu et.al. | 2404.03654 | null |
2024-04-04 | AIGIQA-20K: A Large Database for AI-Generated Image Quality Assessment | Chunyi Li et.al. | 2404.03407 | null |
2024-04-04 | DI-Retinex: Digital-Imaging Retinex Theory for Low-Light Image Enhancement | Shangquan Sun et.al. | 2404.03327 | null |
2024-04-04 | CSR-dMRI: Continuous Super-Resolution of Diffusion MRI with Anatomical Structure-assisted Implicit Neural Representation Learning | Ruoyou Wu et.al. | 2404.03209 | null |
2024-04-02 | Jailbreaking Prompt Attack: A Controllable Adversarial Attack against Diffusion Models | Jiachen Ma et.al. | 2404.02928 | null |
2024-04-03 | Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction | Keyu Tian et.al. | 2404.02905 | link |
2024-04-03 | Optimizing traffic signs and lights visibility for the teleoperation of autonomous vehicles through ROI compression | I. Dror et.al. | 2404.02481 | null |
2024-04-03 | Imaging transformer for MRI denoising with the SNR unit training: enabling generalization across field-strengths, imaging contrasts, and anatomy | Hui Xue et.al. | 2404.02382 | null |
2024-04-02 | DSGNN: A Dual-View Supergrid-Aware Graph Neural Network for Regional Air Quality Estimation | Xin Zhang et.al. | 2404.01975 | null |
2024-04-02 | Event-assisted Low-Light Video Object Segmentation | Hebei Li et.al. | 2404.01945 | link |
2024-04-02 | PATCH – Psychometrics-AssisTed benCHmarking of Large Language Models: A Case Study of Mathematics Proficiency | Qixiang Fang et.al. | 2404.01799 | link |
2024-04-02 | Super-Resolution Analysis for Landfill Waste Classification | Matias Molina et.al. | 2404.01790 | null |
2024-04-02 | Upsample Guidance: Scale Up Diffusion Models without Training | Juno Hwang et.al. | 2404.01709 | null |
2024-04-02 | Boosting Visual Recognition for Autonomous Driving in Real-world Degradations with Deep Channel Prior | Zhanwen Liu et.al. | 2404.01703 | link |
2024-04-02 | A CT Image Denoising Method with Residual Encoder-Decoder Network | Helena Shawn et.al. | 2404.01553 | null |
2024-04-01 | MagicMirror: Fast and High-Quality Avatar Generation with a Constrained Search Space | Armand Comas-Massagué et.al. | 2404.01296 | null |
2024-04-01 | New infrared camera of the Caucasian Mountain Observatory of the SAI MSU: design, main parameters, and first light | S. G. Zheltoukhov et.al. | 2404.01246 | null |
2024-04-01 | The Rate-Distortion-Perception Trade-off: The Role of Private Randomness | Yassine Hamdi et.al. | 2404.01111 | null |
2024-04-01 | AIGCOIQA2024: Perceptual Quality Assessment of AI Generated Omnidirectional Images | Liu Yang et.al. | 2404.01024 | link |
2024-04-01 | Digital Twins for Supporting AI Research with Autonomous Vehicle Networks | Anıl Gürses et.al. | 2404.00954 | null |
2024-04-01 | Towards Memorization-Free Diffusion Models | Chen Chen et.al. | 2404.00922 | null |
2024-04-01 | Model-Agnostic Human Preference Inversion in Diffusion Models | Jeeyung Kim et.al. | 2404.00879 | null |
2024-03-31 | GAMA-IR: Global Additive Multidimensional Averaging for Fast Image Restoration | Youssef Mansour et.al. | 2404.00807 | null |
2024-03-31 | Personalized Neural Speech Codec | Inseon Jang et.al. | 2404.00791 | null |
2024-04-02 | DRCT: Saving Image Super-resolution away from Information Bottleneck | Chih-Chung Hsu et.al. | 2404.00722 | link |
2024-03-30 | Classification of Short Segment Pediatric Heart Sounds Based on a Transformer-Based Convolutional Neural Network | Md Hassanuzzaman et.al. | 2404.00470 | null |
2024-03-30 | Harmonizing Light and Darkness: A Symphony of Prior-guided Data Synthesis and Adaptive Focus for Nighttime Flare Removal | Lishen Qu et.al. | 2404.00313 | null |
2024-03-30 | Learned Scanpaths Aid Blind Panoramic Video Quality Assessment | Kanglong Fan et.al. | 2404.00252 | link |
2024-03-29 | Evolving Semantic Communication with Generative Model | Shunpu Tang et.al. | 2403.20237 | link |
2024-03-29 | Exploring Pathological Speech Quality Assessment with ASR-Powered Wav2Vec2 in Data-Scarce Context | Tuan Nguyen et.al. | 2403.20184 | null |
2024-03-29 | Unsupervised Tumor-Aware Distillation for Multi-Modal Brain Image Translation | Chuan Huang et.al. | 2403.20168 | link |
2024-03-29 | DerainNeRF: 3D Scene Estimation with Adhesive Waterdrop Removal | Yunhao Li et.al. | 2403.20013 | link |
2024-03-28 | Burst Super-Resolution with Diffusion Models for Improving Perceptual Quality | Kyotaro Tokoro et.al. | 2403.19428 | link |
2024-03-28 | Imperceptible Protection against Style Imitation from Diffusion Models | Namhyuk Ahn et.al. | 2403.19254 | null |
2024-03-28 | DreamSalon: A Staged Diffusion Framework for Preserving Identity-Context in Editable Face Generation | Haonan Lin et.al. | 2403.19235 | null |
2024-03-28 | AAPMT: AGI Assessment Through Prompt and Metric Transformer | Benhao Huang et.al. | 2403.19101 | link |
2024-03-27 | TextCraftor: Your Text Encoder Can be Image Quality Controller | Yanyu Li et.al. | 2403.18978 | null |
2024-03-27 | Breaking the Limitations with Sparse Inputs by Variational Frameworks (BLIss) in Terahertz Super-Resolution 3D Reconstruction | Yiyao Zhang et.al. | 2403.18776 | link |
2024-03-27 | Bringing Textual Prompt to AI-Generated Image Quality Assessment | Bowen Qu et.al. | 2403.18714 | link |
2024-03-27 | qIoV: A Quantum-Driven Internet-of-Vehicles-Based Approach for Environmental Monitoring and Rapid Response Systems | Ankur Nahar et.al. | 2403.18622 | null |
2024-03-27 | Artifact Reduction in 3D and 4D Cone-beam Computed Tomography Images with Deep Learning – A Review | Mohammadreza Amirian et.al. | 2403.18565 | null |
2024-03-27 | Don’t Look into the Dark: Latent Codes for Pluralistic Image Inpainting | Haiwei Chen et.al. | 2403.18186 | null |
2024-03-26 | Pseudo-MRI-Guided PET Image Reconstruction Method Based on a Diffusion Probabilistic Model | Weijie Gan et.al. | 2403.18139 | null |
2024-03-26 | TDIP: Tunable Deep Image Processing, a Real Time Melt Pool Monitoring Solution | Javid Akhavan et.al. | 2403.18117 | null |
2024-03-26 | Cross-system biological image quality enhancement based on the generative adversarial network as a foundation for establishing a multi-institute microscopy cooperative network | Dominik Panek et.al. | 2403.18026 | null |
2024-03-26 | Improving Text-to-Image Consistency via Automatic Prompt Optimization | Oscar Mañas et.al. | 2403.17804 | null |
2024-03-26 | Can patient-specific acquisition protocol improve performance on defect detection task in myocardial perfusion SPECT? | Nu Ri Choi et.al. | 2403.17764 | null |
2024-03-26 | Panonut360: A Head and Eye Tracking Dataset for Panoramic Video | Yutong Xu et.al. | 2403.17708 | null |
2024-03-26 | AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation | Huawei Wei et.al. | 2403.17694 | link |
2024-03-26 | ExpressEdit: Video Editing with Natural Language and Sketching | Bekzat Tilekbay et.al. | 2403.17693 | null |
2024-03-26 | Practical Applications of Advanced Cloud Services and Generative AI Systems in Medical Image Analysis | Jingyu Xu et.al. | 2403.17549 | null |
2024-03-26 | ChatGPT Rates Natural Language Explanation Quality Like Humans: But on Which Scales? | Fan Huang et.al. | 2403.17368 | link |
2024-03-26 | AutoMRISimQA: an automated system for daily quality control of a 3T MRI simulator | Aitang Xing et.al. | 2403.17365 | null |
2024-03-25 | Latency-Aware Generative Semantic Communications with Pre-Trained Diffusion Models | Li Qiao et.al. | 2403.17256 | null |
2024-03-25 | PROSPECT: Precision Robot Spectroscopy Exploration and Characterization Tool | Nathaniel Hanson et.al. | 2403.17232 | null |
2024-03-25 | Comp4D: LLM-Guided Compositional 4D Scene Generation | Dejia Xu et.al. | 2403.16993 | null |
2024-03-25 | Towards Low-Latency and Energy-Efficient Hybrid P2P-CDN Live Video Streaming | Reza Farahani et.al. | 2403.16985 | null |
2024-03-25 | INPC: Implicit Neural Point Clouds for Radiance Field Rendering | Florian Hahlbohm et.al. | 2403.16862 | null |
2024-03-25 | C-arm inverse geometry CT for 3D cardiac chamber mapping | Jordan M. Slagowski et.al. | 2403.16779 | null |
2024-03-25 | FOOL: Addressing the Downlink Bottleneck in Satellite Computing with Neural Feature Compression | Alireza Furutanpey et.al. | 2403.16677 | link |
2024-03-25 | Enhancing Cross-Dataset EEG Emotion Recognition: A Novel Approach with Emotional EEG Style Transfer Network | Yijin Zhou et.al. | 2403.16540 | null |
2024-03-25 | Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework | Ziyao Huang et.al. | 2403.16510 | link |
2024-03-25 | Plaintext-Free Deep Learning for Privacy-Preserving Medical Image Analysis via Frequency Information Embedding | Mengyu Sun et.al. | 2403.16473 | null |
2024-03-25 | Residual Dense Swin Transformer for Continuous Depth-Independent Ultrasound Imaging | Jintong Hu et.al. | 2403.16384 | link |
2024-03-24 | Laplacian-guided Entropy Model in Neural Codec with Blur-dissipated Synthesis | Atefeh Khoshkhahtinat et.al. | 2403.16258 | null |
2024-03-24 | Passive Screen-to-Camera Communication | Seyed Keyarash Ghiasi et.al. | 2403.16185 | null |
2024-03-24 | Argument Quality Assessment in the Age of Instruction-Following Large Language Models | Henning Wachsmuth et.al. | 2403.16084 | null |
2024-03-23 | An edge detection-based deep learning approach for tear meniscus height measurement | Kesheng Wang et.al. | 2403.15853 | null |
2024-03-22 | Medical Image Data Provenance for Medical Cyber-Physical System | Vijay Kumar et.al. | 2403.15522 | null |
2024-03-22 | Time-efficient, high-resolution 3T whole-brain relaxometry using Cartesian 3D MR-STAT with CSF suppression | Hongyan Liu et.al. | 2403.15379 | link |
2024-03-22 | Ultrasound Imaging based on the Variance of a Diffusion Restoration Model | Yuxin Zhang et.al. | 2403.15316 | link |
2024-03-22 | Subjective Quality Assessment of Compressed Tone-Mapped High Dynamic Range Videos | Abhinau K. Venkataramanan et.al. | 2403.15061 | null |
2024-03-21 | On the exploitation of DCT statistics for cropping detectors | Claudio Vittorio Ragaglia et.al. | 2403.14789 | null |
2024-03-21 | From Handcrafted Features to LLMs: A Brief Survey for Machine Translation Quality Estimation | Haofei Zhao et.al. | 2403.14118 | null |
2024-03-20 | Multi-criteria approach for selecting an explanation from the set of counterfactuals produced by an ensemble of explainers | Ignacy Stępka et.al. | 2403.13940 | link |
2024-03-20 | Towards Learning Contrast Kinetics with Multi-Condition Latent Diffusion Models | Richard Osuala et.al. | 2403.13890 | link |
2024-03-20 | Hierarchical NeuroSymbolic Approach for Action Quality Assessment | Lauren Okamoto et.al. | 2403.13798 | link |
2024-03-20 | Step-Calibrated Diffusion for Biomedical Optical Image Restoration | Yiwei Lyu et.al. | 2403.13680 | link |
2024-03-20 | Defining metric-aware size-shape measures to validate and optimize curved high-order meshes | Guillermo Aparicio-Estrems et.al. | 2403.13528 | null |
2024-03-20 | AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in Text-to-Image Generation | Jingkun An et.al. | 2403.13352 | null |
2024-03-20 | Learning Novel View Synthesis from Heterogeneous Low-light Captures | Quan Zheng et.al. | 2403.13337 | null |
2024-03-19 | Super-High-Fidelity Image Compression via Hierarchical-ROI and Adaptive Quantization | Jixiang Luo et.al. | 2403.13030 | null |
2024-03-18 | Invisible Backdoor Attack Through Singular Value Decomposition | Wenmin Chen et.al. | 2403.13018 | null |
2024-03-19 | Toward Sustainable GenAI using Generation Directives for Carbon-Friendly Large Language Model Inference | Baolin Li et.al. | 2403.12900 | null |
2024-03-19 | VisualCritic: Making LMMs Perceive Visual Quality Like Humans | Zhipeng Huang et.al. | 2403.12806 | null |
2024-03-19 | Multi-Dimensional Machine Translation Evaluation: Model Evaluation and Resource for Korean | Dojun Park et.al. | 2403.12666 | link |
2024-03-19 | GaussianFlow: Splatting Gaussian Dynamics for 4D Content Creation | Quankai Gao et.al. | 2403.12365 | null |
2024-03-19 | Deep Few-view High-resolution Photon-counting Extremity CT at Halved Dose for a Clinical Trial | Mengzhou Li et.al. | 2403.12331 | null |
2024-03-18 | Motion and temporal B0 shift corrections for quantitative susceptibility mapping (QSM) and R2* mapping using dual-echo spiral navigators and conjugate-phase reconstruction | Yuguang Meng et.al. | 2403.12230 | null |
2024-03-19 | Generic 3D Diffusion Adapter Using Controlled Multi-View Editing | Hansheng Chen et.al. | 2403.12032 | link |
2024-03-18 | Enhancing Hokkien Dual Translation by Exploring and Standardizing of Four Writing Systems | Bo-Han Lu et.al. | 2403.12024 | link |
2024-03-18 | VideoMV: Consistent Multi-View Generation Based on Large Video Generative Model | Qi Zuo et.al. | 2403.12010 | null |
2024-03-19 | Subjective-Aligned Dateset and Metric for Text-to-Video Quality Assessment | Tengchuan Kou et.al. | 2403.11956 | link |
2024-03-18 | HyperColorization: Propagating spatially sparse noisy spectral clues for reconstructing hyperspectral images | M. Kerem Aydin et.al. | 2403.11935 | link |
2024-03-18 | Evaluating Text to Image Synthesis: Survey and Taxonomy of Image Quality Metrics | Sebastian Hartwig et.al. | 2403.11821 | null |
2024-03-18 | Hallucination in Perceptual Metric-Driven Speech Enhancement Networks | George Close et.al. | 2403.11732 | null |
2024-03-18 | FE-DeTr: Keypoint Detection and Tracking in Low-quality Image Frames with Events | Xiangyuan Wang et.al. | 2403.11662 | link |
2024-03-18 | CRS-Diff: Controllable Generative Remote Sensing Foundation Model | Datao Tang et.al. | 2403.11614 | link |
2024-03-18 | Hierarchical Frequency-based Upsampling and Refining for Compressed Video Quality Enhancement | Qianyu Zhang et.al. | 2403.11556 | null |
2024-03-18 | Fed3DGS: Scalable 3D Gaussian Splatting with Federated Learning | Teppei Suzuki et.al. | 2403.11460 | link |
2024-03-18 | Earth+: on-board satellite imagery compression leveraging historical earth observations | Kuntai Du et.al. | 2403.11434 | null |
2024-03-18 | Defense Against Adversarial Attacks on No-Reference Image Quality Models with Gradient Norm Regularization | Yujia Liu et.al. | 2403.11397 | link |
2024-03-18 | Simulating Wearable Urban Augmented Reality Experiences in VR: Lessons Learnt from Designing Two Future Urban Interfaces | Tram Thi Minh Tran et.al. | 2403.11377 | null |
2024-03-17 | Enhancing Bandwidth Efficiency for Video Motion Transfer Applications using Deep Learning Based Keypoint Prediction | Xue Bai et.al. | 2403.11337 | null |
2024-03-17 | Fidelity-preserving Learning-Based Image Compression: Loss Function and Subjective Evaluation Methodology | Shima Mohammadi et.al. | 2403.11241 | link |
2024-03-17 | Quality-Aware Image-Text Alignment for Real-World Image Quality Assessment | Lorenzo Agnolucci et.al. | 2403.11176 | link |
2024-03-17 | Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model | Dian Zheng et.al. | 2403.11157 | link |
2024-03-17 | Interactive $360^{\circ}$ Video Streaming Using FoV-Adaptive Coding with Temporal Prediction | Yixiang Mao et.al. | 2403.11155 | null |
2024-03-17 | Hierarchical Generative Network for Face Morphing Attacks | Zuyuan He et.al. | 2403.11101 | null |
2024-03-17 | Endora: Video Generation Models as Endoscopy Simulators | Chenxin Li et.al. | 2403.11050 | null |
2024-03-16 | A Spectrum-based Image Denoising Method with Edge Feature Enhancement | Peter Luvton et.al. | 2403.11036 | null |
2024-03-16 | Quality-Aware Dynamic Resolution Adaptation Framework for Adaptive Video Streaming | Amritha Premkumar et.al. | 2403.10976 | link |
2024-03-16 | A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment | Tianhe Wu et.al. | 2403.10854 | link |
2024-03-16 | MicroDiffusion: Implicit Representation-Guided Diffusion for 3D Reconstruction from Limited 2D Microscopy Projections | Mude Hui et.al. | 2403.10815 | link |
2024-03-16 | ContourDiff: Unpaired Image Translation with Contour-Guided Diffusion Models | Yuwen Chen et.al. | 2403.10786 | null |
2024-03-15 | Giving a Hand to Diffusion Models: a Two-Stage Approach to Improving Conditional Human Image Generation | Anton Pelykh et.al. | 2403.10731 | link |
2024-03-15 | EAGLE: An Edge-Aware Gradient Localization Enhanced Loss for CT Image Reconstruction | Yipeng Sun et.al. | 2403.10695 | link |
2024-03-15 | A General Method to Incorporate Spatial Information into Loss Functions for GAN-based Super-resolution Models | Xijun Wang et.al. | 2403.10589 | null |
2024-03-21 | Deep Bi-directional Attention Network for Image Super-Resolution Quality Assessment | Yixiao Li et.al. | 2403.10406 | null |
2024-03-15 | PASTA: Towards Flexible and Efficient HDR Imaging Via Progressively Aggregated Spatio-Temporal Aligment | Xiaoning Liu et.al. | 2403.10376 | null |
2024-03-15 | CPGA: Coding Priors-Guided Aggregation Network for Compressed Video Quality Enhancement | Qiang Zhu et.al. | 2403.10362 | link |
2024-03-15 | Context-Semantic Quality Awareness Network for Fine-Grained Visual Categorization | Qin Xu et.al. | 2403.10298 | null |
2024-03-15 | Arbitrary-Scale Image Generation and Upsampling using Latent Diffusion Model and Implicit Neural Decoder | Jinseok Kim et.al. | 2403.10255 | null |
2024-03-15 | Perceptual Quality-based Model Training under Annotator Label Uncertainty | Chen Zhou et.al. | 2403.10190 | null |
2024-03-15 | Animate Your Motion: Turning Still Images into Dynamic Videos | Mingxiao Li et.al. | 2403.10179 | null |
2024-03-15 | PQDynamicISP: Dynamically Controlled Image Signal Processor for Any Image Sensors Pursuing Perceptual Quality | Masakazu Yoshimura et.al. | 2403.10091 | null |
2024-03-15 | Learning Physical Dynamics for Object-centric Visual Prediction | Huilin Xu et.al. | 2403.10079 | null |
2024-03-15 | Contrastive Pre-Training with Multi-View Fusion for No-Reference Point Cloud Quality Assessment | Ziyu Shan et.al. | 2403.10066 | null |
2024-03-15 | PAME: Self-Supervised Masked Autoencoder for No-Reference Point Cloud Quality Assessment | Ziyu Shan et.al. | 2403.10061 | null |
2024-03-14 | ProMark: Proactive Diffusion Watermarking for Causal Attribution | Vishal Asnani et.al. | 2403.09914 | null |
2024-03-14 | MultiGripperGrasp: A Dataset for Robotic Grasping from Parallel Jaw Grippers to Dexterous Hands | Luis Felipe Casas Murrilo et.al. | 2403.09841 | null |
2024-03-13 | PICNIQ: Pairwise Comparisons for Natural Image Quality Assessment | Nicolas Chahine et.al. | 2403.09746 | link |
2024-03-14 | Renovating Names in Open-Vocabulary Segmentation Benchmarks | Haiwen Huang et.al. | 2403.09593 | null |
2024-03-14 | 3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation | Frank Zhang et.al. | 2403.09439 | null |
2024-03-14 | StainFuser: Controlling Diffusion for Faster Neural Style Transfer in Multi-Gigapixel Histology Images | Robert Jewsbury et.al. | 2403.09302 | link |
2024-03-20 | D-YOLO a robust framework for object detection in adverse weather conditions | Zihan Chu et.al. | 2403.09233 | null |
2024-03-14 | Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts | Byeongjun Park et.al. | 2403.09176 | link |
2024-03-14 | Dial-insight: Fine-tuning Large Language Models with High-Quality Domain-Specific Data Preventing Capability Collapse | Jianwei Sun et.al. | 2403.09167 | null |
2024-03-15 | NTIRE 2023 Image Shadow Removal Challenge Technical Report: Team IIM_TTI | Yuki Kondo et.al. | 2403.08995 | link |
2024-03-13 | Structural Positional Encoding for knowledge integration in transformer-based medical process monitoring | Christopher Irwin et.al. | 2403.08836 | link |
2024-03-13 | VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis | Enric Corona et.al. | 2403.08764 | null |
2024-03-13 | Clinically Feasible Diffusion Reconstruction for Highly-Accelerated Cardiac Cine MRI | Shihan Qiu et.al. | 2403.08749 | null |
2024-03-14 | GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing | Jing Wu et.al. | 2403.08733 | link |
2024-03-13 | Diffusion-based Iterative Counterfactual Explanations for Fetal Ultrasound Image Quality Assessment | Paraskevas Pegios et.al. | 2403.08700 | null |
2024-03-13 | Do Language Models Care About Text Quality? Evaluating Web-Crawled Corpora Across 11 Languages | Rik van Noord et.al. | 2403.08693 | null |
2024-03-13 | Physics-Guided Inverse Regression for Crop Quality Assessment | David Shulman et.al. | 2403.08653 | null |
2024-03-14 | GaussianImage: 1000 FPS Image Representation and Compression by 2D Gaussian Splatting | Xinjie Zhang et.al. | 2403.08551 | link |
2024-03-13 | Masked Generative Story Transformer with Character Guidance and Caption Augmentation | Christos Papadimitriou et.al. | 2403.08502 | link |
2024-03-13 | Gaussian Splatting in Style | Abhishek Saroha et.al. | 2403.08498 | null |
2024-03-13 | Protocol Optimization for Functional Cardiac CT Imaging Using Noise Emulation in the Raw Data Domain | Zhye Yin et.al. | 2403.08486 | null |
2024-03-13 | PFStorer: Personalized Face Restoration and Super-Resolution | Tuomas Varanka et.al. | 2403.08436 | null |
2024-03-13 | AADNet: Attention aware Demoiréing Network | M Rakesh Reddy et.al. | 2403.08384 | null |
2024-03-13 | StyleDyRF: Zero-shot 4D Style Transfer for Dynamic Neural Radiance Fields | Hongbin Xu et.al. | 2403.08310 | link |
2024-03-13 | IG-FIQA: Improving Face Image Quality Assessment through Intra-class Variance Guidance robust to Inaccurate Pseudo-Labels | Minsoo Kim et.al. | 2403.08256 | null |
2024-03-13 | PNeSM: Arbitrary 3D Scene Stylization via Prompt-Based Neural Style Mapping | Jiafu Chen et.al. | 2403.08252 | null |
2024-03-15 | A Dual-domain Regularization Method for Ring Artifact Removal of X-ray CT | Hongyang Zhu et.al. | 2403.08247 | null |
2024-03-12 | Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation | Shihao Zhao et.al. | 2403.07860 | link |
2024-03-18 | BraSyn 2023 challenge: Missing MRI synthesis and the effect of different learning objectives | Ivo M. Baltruschat et.al. | 2403.07800 | null |
2024-03-12 | Optimizing Negative Prompts for Enhanced Aesthetics and Fidelity in Text-To-Image Generation | Michael Ogezi et.al. | 2403.07605 | null |
2024-03-12 | Learning Correction Errors via Frequency-Self Attention for Blind Image Super-Resolution | Haochen Sun et.al. | 2403.07390 | null |
2024-03-12 | Time-Efficient Light-Field Acquisition Using Coded Aperture and Events | Shuji Habuchi et.al. | 2403.07244 | null |
2024-03-10 | Propensity-score matching analysis in COVID-19-related studies: a method and quality systematic review | Chunhui Gu et.al. | 2403.07023 | null |
2024-03-11 | BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion | Xuan Ju et.al. | 2403.06976 | link |
2024-03-11 | Applicability of oculomics for individual risk prediction: Repeatability and robustness of retinal Fractal Dimension using DART and AutoMorph | Justin Engelmann et.al. | 2403.06950 | null |
2024-03-11 | Monitoring the Venice Lagoon: an IoT Cloud-Based Sensor Nerwork Approach | Filippo Campagnaro et.al. | 2403.06915 | null |
2024-03-11 | COOD: Combined out-of-distribution detection using multiple measures for anomaly & novel class detection in large-scale hierarchical classification | L. E. Hogeweg et.al. | 2403.06874 | null |
2024-03-20 | QUASAR: QUality and Aesthetics Scoring with Advanced Representations | Sergey Kastryulin et.al. | 2403.06866 | null |
2024-03-11 | A Comparative Study of Perceptual Quality Metrics for Audio-driven Talking Head Videos | Weixia Zhang et.al. | 2403.06421 | link |
2024-03-11 | Comparison of No-Reference Image Quality Models via MAP Estimation in Diffusion Latents | Weixia Zhang et.al. | 2403.06406 | null |
2024-03-11 | Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention Regulation in Diffusion Models | Yang Zhang et.al. | 2403.06381 | link |
2024-03-15 | ACM MMSys 2024 Bandwidth Estimation in Real Time Communications Challenge | Sami Khairy et.al. | 2403.06324 | link |
2024-03-10 | Implicit Image-to-Image Schrodinger Bridge for CT Super-Resolution and Denoising | Yuang Wang et.al. | 2403.06069 | null |
2024-03-09 | IOI: Invisible One-Iteration Adversarial Attack on No-Reference Image- and Video-Quality Metrics | Ekaterina Shumitskaya et.al. | 2403.05955 | link |
2024-03-09 | Wavelet-Like Transform-Based Technology in Response to the Call for Proposals on Neural Network-Based Image Coding | Cunhui Dong et.al. | 2403.05937 | null |
2024-03-08 | Evaluating Text-to-Image Generative Models: An Empirical Study on Human Image Synthesis | Muxi Chen et.al. | 2403.05125 | link |
2024-03-08 | CLIP-Gaze: Towards General Gaze Estimation via Visual-Linguistic Model | Pengwei Yin et.al. | 2403.05124 | null |
2024-03-08 | Spectrum Translation for Refinement of Image Generation (STIG) Based on Contrastive Learning and Spectral Filter Profile | Seokjun Lee et.al. | 2403.05093 | link |
2024-03-08 | Improving Diffusion-Based Generative Models via Approximated Optimal Transport | Daegyu Kim et.al. | 2403.05069 | link |
2024-03-08 | PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via Prompts | Zewen Chen et.al. | 2403.04993 | link |
2024-03-08 | StereoDiffusion: Training-Free Stereo Image Generation Using Latent Diffusion Models | Lezhong Wang et.al. | 2403.04965 | link |
2024-03-07 | BAGS: Blur Agnostic Gaussian Splatting through Multi-Scale Kernel Modeling | Cheng Peng et.al. | 2403.04926 | link |
2024-03-17 | PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation | Junsong Chen et.al. | 2403.04692 | link |
2024-03-07 | A Domain Translation Framework with an Adversarial Denoising Diffusion Model to Generate Synthetic Datasets of Echocardiography Images | Cristiana Tiago et.al. | 2403.04612 | null |
2024-03-08 | Finding Waldo: Towards Efficient Exploration of NeRF Scene Spaces | Evangelos Skartados et.al. | 2403.04508 | null |
2024-03-07 | FriendNet: Detection-Friendly Dehazing Network | Yihua Fan et.al. | 2403.04443 | link |
2024-03-07 | MAGR: Manifold-Aligned Graph Regularization for Continual Action Quality Assessment | Kanglei Zhou et.al. | 2403.04398 | link |
2024-03-07 | Self-Evaluation of Large Language Model based on Glass-box Features | Hui Huang et.al. | 2403.04222 | link |
2024-03-06 | Unifying Generation and Compression: Ultra-low bitrate Image Coding Via Multi-stage Transformer | Naifu Xue et.al. | 2403.03736 | null |
2024-03-06 | Development and evaluation of Artificial Intelligence techniques for IoT data quality assessment and curation | Laura Martín et.al. | 2403.03661 | null |
2024-03-06 | A Connector for Integrating NGSI-LD Data into Open Data Portals | Laura Martín et.al. | 2403.03648 | null |
2024-03-06 | Low-Dose CT Image Reconstruction by Fine-Tuning a UNet Pretrained for Gaussian Denoising for the Downstream Task of Image Enhancement | Tim Selig et.al. | 2403.03551 | null |
2024-03-06 | Combined optimization ghost imaging based on random speckle field | Zhiqing Yang et.al. | 2403.03426 | null |
2024-03-06 | DaISy: Diffuser-aided Sub-THz Imaging System | Shao-Hsuan Wu et.al. | 2403.03383 | null |
2024-03-05 | Imaging the event horizon of M87* from space on different timescales | Anastasia Shlentsova et.al. | 2403.03327 | null |
2024-03-05 | MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets | Hossein Aboutalebi et.al. | 2403.03194 | link |
2024-03-05 | Neural Image Compression with Text-guided Encoding for both Pixel-level and Perceptual Fidelity | Hagyeong Lee et.al. | 2403.02944 | link |
2024-03-05 | DIFNet: SAR RFI suppression based on domain invariant features | Fuping Fang et.al. | 2403.02894 | null |
2024-03-05 | Rehabilitation Exercise Quality Assessment through Supervised Contrastive Learning with Hard and Soft Negatives | Mark Karlov et.al. | 2403.02772 | null |
2024-03-04 | Leveraging Anchor-based LiDAR 3D Object Detection via Point Assisted Sample Selection | Shitao Chen et.al. | 2403.01978 | link |
2024-03-04 | Revisiting the dust torus size-luminosity relation based on a uniform reverberation mapping analysis | Amit Kumar Mandal et.al. | 2403.01885 | null |
2024-03-04 | PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis | Zhengyao Lv et.al. | 2403.01852 | link |
2024-03-04 | ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models | Lukas Höllein et.al. | 2403.01807 | link |
2024-03-04 | Development of a near-infrared wide-field integral field unit by ultra-precision diamond cutting | Kosuke Kushibiki et.al. | 2403.01668 | null |
2024-03-04 | Neural Network Assisted Lifting Steps For Improved Fully Scalable Lossy Image Compression in JPEG 2000 | Xinyue Li et.al. | 2403.01647 | link |
2024-03-05 | 3DGStream: On-the-Fly Training of 3D Gaussians for Efficient Streaming of Photo-Realistic Free-Viewpoint Videos | Jiakai Sun et.al. | 2403.01444 | link |
2024-03-02 | NeRF-VPT: Learning Novel View Representations with Neural Radiance Fields via View Prompt Tuning | Linsheng Chen et.al. | 2403.01325 | link |
2024-03-02 | Beyond Night Visibility: Adaptive Multi-Scale Fusion of Infrared and Visible Images | Shufan Pei et.al. | 2403.01083 | null |
2024-03-02 | LLMCRIT: Teaching Large Language Models to Use Criteria | Weizhe Yuan et.al. | 2403.01069 | link |
2024-03-01 | Near-Real-Time Mueller Polarimetric Image Processing for Neurosurgical Intervention | Stefano Moriconi et.al. | 2403.00893 | null |
2024-03-01 | Gate-set evaluation metrics for closed-loop optimal control on nitrogen-vacancy center ensembles in diamond | Philipp J. Vetter et.al. | 2403.00616 | null |
2024-03-01 | Equilibrium Model with Anisotropy for Model-Based Reconstruction in Magnetic Particle Imaging | Marco Maass et.al. | 2403.00602 | link |
2024-03-01 | Data Quality Assessment: Challenges and Opportunities | Sedir Mohammed et.al. | 2403.00526 | null |
2024-03-01 | Phase retrieval beyond the homogeneous object assumption for X-ray in-line holographic imaging | Jens Lucht et.al. | 2403.00461 | null |
2024-03-01 | An Ordinal Diffusion Model for Generating Medical Images with Different Severity Levels | Shumpei Takezaki et.al. | 2403.00452 | null |
2024-03-01 | Assessing objective quality metrics for JPEG and MPEG point cloud coding | Davi Lazzarotto et.al. | 2403.00410 | null |
2024-03-01 | List-Mode PET Image Reconstruction Using Dykstra-Like Splitting | Kibo Ote et.al. | 2403.00394 | null |
2024-03-01 | Optimization of Array Encoding for Ultrasound Imaging | Jacob Spainhour et.al. | 2403.00289 | link |
2024-03-01 | Deep-learning-based Magnetic Resonance Simultaneous Multislice Imaging Using Holographic Image Decoding | Satoshi Ito et.al. | 2403.00220 | null |
2024-03-03 | RoadRunner - Learning Traversability Estimation for Autonomous Off-road Driving | Jonas Frey et.al. | 2402.19341 | null |
2024-02-29 | Integral field spectroscopy supports atmospheric optics to reveal the finite outer scale of the turbulence | Begoña García-Lorenzo et.al. | 2402.19337 | null |
2024-03-13 | Modular Blind Video Quality Assessment | Wen Wen et.al. | 2402.19276 | link |
2024-02-29 | Training Generative Image Super-Resolution Models by Wavelet-Domain Losses Enables Better Control of Artifacts | Cansu Korkmaz et.al. | 2402.19215 | link |
2024-02-29 | Disentangling representations of retinal images with generative models | Sarah Müller et.al. | 2402.19186 | link |
2024-02-29 | Trajectory Consistency Distillation | Jianbin Zheng et.al. | 2402.19159 | link |
2024-02-29 | Atmospheric Turbulence Removal with Video Sequence Deep Visual Priors | P. Hill et.al. | 2402.19041 | null |
2024-02-28 | Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation | Yuan Ge et.al. | 2402.18191 | link |
2024-02-28 | NiteDR: Nighttime Image De-Raining with Cross-View Sensor Cooperative Learning for Dynamic Driving Scenes | Cidan Shi et.al. | 2402.18172 | link |
2024-03-02 | G4G:A Generic Framework for High Fidelity Talking Face Generation with Fine-grained Intra-modal Alignment | Juan Zhang et.al. | 2402.18122 | null |
2024-02-28 | Improvement Of Audiovisual Quality Estimation Using A Nonlinear Autoregressive Exogenous Neural Network And Bitstream Parameters | Koffi Kossi et.al. | 2402.18056 | null |
2024-02-28 | PolyOculus: Simultaneous Multi-view Image-based Novel View Synthesis | Jason J. Yu et.al. | 2402.17986 | null |
2024-02-28 | Rapid hyperspectral photothermal mid-infrared spectroscopic imaging from sparse data for gynecologic cancer tissue subtyping | Reza Reihanisaransari et.al. | 2402.17960 | null |
2024-02-29 | QN-Mixer: A Quasi-Newton MLP-Mixer Model for Sparse-View CT Reconstruction | Ishak Ayad et.al. | 2402.17951 | null |
2024-02-27 | Accelerated Real-time Cine and Flow under In-magnet Staged Exercise | Preethi Chandrasekaran et.al. | 2402.17877 | null |
2024-02-27 | A Performance Evaluation of Filtered Delay Multiply and Sum Beamforming for Ultrasound Localization Microscopy: Preliminary Results | A. N. Madhavanunni et.al. | 2402.17643 | null |
2024-02-28 | Black-box Adversarial Attacks Against Image Quality Assessment Models | Yu Ran et.al. | 2402.17533 | null |
2024-02-27 | Bit Distribution Study and Implementation of Spatial Quality Map in the JPEG-AI Standardization | Panqi Jia et.al. | 2402.17470 | null |
2024-02-27 | VastGaussian: Vast 3D Gaussians for Large Scene Reconstruction | Jiaqi Lin et.al. | 2402.17427 | null |
2024-02-27 | Sora Generates Videos with Stunning Geometrical Consistency | Xuanyi Li et.al. | 2402.17403 | null |
2024-03-10 | Learning Exposure Correction in Dynamic Scenes | Jin Liu et.al. | 2402.17296 | link |
2024-02-27 | DivAvatar: Diverse 3D Avatar Generation with a Single Prompt | Weijing Tao et.al. | 2402.17292 | null |
2024-03-01 | Advancing Generative Model Evaluation: A Novel Algorithm for Realistic Image Synthesis and Comparison in OCR System | Majid Memari et.al. | 2402.17204 | null |
2024-03-19 | Enhancing Quality of Compressed Images by Mitigating Enhancement Bias Towards Compression Domain | Qunliang Xing et.al. | 2402.17200 | null |
2024-02-27 | SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution | Chengcheng Wang et.al. | 2402.17133 | link |
2024-02-27 | T-HITL Effectively Addresses Problematic Associations in Image Generation and Maintains Overall Visual Quality | Susan Epstein et.al. | 2402.17101 | null |
2024-02-26 | Towards Environmental Preference Based Speech Enhancement For Individualised Multi-Modal Hearing Aids | Jasper Kirton-Wingate et.al. | 2402.16757 | null |
2024-02-29 | MISC: Ultra-low Bitrate Image Semantic Compression Driven by Large Multimodal Model | Chunyi Li et.al. | 2402.16749 | link |
2024-03-04 | Towards Open-ended Visual Quality Comparison | Haoning Wu et.al. | 2402.16641 | null |
2024-02-26 | Distortion-Controlled Dithering with Reduced Recompression Rate | Morriel Kasher et.al. | 2402.16447 | null |
2024-02-26 | Audio-Visual Speech Enhancement in Noisy Environments via Emotion-Based Contextual Cues | Tassadaq Hussain et.al. | 2402.16394 | null |
2024-02-26 | Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech | Szu-Wei Fu et.al. | 2402.16321 | link |
2024-02-24 | Design, Implementation and Analysis of a Compressed Sensing Photoacoustic Projection Imaging System | Markus Haltmeier et.al. | 2402.15750 | null |
2024-02-23 | Benchmarking the Robustness of Panoptic Segmentation for Automated Driving | Yiting Wang et.al. | 2402.15469 | null |
2024-02-23 | Ten computational challenges in human virome studies | Yifan Wu et.al. | 2402.15186 | null |
2024-02-23 | The Surprising Effectiveness of Skip-Tuning in Diffusion Sampling | Jiajun Ma et.al. | 2402.15170 | null |
2024-02-22 | Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis | Willi Menapace et.al. | 2402.14797 | null |
2024-02-25 | Two-stage Cytopathological Image Synthesis for Augmenting Cervical Abnormality Screening | Zhenrong Shen et.al. | 2402.14707 | null |
2024-02-22 | Diffusion Model Based Visual Compensation Guidance and Visual Difference Analysis for No-Reference Image Quality Assessment | Zhaoyang Wang et.al. | 2402.14401 | null |
2024-02-21 | Identifying Unnecessary 3D Gaussians using Clustering for Fast Rendering of 3D Gaussian Splatting | Joongho Jo et.al. | 2402.13827 | null |
2024-02-20 | Layout-to-Image Generation with Localized Descriptions using ControlNet with Cross-Attention Control | Denis Lukovnikov et.al. | 2402.13404 | null |
2024-02-24 | Denoising OCT Images Using Steered Mixture of Experts with Multi-Model Inference | Aytaç Özkan et.al. | 2402.12735 | null |
2024-02-20 | Simpson’s Paradox and the Accuracy-Fluency Tradeoff in Translation | Zheng Wei Lim et.al. | 2402.12690 | null |
2024-02-21 | Robust-Wide: Robust Watermarking against Instruction-driven Image Editing | Runyi Hu et.al. | 2402.12688 | link |
2024-02-20 | X-ray multibeam ptychography at up to 20 keV: nano-lithography enhances X-ray nano-imaging | Tang Li et.al. | 2402.12082 | null |
2024-02-19 | A Lightweight Parallel Framework for Blind Image Quality Assessment | Qunyue Huang et.al. | 2402.12043 | null |
2024-02-18 | Self-seeding and Multi-intent Self-instructing LLMs for Generating Intent-aware Information-Seeking dialogs | Arian Askari et.al. | 2402.11633 | link |
2024-02-16 | Path Loss Modeling for RIS-Assisted Wireless System with Direct Link and Elevation Factors | Vinay Kumar Chapala et.al. | 2402.10419 | null |
2024-02-15 | Deep Spectral Meshes: Multi-Frequency Facial Mesh Processing with Graph Neural Networks | Robert Kosk et.al. | 2402.10365 | null |
2024-02-15 | Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Community | Arman Isajanyan et.al. | 2402.09872 | link |
2024-02-15 | How to Train Data-Efficient LLMs | Noveen Sachdeva et.al. | 2402.09668 | null |
2024-02-14 | TAI-GAN: A Temporally and Anatomically Informed Generative Adversarial Network for early-to-late frame conversion in dynamic cardiac PET inter-frame motion correction | Xueqi Guo et.al. | 2402.09567 | null |
2024-02-14 | Assessing test artifact quality – A tertiary study | Huynh Khanh Vi Tran et.al. | 2402.09541 | null |
2024-02-14 | LL-GABR: Energy Efficient Live Video Streaming Using Reinforcement Learning | Adithya Raman et.al. | 2402.09392 | null |
2024-02-14 | Generalized Portrait Quality Assessment | Nicolas Chahine et.al. | 2402.09178 | link |
Super Resolution
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-11-26 | Perceptually Optimized Super Resolution | Volodymyr Karpenko et.al. | 2411.17513 | null |
2024-11-26 | MAT: Multi-Range Attention Transformer for Efficient Image Super-Resolution | Chengxing Xie et.al. | 2411.17214 | null |
2024-11-26 | PassionSR: Post-Training Quantization with Adaptive Scale in One-Step Diffusion based Image Super-Resolution | Libo Zhu et.al. | 2411.17106 | link |
2024-11-26 | ΩSFormer: Dual-Modal Ω-like Super-Resolution Transformer Network for Cross-scale and High-accuracy Terraced Field Vectorization Extraction | Chang Li et.al. | 2411.17088 | null |
2024-11-25 | ZoomLDM: Latent Diffusion Model for multi-scale image generation | Srikar Yellapragada et.al. | 2411.16969 | null |
2024-11-25 | From Diffusion to Resolution: Leveraging 2D Diffusion Models for 3D Super-Resolution Task | Bohao Chen et.al. | 2411.16792 | null |
2024-11-25 | EPS: Efficient Patch Sampling for Video Overfitting in Deep Super-Resolution Model Training | Yiying Wei et.al. | 2411.16312 | null |
2024-11-25 | High-Resolution Be Aware! Improving the Self-Supervised Real-World Super-Resolution | Yuehan Zhang et.al. | 2411.16175 | null |
2024-11-23 | FFT-Enhanced Low-Complexity Near-Field Super-Resolution Sensing | Yuxiao Wu et.al. | 2411.15532 | null |
2024-11-21 | UPdec-Webb: A Dataset for Coaddition of JWST NIRCam Images | Lei Wang et.al. | 2411.13891 | null |
2024-11-20 | HF-Diff: High-Frequency Perceptual Loss and Distribution Matching for One-Step Diffusion-Based Image Super-Resolution | Shoaib Meraj Sami et.al. | 2411.13548 | null |
2024-11-20 | Adversarial Diffusion Compression for Real-World Image Super-Resolution | Bin Chen et.al. | 2411.13383 | null |
2024-11-20 | RTSR: A Real-Time Super-Resolution Model for AV1 Compressed Content | Yuxuan Jiang et.al. | 2411.13362 | null |
2024-11-19 | Efficient Medicinal Image Transmission and Resolution Enhancement via GAN | Rishabh Kumar Sharma et.al. | 2411.12833 | null |
2024-11-19 | ISAC Super-Resolution Receivers: The Effect of Different Dictionary Matrices | Iman Valiulahi et.al. | 2411.12672 | null |
2024-11-19 | Contourlet Refinement Gate Framework for Thermal Spectrum Distribution Regularized Infrared Image Super-Resolution | Yang Zou et.al. | 2411.12530 | link |
2024-11-18 | Zoomed In, Diffused Out: Towards Local Degradation-Aware Multi-Diffusion for Extreme Image Super-Resolution | Brian B. Moser et.al. | 2411.12072 | link |
2024-11-16 | $\text{S}^{3}$ Mamba: Arbitrary-Scale Super-Resolution via Scaleable State Space Model | Peizhe Xia et.al. | 2411.11906 | null |
2024-11-17 | Low-Complexity Algorithms for Multichannel Spectral Super-Resolution | Xunmeng Wu et.al. | 2411.10938 | null |
2024-11-21 | Unveiling Hidden Details: A RAW Data-Enhanced Paradigm for Real-World Super-Resolution | Long Peng et.al. | 2411.10798 | null |
2024-11-15 | Experimental demonstration of Tessellation Structured Illumination Microscopy | Doron Shterman et.al. | 2411.10405 | null |
2024-11-15 | A Low-Resolution Image is Worth 1x1 Words: Enabling Fine Image Super-Resolution with Transformers and TaylorShift | Sanath Budakegowdanadoddi Nagaraju et.al. | 2411.10231 | null |
2024-11-15 | DiffFNO: Diffusion Fourier Neural Operator | Xiaoyi Liu et.al. | 2411.09911 | null |
2024-11-15 | Enhancing Diffusion Posterior Sampling for Inverse Problems by Integrating Crafted Measurements | Shijie Zhou et.al. | 2411.09850 | null |
2024-11-14 | OneNet: A Channel-Wise 1D Convolutional U-Net | Sanghyun Byun et.al. | 2411.09838 | link |
2024-11-14 | GAN-Based Architecture for Low-dose Computed Tomography Imaging Denoising | Yunuo Wang et.al. | 2411.09512 | null |
2024-11-14 | ISAC Super-Resolution Receiver via Lifted Atomic Norm Minimization | Iman Valiulahi et.al. | 2411.09495 | null |
2024-11-14 | Evaluation of RIS-Enabled B5G/6G Indoor Positioning and Mapping using Ray Tracing Models | Dimitris Kompostiotis et.al. | 2411.09440 | null |
2024-11-14 | LLV-FSR: Exploiting Large Language-Vision Prior for Face Super-resolution | Chenyang Wang et.al. | 2411.09293 | null |
2024-11-14 | Performance Boundaries and Tradeoffs in Super-Resolution Imaging Technologies for Space Targets | XiaoLe He et.al. | 2411.09155 | null |
2024-11-12 | On Adapting Randomized Nyström Preconditioners to Accelerate Variational Image Reconstruction | Tao Hong et.al. | 2411.08178 | null |
2024-11-12 | ALANINE: A Novel Decentralized Personalized Federated Learning For Heterogeneous LEO Satellite Constellation | Liang Zhao et.al. | 2411.07752 | null |
2024-11-12 | LapGSR: Laplacian Reconstructive Network for Guided Thermal Super-Resolution | Aditya Kasliwal et.al. | 2411.07750 | null |
2024-11-12 | Numerical Homogenization by Continuous Super-Resolution | Zhi-Song Liu et.al. | 2411.07576 | null |
2024-11-11 | Evaluating Detection Thresholds: The Impact of False Positives and Negatives on Super-Resolution Ultrasound Localization Microscopy | Sepideh K. Gharamaleki et.al. | 2411.07426 | null |
2024-11-11 | Ensemble Learning for Microbubble Localization in Super-Resolution Ultrasound | Sepideh K. Gharamaleki et.al. | 2411.07376 | null |
2024-11-11 | AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and state space models | Wallace Abreu et.al. | 2411.07364 | null |
2024-11-13 | General Geospatial Inference with a Population Dynamics Foundation Model | Mohit Agarwal et.al. | 2411.07207 | null |
2024-11-11 | 360-Degree Video Super Resolution and Quality Enhancement Challenge: Methods and Results | Ahmed Telili et.al. | 2411.06738 | null |
2024-11-11 | Expansion microscopy reveals neural circuit organization in genetic animal models | Shakila Behzadi et.al. | 2411.06676 | null |
2024-11-10 | Local Implicit Wavelet Transformer for Arbitrary-Scale Super-Resolution | Minghong Duan et.al. | 2411.06442 | link |
2024-11-10 | SuperResolution Radar Gesture Recognitio | Netanel Blumenfeld et.al. | 2411.06410 | null |
2024-11-09 | Quasi-Newton OMP Approach for Super-Resolution Channel Estimation and Extrapolation | Yi Zeng et.al. | 2411.06082 | null |
2024-11-09 | Predicting band structures for 2D Photonic Crystals via Deep Learning | Yueqi Wang et.al. | 2411.06063 | null |
2024-11-08 | A Modular Conditional Diffusion Framework for Image Reconstruction | Magauiya Zhussip et.al. | 2411.05993 | null |
2024-11-08 | WeatherGFM: Learning A Weather Generalist Foundation Model via In-context Learning | Xiangyu Zhao et.al. | 2411.05420 | null |
2024-11-08 | Electro-diffusive modeling and the role of spine geometry on action potential propagation in neurons | Rahul Gulati et.al. | 2411.05329 | null |
2024-11-07 | Reducing data resolution for better super-resolution: Reconstructing turbulent flows from noisy observation | Kyongmin Yeo et.al. | 2411.05240 | null |
2024-11-07 | ESC-MISR: Enhancing Spatial Correlations for Multi-Image Super-Resolution in Remote Sensing | Zhihui Zhang et.al. | 2411.04706 | null |
2024-11-06 | “Super-resolution” holographic optical tweezers array | Keisuke Nishimura et.al. | 2411.03564 | null |
2024-11-05 | SynthSet: Generative Diffusion Model for Semantic Segmentation in Precision Agriculture | Andrew Heschl et.al. | 2411.03505 | link |
2024-11-05 | Decoupling Fine Detail and Global Geometry for Compressed Depth Map Super-Resolution | Huan Zheng et.al. | 2411.03239 | null |
2024-11-05 | Applications of Automatic Differentiation in Image Registration | Warin Watson et.al. | 2411.02806 | link |
2024-11-05 | Super-resolution generalized eigenvalue method with truly sub-Nyquist sampling | Baoguo Liu et.al. | 2411.02700 | null |
2024-11-01 | Strongly Topology-preserving GNNs for Brain Graph Super-resolution | Pragya Singh et.al. | 2411.02525 | null |
2024-11-03 | Super-Resolution without High-Resolution Labels for Black Hole Simulations | Thomas Helfer et.al. | 2411.02453 | link |
2024-11-04 | MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D | Wei Cheng et.al. | 2411.02336 | null |
2024-11-01 | A Robust Super-Resolution Classifier by Nonlinear Optics | Ishan Darji et.al. | 2411.00953 | link |
2024-10-31 | Blind Time-of-Flight Imaging: Sparse Deconvolution on the Continuum with Unknown Kernels | Ruiming Guo et.al. | 2411.00893 | null |
2024-11-01 | Constrained Diffusion Implicit Models | Vivek Jayaram et.al. | 2411.00359 | null |
2024-10-31 | DiffPAD: Denoising Diffusion-based Adversarial Patch Decontamination | Jia Fu et.al. | 2410.24006 | link |
2024-10-29 | Temporal and Spatial Super Resolution with Latent Diffusion Model in Medical MRI images | Vishal Dubey et.al. | 2410.23898 | null |
2024-10-30 | Enhancing Image Resolution: A Simulation Study and Sensitivity Analysis of System Parameters for Resourcesat-3S/3SA | Ankur Garg et.al. | 2410.23319 | null |
2024-10-30 | EnsIR: An Ensemble Algorithm for Image Restoration via Gaussian Mixture Models | Shangquan Sun et.al. | 2410.22959 | link |
2024-10-30 | Latent Diffusion, Implicit Amplification: Efficient Continuous-Scale Super-Resolution for Remote Sensing Images | Hanlin Wu et.al. | 2410.22830 | null |
2024-10-29 | Volumetric Conditioning Module to Control Pretrained Diffusion Models for 3D Medical Images | Suhyun Ahn et.al. | 2410.21826 | link |
2024-10-29 | Fingerprints of Super Resolution Networks | Jeremy Vonderfecht et.al. | 2410.21653 | null |
2024-10-30 | Super-resolution in disordered media using neural networks | Alexander Christie et.al. | 2410.21556 | null |
2024-10-28 | Super-resolution with dynamics in the loss | Jacob Page et.al. | 2410.20884 | null |
2024-10-27 | Sebica: Lightweight Spatial and Efficient Bidirectional Channel Attention Super Resolution Network | Chongxiao Liu et.al. | 2410.20546 | link |
2024-10-27 | Guidance Disentanglement Network for Optics-Guided Thermal UAV Image Super-Resolution | Zhicheng Zhao et.al. | 2410.20466 | link |
2024-10-26 | Super-resolved virtual staining of label-free tissue using diffusion models | Yijie Zhang et.al. | 2410.20073 | null |
2024-10-25 | A Flow-based Truncated Denoising Diffusion Model for Super-resolution Magnetic Resonance Spectroscopic Imaging | Siyuan Dong et.al. | 2410.19288 | null |
2024-10-24 | A Spectral-based Physics-informed Finite Operator Learning for Prediction of Mechanical Behavior of Microstructures | Ali Harandi et.al. | 2410.19027 | null |
2024-10-25 | Transferring Knowledge from High-Quality to Low-Quality MRI for Adult Glioma Diagnosis | Yanguang Zhao et.al. | 2410.18698 | null |
2024-10-24 | Hyperspectral Spatial Super-Resolution using Keystone Error | Ankur Garg et.al. | 2410.18691 | null |
2024-10-24 | Advancements in Image Resolution: Super-Resolution Algorithm for Enhanced EOS-06 OCM-3 Data | Ankur Garg et.al. | 2410.18690 | null |
2024-10-22 | Advancing Super-Resolution in Neural Radiance Fields via Variational Diffusion Strategies | Shrey Vishen et.al. | 2410.18137 | link |
2024-10-23 | FIPER: Generalizable Factorized Fields for Joint Image Compression and Super-Resolution | Yang-Che Sun et.al. | 2410.18083 | null |
2024-10-23 | A Wavelet Diffusion GAN for Image Super-Resolution | Lorenzo Aloisi et.al. | 2410.17966 | null |
2024-10-23 | Truly Sub-Nyquist Method Based Matrix Pencil and CRT with Super Resolution | Huiguang Zhang et.al. | 2410.17841 | null |
2024-10-23 | AdaDiffSR: Adaptive Region-aware Dynamic Acceleration Diffusion Model for Real-World Image Super-Resolution | Yuanting Fan et.al. | 2410.17752 | null |
2024-10-23 | Generalizable Motion Planning via Operator Learning | Sharath Matada et.al. | 2410.17547 | null |
2024-10-22 | Multi Kernel Estimation based Object Segmentation | Haim Goldfisher et.al. | 2410.17064 | link |
2024-10-22 | Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models | Giannis Daras et.al. | 2410.16152 | null |
2024-10-21 | MINFLUX – molecular resolution with minimal photons | Lukas Scheiderer et.al. | 2410.15902 | null |
2024-10-18 | Ultrasound matrix imaging for transcranial in-vivo localization microscopy | Flavien Bureau et.al. | 2410.14499 | null |
2024-10-18 | Advanced Underwater Image Quality Enhancement via Hybrid Super-Resolution Convolutional Neural Networks and Multi-Scale Retinex-Based Defogging Techniques | Yugandhar Reddy Gogireddy et.al. | 2410.14285 | null |
2024-10-18 | ClearSR: Latent Low-Resolution Image Embeddings Help Diffusion-Based Real-World Super Resolution Models See Clearer | Yuhao Wan et.al. | 2410.14279 | null |
2024-10-17 | MMAD-Purify: A Precision-Optimized Framework for Efficient and Scalable Multi-Modal Attacks | Xinxin Liu et.al. | 2410.14089 | null |
2024-10-17 | ConsisSR: Delving Deep into Consistency in Diffusion-based Image Super-Resolution | Junhao Gu et.al. | 2410.13807 | null |
2024-10-17 | Unsupervised Skull Segmentation via Contrastive MR-to-CT Modality Translation | Kamil Kwarciak et.al. | 2410.13427 | null |
2024-10-16 | Super-resolving Real-world Image Illumination Enhancement: A New Dataset and A Conditional Diffusion Model | Yang Liu et.al. | 2410.12961 | null |
2024-10-16 | Transformer based super-resolution downscaling for regional reanalysis: Full domain vs tiling approaches | Antonio Pérez et.al. | 2410.12728 | null |
2024-10-16 | Approximations of MINFLUX Localization Precision with Background | Zach Marin et.al. | 2410.12427 | null |
2024-10-16 | Superoscillation focusing of high-order cylindrical-vector beams | Zhongwei Jin et.al. | 2410.12335 | null |
2024-10-15 | Temporal resolution enhancement in Structured Illumination Microscopy using cascaded reconstruction | Doron Shterman et.al. | 2410.11770 | null |
2024-10-15 | Degradation Oriented and Regularized Network for Real-World Depth Super-Resolution | Zhengxue Wang et.al. | 2410.11666 | link |
2024-10-15 | Spatio-Temporal Distortion Aware Omnidirectional Video Super-Resolution | Hongyu An et.al. | 2410.11506 | link |
2024-10-14 | Hi-Mamba: Hierarchical Mamba for Efficient Image Super-Resolution | Junbo Qiao et.al. | 2410.10140 | null |
2024-10-14 | Optimizing Fingerprint-Spectrum-Based Synchronization in Integrated Sensing and Communications | Xiao-Yang Wang et.al. | 2410.10134 | null |
2024-10-14 | REHRSeg: Unleashing the Power of Self-Supervised Super-Resolution for Resource-Efficient 3D MRI Segmentation | Zhiyun Song et.al. | 2410.10097 | null |
2024-10-13 | Conditioning 3D Diffusion Models with 2D Images: Towards Standardized OCT Volumes through En Face-Informed Super-Resolution | Coen de Vente et.al. | 2410.09862 | null |
2024-10-13 | HASN: Hybrid Attention Separable Network for Efficient Image Super-resolution | Weifeng Cao et.al. | 2410.09844 | link |
2024-10-11 | Riemannian Gradient Descent Method to Joint Blind Super-Resolution and Demixing in ISAC | Zeyu Xiang et.al. | 2410.08607 | null |
2024-10-11 | Quality Prediction of AI Generated Images and Videos: Emerging Trends and Opportunities | Abhijay Ghildyal et.al. | 2410.08534 | null |
2024-10-10 | TDDSR: Single-Step Diffusion with Two Discriminators for Super Resolution | Sohwi Kim et.al. | 2410.07663 | null |
2024-10-09 | HFH-Font: Few-shot Chinese Font Synthesis with Higher Quality, Faster Speed, and Higher Resolution | Hua Li et.al. | 2410.06488 | link |
2024-10-09 | MaskBlur: Spatial and Angular Data Augmentation for Light Field Image Super-Resolution | Wentao Chao et.al. | 2410.06478 | link |
2024-10-17 | SeeClear: Semantic Distillation Enhances Pixel Condensation for Video Super-Resolution | Qi Tang et.al. | 2410.05799 | link |
2024-10-07 | Enhanced Super-Resolution Training via Mimicked Alignment for Real-World Scenes | Omar Elezabi et.al. | 2410.05410 | null |
2024-10-07 | Near-Field ISAC in 6G: Addressing Phase Nonlinearity via Lifted Super-Resolution | Sajad Daei et.al. | 2410.04930 | null |
2024-10-05 | AIM 2024 Challenge on Video Super-Resolution Quality Assessment: Methods and Results | Ivan Molodetskikh et.al. | 2410.04225 | null |
2024-10-10 | Distillation-Free One-Step Diffusion for Real-World Image Super-Resolution | Jianze Li et.al. | 2410.04224 | link |
2024-10-05 | Exploring Strengths and Weaknesses of Super-Resolution Attack in Deepfake Detection | Davide Alessandro Coccomini et.al. | 2410.04205 | null |
2024-10-05 | TV-based Deep 3D Self Super-Resolution for fMRI | Fernando Pérez-Bueno et.al. | 2410.04097 | null |
2024-10-04 | Learning Truncated Causal History Model for Video Restoration | Amirhosein Ghasemabadi et.al. | 2410.03936 | link |
2024-10-01 | Resolution Enhancement of Scanning Electron Micrographs using Artificial Intelligence | Tom Reclik et.al. | 2410.03746 | null |
2024-10-04 | Point-Spread-Function Engineering in MINFLUX: Optimality of Donut and Half-Moon Excitation Patterns | Yan Liu et.al. | 2410.03349 | null |
2024-10-04 | Atom Camera: Super-resolution scanning microscope of a light pattern with a single ultracold atom | Takafumi Tomita et.al. | 2410.03241 | null |
2024-10-03 | PixelShuffler: A Simple Image Translation Through Pixel Rearrangement | Omar Zamzam et.al. | 2410.03021 | link |
2024-10-07 | SuperGS: Super-Resolution 3D Gaussian Splatting via Latent Feature Field and Gradient-guided Splitting | Shiyun Xie et.al. | 2410.02571 | null |
2024-10-03 | PnP-Flow: Plug-and-Play Image Restoration with Flow Matching | Ségolène Martin et.al. | 2410.02423 | link |
2024-10-03 | Ultrathin BIC metasurfaces based on ultralow-loss Sb2Se3 phase-change material | Zhaoyang Xie et.al. | 2410.02413 | null |
2024-10-02 | Stochastic Deep Restoration Priors for Imaging Inverse Problems | Yuyang Hu et.al. | 2410.02057 | null |
2024-10-01 | Optimizing Drug Delivery in Smart Pharmacies: A Novel Framework of Multi-Stage Grasping Network Combined with Adaptive Robotics Mechanism | Rui Tang et.al. | 2410.00753 | null |
2024-10-01 | Enhancing Sentinel-2 Image Resolution: Evaluating Advanced Techniques based on Convolutional and Generative Neural Networks | Patrick Kramer et.al. | 2410.00516 | null |
2024-09-29 | Effective Diffusion Transformer Architecture for Image Super-Resolution | Kun Cheng et.al. | 2409.19589 | link |
2024-09-27 | A Generalized Tensor Formulation for Hyperspectral Image Super-Resolution Under General Spatial Blurring | Yinjian Wang et.al. | 2409.18731 | null |
2024-09-27 | Simpler Gradient Methods for Blind Super-Resolution with Lower Iteration Complexity | Jinsheng Li et.al. | 2409.18387 | link |
2024-09-26 | Toward Efficient Deep Blind RAW Image Restoration | Marcos V. Conde et.al. | 2409.18204 | link |
2024-09-30 | DiffSSC: Semantic LiDAR Scan Completion using Denoising Diffusion Probabilistic Models | Helin Cao et.al. | 2409.18092 | null |
2024-09-26 | Taming Diffusion Prior for Image Super-Resolution with Domain Shift SDEs | Qinpeng Cui et.al. | 2409.17778 | link |
2024-09-26 | LGFN: Lightweight Light Field Image Super-Resolution using Local Convolution Modulation and Global Attention Feature Extraction | Zhongxin Yu et.al. | 2409.17759 | null |
2024-09-26 | Unifying Dimensions: A Linear Adaptive Approach to Lightweight Image Super-Resolution | Zhenyu Hu et.al. | 2409.17597 | link |
2024-09-26 | Study of Subjective and Objective Quality in Super-Resolution Enhanced Broadcast Images on a Novel SR-IQA Dataset | Yongrok Kim et.al. | 2409.17451 | null |
2024-09-25 | PSWF-Radon approach to reconstruction from band-limited Hankel transform | Fedor Goncharov et.al. | 2409.17409 | link |
2024-09-25 | Implicit Neural Representations for Simultaneous Reduction and Continuous Reconstruction of Multi-Altitude Climate Data | Alif Bin Abdul Qayyum et.al. | 2409.17367 | link |
2024-09-25 | AIM 2024 Challenge on Efficient Video Super-Resolution for AV1 Compressed Content | Marcos V Conde et.al. | 2409.17256 | null |
2024-09-25 | Degradation-Guided One-Step Image Super-Resolution with Diffusion Priors | Aiping Zhang et.al. | 2409.17058 | link |
2024-09-25 | NTIRE 2024 Challenge on Stereo Image Super-Resolution: Methods and Results | Longguang Wang et.al. | 2409.16947 | null |
2024-09-24 | Diffusion Models to Enhance the Resolution of Microscopy Images: A Tutorial | Harshith Bachimanchi et.al. | 2409.16488 | null |
2024-09-24 | Compressed Depth Map Super-Resolution and Restoration: AIM 2024 Challenge Results | Marcos V. Conde et.al. | 2409.16277 | null |
2024-09-24 | Super-resolution positron emission tomography by intensity modulation: Proof of concept | Youdong Lang et.al. | 2409.16085 | null |
2024-09-24 | Denoising Graph Super-Resolution towards Improved Collider Event Reconstruction | Nilotpal Kakati et.al. | 2409.16052 | null |
2024-09-24 | Stochastically Structured Illumination Microscopy scan less super resolution imaging | Denzel Fusco et.al. | 2409.16006 | null |
2024-09-24 | Dual-Comb Photothermal Microscopy | Peter Chang et.al. | 2409.15685 | null |
2024-09-21 | BurstM: Deep Burst Multi-scale SR using Fourier Space with Optical Flow | EungGu Kang et.al. | 2409.15384 | link |
2024-09-22 | One Model for Two Tasks: Cooperatively Recognizing and Recovering Low-Resolution Scene Text Images by Iterative Mutual Guidance | Minyi Zhao et.al. | 2409.14483 | null |
2024-09-22 | Prior Knowledge Distillation Network for Face Super-Resolution | Qiu Yang et.al. | 2409.14385 | null |
2024-09-22 | Thinking in Granularity: Dynamic Quantization for Image Super-Resolution by Intriguing Multi-Granularity Clues | Mingshen Wang et.al. | 2409.14330 | null |
2024-09-21 | A Sinkhorn Regularized Adversarial Network for Image Guided DEM Super-resolution using Frequency Selective Hybrid Graph Transformer | Subhajit Paul et.al. | 2409.14198 | null |
2024-09-21 | Vortex Interference Enables optimal 3D Interferometric Nanoscopy | Wei Wang et.al. | 2409.14033 | null |
2024-09-21 | On the Effectiveness of Neural Operators at Zero-Shot Weather Downscaling | Saumya Sinha et.al. | 2409.13955 | null |
2024-09-20 | PlainUSR: Chasing Faster ConvNet for Efficient Super-Resolution | Yan Wang et.al. | 2409.13435 | link |
2024-09-20 | Super-Resolution via Learned Predictor | Sampath Kumar Dondapati et.al. | 2409.13326 | null |
2024-09-20 | Deep Learning based Optical Image Super-Resolution via Generative Diffusion Models for Layerwise in-situ LPBF Monitoring | Francis Ogoke et.al. | 2409.13171 | null |
2024-09-19 | Image inpainting for corrupted images by using the semi-super resolution GAN | Mehrshad Momen-Tayefeh et.al. | 2409.12636 | null |
2024-09-19 | HSIGene: A Foundation Model For Hyperspectral Image Generation | Li Pang et.al. | 2409.12470 | link |
2024-09-17 | NSSR-DIL: Null-Shot Image Super-Resolution Using Deep Identity Learning | Sree Rama Vamsidhar S et.al. | 2409.12165 | null |
2024-09-18 | Quantum-like nonlinear interferometry with frequency-engineered classical light | Romain Dalidet et.al. | 2409.12049 | null |
2024-09-19 | Adaptive Selection of Sampling-Reconstruction in Fourier Compressed Sensing | Seongmin Hong et.al. | 2409.11738 | null |
2024-09-17 | Enhancing the Reliability of LiDAR Point Cloud Sampling: A Colorization and Super-Resolution Approach Based on LiDAR-Generated Images | Sier Ha et.al. | 2409.11532 | null |
2024-09-19 | Super Resolution On Global Weather Forecasts | Lawrence Zhang et.al. | 2409.11502 | null |
2024-09-17 | Online 4D Ultrasound-Guided Robotic Tracking Enables 3D Ultrasound Localisation Microscopy with Large Tissue Displacements | Jipeng Yan et.al. | 2409.11391 | null |
2024-09-18 | Single-Layer Learnable Activation for Implicit Neural Representation (SL $^{2}$ A-INR) | Moein Heidari et.al. | 2409.10836 | null |
2024-09-16 | WaveMixSR-V2: Enhancing Super-resolution with Higher Efficiency | Pranav Jeevan et.al. | 2409.10582 | link |
2024-09-16 | Adaptive Segmentation-Based Initialization for Steered Mixture of Experts Image Regression | Yi-Hsin Li et.al. | 2409.10101 | null |
2024-09-15 | Learning Two-factor Representation for Magnetic Resonance Image Super-resolution | Weifeng Wei et.al. | 2409.09731 | null |
2024-09-14 | Adversarial Deep-Unfolding Network for MA-XRF Super-Resolution on Old Master Paintings Using Minimal Training Data | Herman Verinaz-Jadan et.al. | 2409.09483 | null |
2024-09-17 | Wave-U-Mamba: An End-To-End Framework For High-Quality And Efficient Speech Super Resolution | Yongjoon Lee et.al. | 2409.09337 | null |
2024-09-13 | FB-HyDON: Parameter-Efficient Physics-Informed Operator Learning of Complex PDEs via Hypernetwork and Finite Basis Domain Decomposition | Milad Ramezankhani et.al. | 2409.09207 | null |
2024-09-13 | Optically-Validated Microvascular Phantom for Super-Resolution Ultrasound Imaging | Jaime Parra Raad et.al. | 2409.09031 | null |
2024-09-13 | Test-time Training for Hyperspectral Image Super-resolution | Ke Li et.al. | 2409.08667 | null |
2024-09-13 | Low Complexity DoA-ToA Signature Estimation for Multi-Antenna Multi-Carrier Systems | Chandrashekhar Rai et.al. | 2409.08650 | null |
2024-09-13 | Think Twice Before You Act: Improving Inverse Problem Solving With MCMC | Yaxuan Zhu et.al. | 2409.08551 | null |
2024-09-12 | Learned Compression for Images and Point Clouds | Mateen Ulhaq et.al. | 2409.08376 | link |
2024-09-12 | Mapping the nanoscale optical topological textures with a fiber-integrated plasmonic probe | Yunkun Wu et.al. | 2409.07894 | null |
2024-09-17 | Mesh-based Super-Resolution of Fluid Flows with Multiscale Graph Neural Networks | Shivam Barwey et.al. | 2409.07769 | null |
2024-09-11 | Dual scale Residual-Network for turbulent flow sub grid scale resolving: A prior analysis | Omar Sallam et.al. | 2409.07605 | null |
2024-09-11 | Three-Dimensional, Multimodal Synchrotron Data for Machine Learning Applications | Calum Green et.al. | 2409.07322 | link |
2024-09-11 | CWT-Net: Super-resolution of Histopathology Images Using a Cross-scale Wavelet-based Transformer | Feiyang Jia et.al. | 2409.07092 | null |
2024-09-10 | Lightweight Multiscale Feature Fusion Super-Resolution Network Based on Two-branch Convolution and Transformer | Li Ke et.al. | 2409.06590 | null |
2024-09-10 | Distilling Generative-Discriminative Representations for Very Low-Resolution Face Recognition | Junzheng Zhang et.al. | 2409.06371 | null |
2024-09-10 | EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation | Nischal Khanal et.al. | 2409.06183 | link |
2024-09-07 | Single-snapshot machine learning for turbulence super resolution | Kai Fukami et.al. | 2409.04923 | null |
2024-09-06 | Empirical Bayesian image restoration by Langevin sampling with a denoising diffusion implicit prior | Charlesquin Kemajou Mbakam et.al. | 2409.04384 | null |
2024-09-06 | Adaptive Super-Resolution Imaging Without Prior Knowledge Using a Programmable Spatial-Mode Sorter | Itay Ozer et.al. | 2409.04323 | null |
2024-09-06 | EigenSR: Eigenimage-Bridged Pre-Trained RGB Learners for Single Hyperspectral Image Super-Resolution | Xi Su et.al. | 2409.04050 | null |
2024-09-05 | Use of triplet loss for facial restoration in low-resolution images | Sebastian Pulgar et.al. | 2409.03530 | null |
2024-09-05 | LMLT: Low-to-high Multi-Level Vision Transformer for Image Super-Resolution | Jeongsoo Kim et.al. | 2409.03516 | link |
2024-09-07 | Real-time Speech Enhancement on Raw Signals with Deep State-space Modeling | Yan Ru Pei et.al. | 2409.03377 | link |
2024-09-05 | Enhancing digital core image resolution using optimal upscaling algorithm: with application to paired SEM images | Shaohua You et.al. | 2409.03265 | null |
2024-09-05 | Perceptual-Distortion Balanced Image Super-Resolution is a Multi-Objective Optimization Problem | Qiwen Zhu et.al. | 2409.03179 | link |
2024-09-04 | Human-VDM: Learning Single-Image 3D Human Gaussian Splatting from Video Diffusion Models | Zhibin Liu et.al. | 2409.02851 | link |
2024-09-04 | Solving Video Inverse Problems Using Image Diffusion Models | Taesung Kwon et.al. | 2409.02574 | null |
2024-09-07 | EarthGen: Generating the World from Top-Down Views | Ansh Sharma et.al. | 2409.01491 | link |
2024-09-02 | DiffEyeSyn: Diffusion-based User-specific Eye Movement Synthesis | Chuhan Jiao et.al. | 2409.01240 | null |
2024-09-02 | Single-photon super-resolved spectroscopy from spatial-mode demultiplexing | Luigi Santamaria Amato et.al. | 2409.01190 | null |
2024-09-02 | SeCo-INR: Semantically Conditioned Implicit Neural Representations for Improved Medical Image Super-Resolution | Mevan Ekanayake et.al. | 2409.01013 | null |
2024-09-01 | DMRA: An Adaptive Line Spectrum Estimation Method through Dynamical Multi-Resolution of Atoms | Mingguang Han et.al. | 2409.00799 | null |
2024-09-01 | Rethinking Image Super-Resolution from Training Data Perspectives | Go Ohtani et.al. | 2409.00768 | link |
2024-09-01 | Attention-Guided Multi-scale Interaction Network for Face Super-Resolution | Xujie Wan et.al. | 2409.00591 | null |
2024-08-30 | HiTSR: A Hierarchical Transformer for Reference-based Super-Resolution | Masoomeh Aslahishahri et.al. | 2408.16959 | link |
2024-08-29 | GameIR: A Large-Scale Synthesized Ground-Truth Dataset for Image Restoration over Gaming Content | Lebin Zhou et.al. | 2408.16866 | null |
2024-08-30 | Beyond MR Image Harmonization: Resolution Matters Too | Savannah P. Hays et.al. | 2408.16562 | null |
2024-08-29 | Super-Resolution works for coastal simulations | Zhi-Song Liu et.al. | 2408.16553 | null |
2024-08-29 | Enhanced Control for Diffusion Bridge in Image Restoration | Conghan Yue et.al. | 2408.16303 | link |
2024-08-28 | ChartEye: A Deep Learning Framework for Chart Information Extraction | Osama Mustafa et.al. | 2408.16123 | null |
2024-08-27 | Multi-Feature Aggregation in Diffusion Models for Enhanced Face Super-Resolution | Marcelo dos Santos et.al. | 2408.15386 | link |
2024-08-22 | 3D Photon Counting CT Image Super-Resolution Using Conditional Diffusion Model | Chuang Niu et.al. | 2408.15283 | null |
2024-08-27 | Histo-Diffusion: A Diffusion Super-Resolution Method for Digital Pathology with Comprehensive Quality Assessment | Xuan Xu et.al. | 2408.15218 | null |
2024-08-27 | A Preliminary Exploration Towards General Image Restoration | Xiangtao Kong et.al. | 2408.15143 | null |
2024-08-27 | Enhancing License Plate Super-Resolution: A Layout-Aware and Character-Driven Approach | Valfride Nascimento et.al. | 2408.15103 | link |
2024-08-26 | Cascaded Temporal Updating Network for Efficient Video Super-Resolution | Hao Li et.al. | 2408.14244 | null |
2024-08-26 | Efficient Active Flow Control Strategy for Confined Square Cylinder Wake Using Deep Learning-Based Surrogate Model and Reinforcement Learning | Meng Zhang et.al. | 2408.14232 | null |
2024-08-25 | Particle-Filtering-based Latent Diffusion for Inverse Problems | Amir Nazemi et.al. | 2408.13868 | null |
2024-08-25 | FreqINR: Frequency Consistency for Implicit Neural Representation with Adaptive DCT Frequency Loss | Meiyi Wei et.al. | 2408.13716 | null |
2024-08-23 | ResSR: A Residual Approach to Super-Resolving Multispectral Images | Haley Duba-Sullivan et.al. | 2408.13225 | null |
2024-08-23 | SIMPLE: Simultaneous Multi-Plane Self-Supervised Learning for Isotropic MRI Restoration from Anisotropic Data | Rotem Benisty et.al. | 2408.13065 | null |
2024-08-22 | A Unified Plug-and-Play Algorithm with Projected Landweber Operator for Split Convex Feasibility Problems | Shuchang Zhang et.al. | 2408.12100 | null |
2024-08-21 | MambaCSR: Dual-Interleaved Scanning for Compressed Image Super-Resolution With SSMs | Yulin Ren et.al. | 2408.11758 | link |
2024-08-21 | Quantum super-resolution microscopy by photon statistics and structured light | Fabio Picariello et.al. | 2408.11654 | null |
2024-08-22 | Phase-Based Approaches for Rapid Construction of Magnetic Fields in NV Magnetometry | Prabhat Anand et.al. | 2408.11069 | null |
2024-08-20 | MambaDS: Near-Surface Meteorological Field Downscaling with Topography Constrained Selective State Space Modeling | Zili Liu et.al. | 2408.10854 | null |
2024-08-19 | Webcam-based Pupil Diameter Prediction Benefits from Upscaling | Vijul Shah et.al. | 2408.10397 | null |
2024-08-19 | ML-CrAIST: Multi-scale Low-high Frequency Information-based Cross black Attention with Image Super-resolving Transformer | Alik Pramanick et.al. | 2408.09940 | link |
2024-08-19 | Harnessing Multi-resolution and Multi-scale Attention for Underwater Image Restoration | Alik Pramanick et.al. | 2408.09912 | link |
2024-08-19 | Predicting Long-term Dynamics of Complex Networks via Identifying Skeleton in Hyperbolic Space | Ruikun Li et.al. | 2408.09845 | link |
2024-08-19 | Implicit Grid Convolution for Multi-Scale Image Super-Resolution | Dongheon Lee et.al. | 2408.09674 | link |
2024-08-18 | Angle of Arrival Estimation with Transformer: A Sparse and Gridless Method with Zero-Shot Capability | Zhaoxuan Zhu et.al. | 2408.09362 | null |
2024-08-17 | Discovery of Limb-Brightening in the Parsec-Scale Jet of NGC 315 through Global VLBI Observations and Its Implications for Jet Models | Jongho Park et.al. | 2408.09069 | null |
2024-08-16 | AI-assisted super-resolution cosmological simulations IV: An emulator for deterministic realizations | Xiaowen Zhang et.al. | 2408.09051 | link |
2024-08-25 | Task-Aware Dynamic Transformer for Efficient Arbitrary-Scale Image Super-Resolution | Tianyi Xu et.al. | 2408.08736 | link |
2024-08-16 | QMambaBSR: Burst Image Super-Resolution with Query State Space Model | Xin Di et.al. | 2408.08665 | null |
2024-08-16 | Reference-free Axial Super-resolution of 3D Microscopy Images using Implicit Neural Representation with a 2D Diffusion Prior | Kyungryun Lee et.al. | 2408.08616 | link |
2024-08-16 | Enhancing Events in Neutrino Telescopes through Deep Learning-Driven Super-Resolution | Felix J. Yu et.al. | 2408.08474 | null |
2024-08-15 | SuperNANO: Enabling Nano-Scale Laser an-ti-counterfeiting Marking and Precision Cutting with Super-Resolution Imaging | Yiduo Chen et.al. | 2408.08455 | null |
2024-08-14 | Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving | Yuqing Wen et.al. | 2408.07605 | null |
2024-08-15 | DIffSteISR: Harnessing Diffusion Prior for Superior Real-world Stereo Image Super-Resolution | Yuanbo Zhou et.al. | 2408.07516 | null |
2024-08-14 | GRFormer: Grouped Residual Self-Attention for Lightweight Single Image Super-Resolution | Yuzhen Li et.al. | 2408.07484 | link |
2024-08-14 | One Step Diffusion-based Super-Resolution with Time-Aware Distillation | Xiao He et.al. | 2408.07476 | link |
2024-08-17 | Deep-sub-cycle attosecond optical pulses | Hongliang Dang et.al. | 2408.07306 | null |
2024-08-13 | Event-Stream Super Resolution using Sigma-Delta Neural Network | Waseem Shariff et.al. | 2408.06968 | null |
2024-08-12 | Palantir: Towards Efficient Super Resolution for Ultra-high-definition Live Streaming | Xinqi Jin et.al. | 2408.06152 | link |
2024-08-12 | Efficient and Scalable Point Cloud Generation with Sparse Point-Voxel Diffusion Models | Ioannis Romanelis et.al. | 2408.06145 | link |
2024-08-11 | SSL: A Self-similarity Loss for Improving Generative Image Super-resolution | Du Chen et.al. | 2408.05713 | link |
2024-08-10 | Content-decoupled Contrastive Learning-based Implicit Degradation Modeling for Blind Image Super-Resolution | Jiang Yuan et.al. | 2408.05440 | null |
2024-08-09 | Kalman-Inspired Feature Propagation for Video Face Super-Resolution | Ruicheng Feng et.al. | 2408.05205 | null |
2024-08-08 | Efficient Single Image Super-Resolution with Entropy Attention and Receptive Field Augmentation | Xiaole Zhao et.al. | 2408.04158 | null |
2024-08-07 | Underwater litter monitoring using consumer-grade aerial-aquatic speedy scanner (AASS) and deep learning based super-resolution reconstruction and detection network | Fan Zhao et.al. | 2408.03564 | null |
2024-08-07 | Monitoring of Hermit Crabs Using drone-captured imagery and Deep Learning based Super-Resolution Reconstruction and Improved YOLOv8 | Fan Zhao et.al. | 2408.03559 | null |
2024-08-06 | SGSR: Structure-Guided Multi-Contrast MRI Super-Resolution via Spatio-Frequency Co-Query Attention | Shaoming Zheng et.al. | 2408.03194 | null |
2024-08-03 | Supervised Image Translation from Visible to Infrared Domain for Object Detection | Prahlad Anand et.al. | 2408.01843 | null |
2024-08-03 | Transformer for seismic image super-resolution | Shiqi Dong et.al. | 2408.01695 | null |
2024-08-03 | Flow Reconstruction Using Spatially Restricted Domains Based on Enhanced Super-Resolution Generative Adversarial Networks | Mustafa Z. Yousif et.al. | 2408.01658 | null |
2024-08-02 | PINNs for Medical Image Analysis: A Survey | Chayan Banerjee et.al. | 2408.01026 | null |
2024-08-01 | Stop-and-go waves reconstruction via iterative refinement | Junyi Ji et.al. | 2408.00941 | null |
2024-08-01 | Exceptional points in SSH-like models with hopping amplitude gradient | David S. Simon et.al. | 2408.00879 | null |
2024-08-01 | Image Super-Resolution with Taylor Expansion Approximation and Large Field Reception | Jiancong Feng et.al. | 2408.00470 | null |
2024-07-31 | Accelerating Image Super-Resolution Networks with Pixel-Level Classification | Jinho Jeong et.al. | 2407.21448 | null |
2024-07-27 | Inverse Problems with Diffusion Models: A MAP Estimation Perspective | Sai bharath chandra Gutha et.al. | 2407.20784 | null |
2024-08-01 | What makes for good morphology representations for spatial omics? | Eduard Chelebian et.al. | 2407.20660 | null |
2024-07-30 | Efficient Channel Estimation for Millimeter Wave and Terahertz Systems Enabled by Integrated Super-resolution Sensing and Communication | Jingran Xu et.al. | 2407.20607 | null |
2024-07-29 | Spatial sub-Rayleigh imaging via structured speckle illumination | Liming Li et.al. | 2407.20460 | null |
2024-08-02 | Deep Learning for Super-resolution Ultrasound Imaging with Spatiotemporal Data | Arthur David Redfern et.al. | 2407.20407 | null |
2024-07-30 | Efficient Face Super-Resolution via Wavelet-based Feature Enhancement Network | Wenjie Li et.al. | 2407.19768 | link |
2024-07-28 | Giant Purcell broadening and Lamb shift for DNA-assembled near-infrared quantum emitters | Sachin Verlekar et.al. | 2407.19513 | null |
2024-07-28 | Perfect Hyperlens | Tao Hou et.al. | 2407.19506 | null |
2024-07-28 | Model-based Super-resolution: Towards a Unified Framework for Super-resolution | Zetao Fei et.al. | 2407.19480 | null |
2024-07-28 | Competition-based Adaptive ReLU for Deep Neural Networks | Junjia Chen et.al. | 2407.19441 | null |
2024-07-27 | Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network | Gang Pan et.al. | 2407.19271 | null |
2024-07-26 | Super Resolution for Renewable Energy Resource Data With Wind From Reanalysis Data (Sup3rWind) and Application to Ukraine | Brandon N. Benton et.al. | 2407.19086 | null |
2024-07-25 | GaussianSR: High Fidelity 2D Gaussian Splatting for Arbitrary-Scale Image Super-Resolution | Jintong Hu et.al. | 2407.18046 | null |
2024-07-24 | Cuboid-Net: A Multi-Branch Convolutional Neural Network for Joint Space-Time Video Super Resolution | Congrui Fu et.al. | 2407.16986 | null |
2024-07-24 | 3DAttGAN: A 3D Attention-based Generative Adversarial Network for Joint Space-Time Video Super-Resolution | Congrui Fu et.al. | 2407.16965 | link |
2024-07-23 | Channel-Partitioned Windowed Attention And Frequency Learning for Single Image Super-Resolution | Dinh Phu Tran et.al. | 2407.16232 | null |
2024-07-23 | Topological Dark Spots of Electric Near Field in Metal Structures | Tong Fu et.al. | 2407.16213 | null |
2024-07-23 | Diffusion Prior-Based Amortized Variational Inference for Noisy Inverse Problems | Sojin Lee et.al. | 2407.16125 | link |
2024-07-22 | High-flexibility reconstruction of small-scale motions in wall turbulence using a generalized zero-shot learning | Haokai Wu et.al. | 2407.15604 | null |
2024-07-22 | Attention Beats Linear for Fast Implicit Neural Representation Generation | Shuyi Zhang et.al. | 2407.15355 | link |
2024-07-22 | ThermalNeRF: Thermal Radiance Fields | Yvette Y. Lin et.al. | 2407.15337 | null |
2024-07-22 | Efficient Multi-disparity Transformer for Light Field Image Super-resolution | Zeke Zexi Hu et.al. | 2407.15329 | null |
2024-07-20 | A New Dataset and Framework for Real-World Blurred Images Super-Resolution | Rui Qin et.al. | 2407.14880 | link |
2024-07-19 | Large Kernel Distillation Network for Efficient Single Image Super-Resolution | Chengxing Xie et.al. | 2407.14340 | link |
2024-07-19 | RealViformer: Investigating Attention for Real-World Video Super-Resolution | Yuehan Zhang et.al. | 2407.13987 | link |
2024-07-18 | MaRINeR: Enhancing Novel Views by Matching Rendered Images with Nearby References | Lukas Bösiger et.al. | 2407.13745 | link |
2024-07-18 | Research on Image Super-Resolution Reconstruction Mechanism based on Convolutional Neural Network | Hao Yan et.al. | 2407.13211 | null |
2024-07-18 | UCIP: A Universal Framework for Compressed Image Super-Resolution using Dynamic Prompt | Xin Li et.al. | 2407.13108 | null |
2024-07-17 | Toward INT4 Fixed-Point Training via Exploring Quantization Error for Gradients | Dohyung Kim et.al. | 2407.12637 | null |
2024-07-16 | Speckle-based 3D sub-diffraction imaging through a multimode fiber | Zhouping Lyu et.al. | 2407.11796 | null |
2024-07-16 | Deconvolution with a Box | Pedro Felzenszwalb et.al. | 2407.11685 | null |
2024-07-16 | Leveraging Segment Anything Model in Identifying Buildings within Refugee Camps (SAM4Refugee) from Satellite Imagery for Humanitarian Operations | Yunya Gao et.al. | 2407.11381 | link |
2024-07-16 | Zero-Shot Adaptation for Approximate Posterior Sampling of Diffusion Models in Inverse Problems | Yaşar Utku Alçalar et.al. | 2407.11288 | null |
2024-07-14 | Restore-RWKV: Efficient and Effective Medical Image Restoration with RWKV | Zhiwen Yang et.al. | 2407.11087 | link |
2024-07-15 | Spectral Properties of Infinitely Smooth Kernel Matrices in the Single Cluster Limit, with Applications to Multivariate Super-Resolution | Nuha Diab et.al. | 2407.10600 | null |
2024-07-15 | Backdoor Attacks against Image-to-Image Networks | Wenbo Jiang et.al. | 2407.10445 | null |
2024-07-13 | Arbitrary-Scale Video Super-Resolution with Structural and Textural Priors | Wei Shang et.al. | 2407.09919 | link |
2024-07-13 | Fast and Provable Simultaneous Blind Super-Resolution and Demixing for Point Source Signals: Scaled Gradient Descent without Regularization | Jinchi Chen et.al. | 2407.09900 | link |
2024-07-12 | Region Attention Transformer for Medical Image Restoration | Zhiwen Yang et.al. | 2407.09268 | link |
2024-07-12 | Task-driven single-image super-resolution reconstruction of document scans | Maciej Zyrek et.al. | 2407.08993 | null |
2024-07-11 | Global Spatial-Temporal Information-based Residual ConvLSTM for Video Space-Time Super-Resolution | Congrui Fu et.al. | 2407.08466 | null |
2024-07-11 | Wind Power Assessment based on Super-Resolution and Downscaling – A Comparison of Deep Learning Methods | Luca Schmidt et.al. | 2407.08259 | null |
2024-07-11 | Spatially-Variant Degradation Model for Dataset-free Super-resolution | Shaojie Guo et.al. | 2407.08252 | null |
2024-07-10 | VEnhancer: Generative Space-Time Enhancement for Video Generation | Jingwen He et.al. | 2407.07667 | null |
2024-07-10 | Aging-Resistant Wideband Precoding in 5G and Beyond Using 3D Convolutional Neural Networks | Alejandro Villena-Rodriguez et.al. | 2407.07434 | null |
2024-07-10 | Pairwise Distance Distillation for Unsupervised Real-World Image Super-Resolution | Yuehan Zhang et.al. | 2407.07302 | link |
2024-07-09 | UnmixingSR: Material-aware Network with Unsupervised Unmixing as Auxiliary Task for Hyperspectral Image Super-resolution | Yang Yu et.al. | 2407.06525 | null |
2024-07-08 | Enhancing super-resolution ultrasound localisation through multi-frame deconvolution exploiting spatiotemporal coherence | Su Yan et.al. | 2407.06373 | null |
2024-07-08 | Layered Diffusion Model for One-Shot High Resolution Text-to-Image Synthesis | Emaad Khwaja et.al. | 2407.06079 | null |
2024-07-08 | Self-Prior Guided Mamba-UNet Networks for Medical Image Super-Resolution | Zexin Ji et.al. | 2407.05993 | null |
2024-07-08 | Deform-Mamba Network for MRI Super-Resolution | Zexin Ji et.al. | 2407.05969 | null |
2024-07-08 | HiT-SR: Hierarchical Transformer for Efficient Image Super-Resolution | Xiang Zhang et.al. | 2407.05878 | null |
2024-07-08 | Neuromorphic Imaging with Super-Resolution | Pei Zhang et.al. | 2407.05764 | null |
2024-07-07 | Edge-guided and Cross-scale Feature Fusion Network for Efficient Multi-contrast MRI Super-Resolution | Zhiyuan Yang et.al. | 2407.05307 | link |
2024-07-07 | A Hybrid Registration and Fusion Method for Hyperspectral Super-resolution | Kunjing Yang et.al. | 2407.05279 | null |
2024-07-07 | RIS-assisted Coverage Enhancement in mmWave Integrated Sensing and Communication Networks | Xu Gan et.al. | 2407.05249 | null |
2024-07-05 | NSD-DIL: Null-Shot Deblurring Using Deep Identity Learning | Sree Rama Vamsidhar S et.al. | 2407.04815 | null |
2024-07-08 | Super-resolution imaging of nanoscale inhomogeneities in hBN-covered and encapsulated few-layer graphene | Lina Jäckering et.al. | 2407.04565 | null |
2024-07-05 | AnySR: Realizing Image Super-Resolution as Any-Scale, Any-Resource | Wengyi Zhan et.al. | 2407.04241 | link |
2024-07-04 | M^3:Manipulation Mask Manufacturer for Arbitrary-Scale Super-Resolution Mask | Xinyu Yang et.al. | 2407.03695 | null |
2024-07-04 | ASteISR: Adapting Single Image Super-resolution Pre-trained Model for Efficient Stereo Image Super-resolution | Yuanbo Zhou et.al. | 2407.03598 | null |
2024-07-04 | Spatio-Temporal Adaptive Diffusion Models for EEG Super-Resolution in Epilepsy Diagnosis | Tong Zhou et.al. | 2407.03089 | null |
2024-07-03 | Data Overfitting for On-Device Super-Resolution with Dynamic Algorithm and Compiler Co-Design | Gen Li et.al. | 2407.02813 | link |
2024-07-02 | Adversarial Magnification to Deceive Deepfake Detection through Super Resolution | Davide Alessandro Coccomini et.al. | 2407.02670 | link |
2024-07-01 | Broadband planar electromagnetic hyper-lens with uniform magnification in air | Ran Sun et.al. | 2407.02532 | null |
2024-07-04 | Real HSI-MSI-PAN image dataset for the hyperspectral/multi-spectral/panchromatic image fusion and super-resolution fields | Shuangliang Li et.al. | 2407.02387 | link |
2024-07-02 | Efficient Stochastic Differential Equation for DEM Super Resolution with Void Filling | Tongtong Zhang et.al. | 2407.01908 | null |
2024-07-01 | DiffIR2VR-Zero: Zero-Shot Video Restoration with Diffusion-based Image Restoration Models | Chang-Han Yeh et.al. | 2407.01519 | link |
2024-07-02 | Preserving Full Degradation Details for Blind Image Super-Resolution | Hongda Liu et.al. | 2407.01299 | link |
2024-07-01 | DaBiT: Depth and Blur informed Transformer for Joint Refocusing and Super-Resolution | Crispian Morris et.al. | 2407.01230 | null |
2024-06-28 | ASSR-NeRF: Arbitrary-Scale Super-Resolution on Voxel Grid for High-Quality Radiance Fields Reconstruction | Ding-Jiun Huang et.al. | 2406.20066 | null |
2024-06-28 | Neural Differentiable Modeling with Diffusion-Based Super-resolution for Two-Dimensional Spatiotemporal Turbulence | Xiantao Fan et.al. | 2406.20047 | null |
2024-06-28 | CSAKD: Knowledge Distillation with Cross Self-Attention for Hyperspectral and Multispectral Image Fusion | Chih-Chung Hsu et.al. | 2406.19666 | link |
2024-06-28 | Efficient Event Stream Super-Resolution with Recursive Multi-Branch Fusion | Quanmin Liang et.al. | 2406.19640 | link |
2024-06-27 | Shoulder of Dust Rings Formed by Planet-disk Interactions | Jiaqing Bi et.al. | 2406.19438 | null |
2024-06-27 | Super-resolution imaging using super-oscillatory diffractive neural networks | Hang Chen et.al. | 2406.19126 | null |
2024-06-26 | Spatial-temporal Hierarchical Reinforcement Learning for Interpretable Pathology Image Super-Resolution | Wenting Chen et.al. | 2406.18310 | link |
2024-06-30 | V2X Sidelink Positioning in FR1: From Ray-Tracing and Channel Estimation to Bayesian Tracking | Yu Ge et.al. | 2406.17950 | null |
2024-06-25 | Burst Image Super-Resolution with Base Frame Selection | Sanghyun Kim et.al. | 2406.17869 | null |
2024-06-25 | A Near-Field Super-Resolution Network for Accelerating Antenna Characterization | Yuchen Gu et.al. | 2406.17244 | null |
2024-06-24 | DaLPSR: Leverage Degradation-Aligned Language Prompt for Real-World Image Super-Resolution | Aiwen Jiang et.al. | 2406.16477 | link |
2024-06-24 | Suppressing Uncertainties in Degradation Estimation for Blind Super-Resolution | Junxiong Lin et.al. | 2406.16459 | null |
2024-06-24 | Improving Generative Adversarial Networks for Video Super-Resolution | Daniel Wen et.al. | 2406.16359 | null |
2024-06-23 | Mamba-based Light Field Super-Resolution with Efficient Subspace Scanning | Ruisheng Gao et.al. | 2406.16083 | null |
2024-06-23 | Gridless Parameter Estimation in Partly Calibrated Rectangular Arrays | Tianyi Liu et.al. | 2406.16041 | null |
2024-06-23 | Learning Accurate and Enriched Features for Stereo Image Super-Resolution | Hu Gao et.al. | 2406.16001 | link |
2024-06-21 | A Generative Machine Learning Approach for Improving Precipitation from Earth System Models | Philipp Hess et.al. | 2406.15026 | null |
2024-06-20 | Zero-Shot Image Denoising for High-Resolution Electron Microscopy | Xuanyu Tian et.al. | 2406.14264 | link |
2024-06-19 | IG-CFAT: An Improved GAN-Based Framework for Effectively Exploiting Transformers in Real-World Image Super-Resolution | Alireza Aghelan et.al. | 2406.13815 | link |
2024-06-19 | Enhance the Image: Super Resolution using Artificial Intelligence in MRI | Ziyu Li et.al. | 2406.13625 | null |
2024-06-19 | EvTexture: Event-driven Texture Enhancement for Video Super-Resolution | Dachun Kai et.al. | 2406.13457 | link |
2024-06-19 | Super-resolution 3D tomography of vector near-fields in dielectric resonators | Bingbing Zhu et.al. | 2406.13171 | null |
2024-06-18 | Structured Detection for Simultaneous Super-Resolution and Optical Sectioning in Laser Scanning Microscopy | Alessandro Zunino et.al. | 2406.12542 | link |
2024-06-18 | LFMamba: Light Field Image Super-Resolution with State Space Model | Wang xia et.al. | 2406.12463 | null |
2024-06-17 | A Dictionary Based Approach for Removing Out-of-Focus Blur | Uditangshu Aurangabadkar et.al. | 2406.11330 | link |
2024-06-16 | Geometric Distortion Guided Transformer for Omnidirectional Image Super-Resolution | Cuixin Yang et.al. | 2406.10869 | null |
2024-06-14 | SatDiffMoE: A Mixture of Estimation Method for Satellite Image Super-resolution with Latent Diffusion Models | Zhaoxu Luo et.al. | 2406.10225 | null |
2024-06-14 | GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors | Xiqian Yu et.al. | 2406.10111 | null |
2024-06-14 | Exact Sparse Representation Recovery in Signal Demixing and Group BLASSO | Marcello Carioni et.al. | 2406.09922 | null |
2024-06-14 | Bayesian Conditioned Diffusion Models for Inverse Problems | Alper Güngör et.al. | 2406.09768 | null |
2024-06-13 | Near-Field Multiuser Communications based on Sparse Arrays | Kangjian Chen et.al. | 2406.09238 | null |
2024-06-13 | SR-CACO-2: A Dataset for Confocal Fluorescence Microscopy Image Super-Resolution | Soufiane Belharbi et.al. | 2406.09168 | link |
2024-06-13 | Microparticle-assisted 2D super resolution virtual image modeling | Arlen Bekirov et.al. | 2406.09060 | null |
2024-06-13 | Blind Super-Resolution via Meta-learning and Markov Chain Monte Carlo Simulation | Jingyuan Xia et.al. | 2406.08896 | link |
2024-06-12 | $\texttt{DiffLense}$ : A Conditional Diffusion Model for Super-Resolution of Gravitational Lensing Data | Pranath Reddy et.al. | 2406.08442 | null |
2024-06-12 | DDR: Exploiting Deep Degradation Response as Flexible Image Descriptor | Juncheng Wu et.al. | 2406.08377 | link |
2024-06-14 | One-Step Effective Diffusion Network for Real-World Image Super-Resolution | Rongyuan Wu et.al. | 2406.08177 | link |
2024-06-11 | Image Neural Field Diffusion Models | Yinbo Chen et.al. | 2406.07480 | null |
2024-06-11 | Redefining Automotive Radar Imaging: A Domain-Informed 1D Deep Learning Approach for High-Resolution and Efficient Performance | Ruxin Zheng et.al. | 2406.07399 | null |
2024-06-12 | Towards Realistic Data Generation for Real-World Super-Resolution | Long Peng et.al. | 2406.07255 | null |
2024-06-10 | 2DQuant: Low-bit Post-Training Quantization for Image Super-Resolution | Kai Liu et.al. | 2406.06649 | link |
2024-06-10 | Inter-slice Super-resolution of Magnetic Resonance Images by Pre-training and Self-supervised Fine-tuning | Xin Wang et.al. | 2406.05974 | null |
2024-06-09 | Binarized Diffusion Model for Image Super-Resolution | Zheng Chen et.al. | 2406.05723 | link |
2024-06-07 | M2NO: Multiresolution Operator Learning with Multiwavelet-based Algebraic Multigrid Method | Zhihao Li et.al. | 2406.04822 | null |
2024-06-06 | M&M VTO: Multi-Garment Virtual Try-On and Editing | Luyang Zhu et.al. | 2406.04542 | link |
2024-06-06 | Enhancing Weather Predictions: Super-Resolution via Deep Diffusion Models | Jan Martinů et.al. | 2406.04099 | null |
2024-06-06 | Vectorized Conditional Neural Fields: A Framework for Solving Time-dependent Parametric Partial Differential Equations | Jan Hagnberger et.al. | 2406.03919 | link |
2024-06-07 | Enhanced Semantic Segmentation Pipeline for WeatherProof Dataset Challenge | Nan Zhang et.al. | 2406.03799 | link |
2024-06-05 | SuperFormer: Volumetric Transformer Architectures for MRI Super-Resolution | Cristhian Forigua et.al. | 2406.03359 | link |
2024-06-01 | CoNO: Complex Neural Operator for Continous Dynamical Physical Systems | Karn Tiwari et.al. | 2406.02597 | null |
2024-06-04 | ReLUs Are Sufficient for Learning Implicit Neural Representations | Joseph Shenouda et.al. | 2406.02529 | link |
2024-06-05 | Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation | Clement Chadebec et.al. | 2406.02347 | link |
2024-06-03 | L-MAGIC: Language Model Assisted Generation of Images with Coherence | Zhipeng Cai et.al. | 2406.01843 | link |
2024-06-03 | PolyCLEAN: When Högbom meets Bayes – Fast Super-Resolution Imaging with Bayesian MAP Estimation | Adrian Jarret et.al. | 2406.01342 | link |
2024-06-03 | Arctic Sea Ice Image Super-Resolution Based on Multi-Scale Convolution and Dual-Gating Mechanism | Zhaomin Fang et.al. | 2406.01240 | null |
2024-06-02 | Stealing Image-to-Image Translation Models With a Single Query | Nurit Spingarn-Eliezer et.al. | 2406.00828 | null |
2024-06-02 | Multidimensional optical singularities and their applications | Soon Wei Daniel Lim et.al. | 2406.00784 | null |
2024-06-02 | W-Net: A Facial Feature-Guided Face Super-Resolution Network | Hao Liu et.al. | 2406.00676 | null |
2024-06-04 | SuperGaussian: Repurposing Video Models for 3D Super Resolution | Yuan Shen et.al. | 2406.00609 | null |
2024-06-01 | GLCAN: Global-Local Collaborative Auxiliary Network for Local Learning | Feiyu Zhu et.al. | 2406.00446 | null |
2024-06-01 | SpikeMM: Flexi-Magnification of High-Speed Micro-Motions | Baoyue Zhang et.al. | 2406.00383 | null |
2024-06-01 | Hybrid attention structure preserving network for reconstruction of under-sampled OCT images | Zezhao Guo et.al. | 2406.00279 | null |
2024-05-31 | Climate Variable Downscaling with Conditional Normalizing Flows | Christina Winkler et.al. | 2405.20719 | null |
2024-05-30 | Can No-Reference Quality-Assessment Methods Serve as Perceptual Losses for Super-Resolution? | Egor Kashkarov et.al. | 2405.20392 | null |
2024-05-30 | All-In-One Medical Image Restoration via Task-Adaptive Routing | Zhiwen Yang et.al. | 2405.19769 | link |
2024-05-30 | MAE-GAN: A Novel Strategy for Simultaneous Super-resolution Reconstruction and Denoising of Post-stack Seismic Profile | Wenshuo Yu et.al. | 2405.19767 | null |
2024-05-29 | Reconstructing Interpretable Features in Computational Super-Resolution microscopy via Regularized Latent Search | Marzieh Gheisari et.al. | 2405.19112 | null |
2024-05-29 | Single image super-resolution based on trainable feature matching attention network | Qizhou Chen et.al. | 2405.18872 | link |
2024-05-29 | Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory Matching | Yasi Zhang et.al. | 2405.18816 | link |
2024-05-28 | Towards a Sampling Theory for Implicit Neural Representations | Mahrokh Najaf et.al. | 2405.18410 | null |
2024-05-28 | Hyperspectral and multispectral image fusion with arbitrary resolution through self-supervised representations | Ting Wang et.al. | 2405.17818 | null |
2024-05-27 | Fast Samplers for Inverse Problems in Iterative Refinement Models | Kushagra Pandey et.al. | 2405.17673 | link |
2024-05-27 | Does Diffusion Beat GAN in Image Super Resolution? | Denis Kuznedelev et.al. | 2405.17261 | link |
2024-06-02 | PatchScaler: An Efficient Patch-Independent Diffusion Model for Super-Resolution | Yong Liu et.al. | 2405.17158 | link |
2024-05-27 | Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models | Cristina N. Vasconcelos et.al. | 2405.16759 | null |
2024-05-26 | Looks Too Good To Be True: An Information-Theoretic Analysis of Hallucinations in Generative Restoration Models | Regev Cohen et.al. | 2405.16475 | null |
2024-05-25 | BOLD: Boolean Logic Deep Learning | Van Minh Nguyen et.al. | 2405.16339 | null |
2024-05-24 | Visible-frequency hyperbolic plasmon polaritons in a natural van der Waals crystal | Giacomo Venturi et.al. | 2405.15420 | null |
2024-05-29 | Stochastic super-resolution for Gaussian microtextures | Emile Pierret et.al. | 2405.15399 | null |
2024-05-24 | Blaze3DM: Marry Triplane Representation with Diffusion for 3D Medical Inverse Problem Solving | Jia He et.al. | 2405.15241 | null |
2024-05-23 | Universal Robustness via Median Randomized Smoothing for Real-World Super-Resolution | Zakariya Chaouai et.al. | 2405.14934 | null |
2024-05-24 | Fast-DDPM: Fast Denoising Diffusion Probabilistic Models for Medical Image-to-Image Generation | Hongxu Jiang et.al. | 2405.14802 | link |
2024-05-23 | Stimulated Raman-induced Beam Focusing | Minhaeng Cho et.al. | 2405.14240 | null |
2024-05-22 | Perceptual Fairness in Image Restoration | Guy Ohayon et.al. | 2405.13805 | null |
2024-05-22 | HR-INR: Continuous Space-Time Video Super-Resolution via Event Camera | Yunfan Lu et.al. | 2405.13389 | null |
2024-05-20 | Hierarchical Neural Operator Transformer with Learnable Frequency-aware Loss Prior for Arbitrary-scale Super-resolution | Xihaier Luo et.al. | 2405.12202 | null |
2024-05-18 | HR Human: Modeling Human Avatars with Triangular Mesh and High-Resolution Textures from Videos | Qifeng Chen et.al. | 2405.11270 | null |
2024-05-17 | AdaWaveNet: Adaptive Wavelet Network for Time Series Analysis | Han Yu et.al. | 2405.11124 | null |
2024-05-27 | Infrared Image Super-Resolution via Lightweight Information Split Network | Shijie Liu et.al. | 2405.10561 | null |
2024-05-16 | RGB Guided ToF Imaging System: A Survey of Deep Learning-based Methods | Xin Qiao et.al. | 2405.10357 | null |
2024-05-16 | Bilateral Event Mining and Complementary for Event Stream Super-Resolution | Zhilin Huang et.al. | 2405.10037 | link |
2024-05-16 | Frequency-Domain Refinement with Multiscale Diffusion for Super Resolution | Xingjian Wang et.al. | 2405.10014 | null |
2024-05-16 | IRSRMamba: Infrared Image Super-Resolution via Mamba-based Wavelet Transform Feature Modulation Model | Yongsong Huang et.al. | 2405.09873 | link |
2024-05-15 | Perception- and Fidelity-aware Reduced-Reference Super-Resolution Image Quality Assessment | Xinying Lin et.al. | 2405.09472 | null |
2024-05-15 | Low-Complexity Joint Azimuth-Range-Velocity Estimation for Integrated Sensing and Communication with OFDM Waveform | Jun Zhang et.al. | 2405.09443 | null |
2024-05-15 | Large coordinate kernel attention network for lightweight image super-resolution | Fangwei Hao et.al. | 2405.09353 | null |
2024-05-14 | NAFRSSR: a Lightweight Recursive Network for Efficient Stereo Image Super-Resolution | Yihong Chen et.al. | 2405.08423 | link |
2024-05-23 | Exploring the Low-Pass Filtering Behavior in Image Super-Resolution | Haoyu Deng et.al. | 2405.07919 | link |
2024-05-13 | CDFormer:When Degradation Prediction Embraces Diffusion Model for Blind Image Super-Resolution | Qingguo Liu et.al. | 2405.07648 | link |
2024-05-11 | Semantic Guided Large Scale Factor Remote Sensing Image Super-resolution with Generative Diffusion Prior | Ce Wang et.al. | 2405.07044 | link |
2024-05-11 | Efficient Real-world Image Super-Resolution Via Adaptive Directional Gradient Convolution | Long Peng et.al. | 2405.07023 | link |
2024-05-11 | Incorporating Degradation Estimation in Light Field Spatial Super-Resolution | Zeyu Xiao et.al. | 2405.07012 | null |
2024-05-11 | Super-Resolving Blurry Images with Events | Chi Zhang et.al. | 2405.06918 | null |
2024-05-10 | Machine learning for reconstruction of polarity inversion lines from solar filaments | V. Kisielius et.al. | 2405.06293 | link |
2024-05-07 | Single-antenna 3D localization with nonseparable toroidal pulses | Ren Wang et.al. | 2405.05979 | null |
2024-05-17 | Diag2Diag: Multimodal super-resolution diagnostics for physics discovery with application to fusion | Azarakhsh Jalalvand et.al. | 2405.05908 | null |
2024-05-09 | Multi-Level Feature Fusion Network for Lightweight Stereo Image Super-Resolution | Yunxiang Li et.al. | 2405.05497 | link |
2024-05-08 | HMANet: Hybrid Multi-Axis Aggregation Network for Image Super-Resolution | Shu-Chuan Chu et.al. | 2405.05001 | link |
2024-05-08 | Frequency-Assisted Mamba for Remote Sensing Image Super-Resolution | Yi Xiao et.al. | 2405.04964 | link |
2024-05-08 | Teacher-Student Network for Real-World Face Super-Resolution with Progressive Embedding of Edge Information | Zhilei Liu et.al. | 2405.04778 | null |
2024-05-07 | An Advanced Features Extraction Module for Remote Sensing Image Super-Resolution | Naveed Sultan et.al. | 2405.04595 | null |
2024-05-07 | CloudDiff: Super-resolution ensemble retrieval of cloud properties for all day using the generative diffusion model | Haixia Xiao et.al. | 2405.04483 | null |
2024-05-08 | Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer | Zhuoyi Yang et.al. | 2405.04312 | link |
2024-05-06 | All-in-One Deep Learning Framework for MR Image Reconstruction | Geunu Jeong et.al. | 2405.03684 | null |
2024-05-11 | DVMSR: Distillated Vision Mamba for Efficient Super-Resolution | Xiaoyan Lei et.al. | 2405.03008 | link |
2024-05-05 | I $^3$ Net: Inter-Intra-slice Interpolation Network for Medical Slice Synthesis | Haofei Song et.al. | 2405.02857 | null |
2024-05-05 | Antenna Failure Resilience: Deep Learning-Enabled Robust DOA Estimation with Single Snapshot Sparse Arrays | Ruxin Zheng et.al. | 2405.02788 | link |
2024-05-01 | Reference-Free Image Quality Metric for Degradation and Reconstruction Artifacts | Han Cui et.al. | 2405.02208 | null |
2024-05-03 | Self-Supervised Learning for Real-World Super-Resolution from Dual and Multiple Zoomed Observations | Zhilu Zhang et.al. | 2405.02171 | link |
2024-05-03 | Optical skyrmions from metafibers | Tiantian He et.al. | 2405.01962 | null |
2024-05-05 | TRAMBA: A Hybrid Transformer and Mamba Architecture for Practical Audio and Bone Conduction Speech Super Resolution and Enhancement on Mobile and Wearable Platforms | Yueyuan Sui et.al. | 2405.01242 | null |
2024-05-02 | Single Image Super-Resolution Based on Global-Local Information Synergy | Nianzu Qiao et.al. | 2405.01085 | null |
2024-05-01 | Detail-Enhancing Framework for Reference-Based Image Super-Resolution | Zihan Wang et.al. | 2405.00431 | null |
2024-04-30 | Replica-assisted super-resolution fluorescence imaging in scattering media | Tengfei Wu et.al. | 2404.19734 | null |
2024-05-04 | Towards Real-world Video Face Restoration: A New Benchmark | Ziyan Chen et.al. | 2404.19500 | null |
2024-04-30 | Super-resolution by converting evanescent waves in microsphere to propagating and transfer function from its surface to nano-jet | Y. Ben-Aryeh et.al. | 2404.19333 | null |
2024-04-29 | Swin2-MoSE: A New Single Image Super-Resolution Model for Remote Sensing | Leonardo Rossi et.al. | 2404.18924 | link |
2024-04-27 | Generative Diffusion-based Downscaling for Climate | Robbie A. Watt et.al. | 2404.17752 | link |
2024-04-26 | Federated Learning for Blind Image Super-Resolution | Brian B. Moser et.al. | 2404.17670 | null |
2024-04-26 | One-Shot Image Restoration | Deborah Pereg et.al. | 2404.17426 | null |
2024-04-26 | Simultaneous Tri-Modal Medical Image Fusion and Super-Resolution using Conditional Diffusion Model | Yushen Xu et.al. | 2404.17357 | link |
2024-04-25 | Deep learning-based blind image super-resolution with iterative kernel reconstruction and noise estimation | Hasan F. Ates et.al. | 2404.16564 | link |
2024-04-25 | Real-Time 4K Super-Resolution of Compressed AVIF Images. AIS 2024 Challenge Survey | Marcos V. Conde et.al. | 2404.16484 | link |
2024-04-25 | Latent Modulated Function for Computational Optimal Continuous Image Representation | Zongyao He et.al. | 2404.16451 | link |
2024-04-25 | Cross-sensor super-resolution of irregularly sampled Sentinel-2 time series | Aimi Okabayashi et.al. | 2404.16409 | link |
2024-04-24 | Deep RAW Image Super-Resolution. A NTIRE 2024 Challenge Survey | Marcos V. Conde et.al. | 2404.16223 | link |
2024-04-26 | A Survey on Visual Mamba | Hanwei Zhang et.al. | 2404.15956 | null |
2024-04-24 | Super-resolution imaging based on active optical intensity interferometry | Lu-Chuan Liu et.al. | 2404.15685 | null |
2024-04-26 | A Dynamic Kernel Prior Model for Unsupervised Blind Image Super-Resolution | Zhixiong Yang et.al. | 2404.15620 | link |
2024-04-23 | Super-resolved CARS by coherent image scanning | Anna Zhitnitsky et.al. | 2404.15094 | null |
2024-04-23 | Canalization-based super-resolution imaging using a single van der Waals layer | Jiahua Duan et.al. | 2404.14876 | null |
2024-04-22 | SwinFuSR: an image fusion-inspired model for RGB-guided thermal image super-resolution | Cyprien Arnold et.al. | 2404.14533 | link |
2024-04-29 | ALMA 2D Super-resolution Imaging of Taurus-Auriga Protoplanetary Disks: Probing Statistical Properties of Disk Substructures | Masayuki Yamaguchi et.al. | 2404.13570 | null |
2024-04-26 | SEGSRNet for Stereo-Endoscopic Image Super-Resolution and Surgical Instrument Segmentation | Mansoor Hayat et.al. | 2404.13330 | null |
2024-04-19 | Single-sample image-fusion upsampling of fluorescence lifetime images | Valentin Kapitány et.al. | 2404.13102 | null |
2024-04-19 | A New Multi-Picture Architecture for Learned Video Deinterlacing and Demosaicing with Parallel Deformable Convolution and Self-Attention Blocks | Ronglei Ji et.al. | 2404.13018 | link |
2024-04-19 | Cross-modal Diffusion Modelling for Super-resolved Spatial Transcriptomics | Xiaofei Wang et.al. | 2404.12973 | null |
2024-04-18 | VideoGigaGAN: Towards Detail-rich Video Super-Resolution | Yiran Xu et.al. | 2404.12388 | null |
2024-04-19 | Multichannel-GaAsP-photomultiplier-based fiber bundle ISM-STED microscope | Marcus Babin et.al. | 2404.12370 | null |
2024-04-18 | Multiphoton super-resolution imaging via virtual structured illumination | Sumin Lim et.al. | 2404.11849 | null |
2024-04-18 | Partial Large Kernel CNNs for Efficient Super-Resolution | Dongheon Lee et.al. | 2404.11848 | link |
2024-04-17 | Training Transformer Models by Wavelet Losses Improves Quantitative and Visual Performance in Single Image Super-Resolution | Cansu Korkmaz et.al. | 2404.11273 | link |
2024-04-16 | Uncertainty Quantification of Super-Resolution Flow Mapping in Liquid Metals using Ultrasound Localization Microscopy | David Weik et.al. | 2404.10840 | null |
2024-04-16 | Efficient Conditional Diffusion Model with Probability Flow Sampling for Image Super-resolution | Yutao Yuan et.al. | 2404.10688 | link |
2024-04-16 | The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report | Bin Ren et.al. | 2404.10343 | link |
2024-04-16 | SRGS: Super-Resolution 3D Gaussian Splatting | Xiang Feng et.al. | 2404.10318 | link |
2024-04-17 | OmniSSR: Zero-shot Omnidirectional Image Super-Resolution using Stable Diffusion Model | Runyi Li et.al. | 2404.10312 | null |
2024-04-16 | Little Pilot is Needed for Channel Estimation with Integrated Super-Resolution Sensing and Communication | Jingran Xu et.al. | 2404.10233 | null |
2024-04-15 | The Problem Of Image Super-Resolution, Denoising And Some Image Restoration Methods In Deep Learning Models | Ngoc-Giau Pham et.al. | 2404.09817 | null |
2024-04-15 | NTIRE 2024 Challenge on Image Super-Resolution ( $\times$ 4): Methods and Results | Zheng Chen et.al. | 2404.09790 | link |
2024-04-15 | MTKD: Multi-Teacher Knowledge Distillation for Image Super-Resolution | Yuxuan Jiang et.al. | 2404.09571 | null |
2024-04-15 | Super-resolution of biomedical volumes with 2D supervision | Cheng Jiang et.al. | 2404.09425 | null |
2024-04-15 | Differentiable Search for Finding Optimal Quantization Strategy | Lianqiang Li et.al. | 2404.08010 | null |
2024-04-11 | Terahertz imaging super-resolution for documental heritage diagnostics | Danae Antunez Vazquez et.al. | 2404.07798 | null |
2024-04-11 | Near-field reconstruction of periodic structures with superimposed illumination | Jue Wang et.al. | 2404.07763 | null |
2024-04-11 | Deep learning-driven pulmonary arteries and veins segmentation reveals demography-associated pulmonary vasculature anatomy | Yuetan Chu et.al. | 2404.07671 | link |
2024-04-10 | Unfolding ADMM for Enhanced Subspace Clustering of Hyperspectral Images | Xianlu Li et.al. | 2404.07112 | link |
2024-04-09 | Dynamic Deep Learning Based Super-Resolution For The Shallow Water Equations | Maximilian Witte et.al. | 2404.06400 | null |
2024-04-09 | Fortifying Fully Convolutional Generative Adversarial Networks for Image Super-Resolution Using Divergence Measures | Arkaprabha Basu et.al. | 2404.06294 | null |
2024-04-09 | LIPT: Latency-aware Image Processing Transformer | Junbo Qiao et.al. | 2404.06075 | null |
2024-04-09 | Space-Time Video Super-resolution with Neural Operator | Yuantong Zhang et.al. | 2404.06036 | null |
2024-04-09 | Diffusion-Based Point Cloud Super-Resolution for mmWave Radar Data | Kai Luan et.al. | 2404.06012 | null |
2024-04-09 | Resolution enhancement of SOHO/MDI Magnetograms | Ying Qin et.al. | 2404.05968 | null |
2024-04-08 | Nanomolecular OLED Pixelization Enabling Electroluminescent Metasurfaces | Tommaso Marcato et.al. | 2404.05336 | null |
2024-04-07 | Gull: A Generative Multifunctional Audio Codec | Yi Luo et.al. | 2404.04947 | null |
2024-04-07 | Efficient Learnable Collaborative Attention for Single Image Super-Resolution | Yigang Zhao Chaowei Zheng et.al. | 2404.04922 | null |
2024-04-07 | CycleINR: Cycle Implicit Neural Representation for Arbitrary-Scale Volumetric Super-Resolution of Medical Data | Wei Fang et.al. | 2404.04878 | null |
2024-04-07 | Effect of active loop extrusion on the two-contact correlations in the interphase chromosome | Dmitry Starkov et.al. | 2404.04853 | null |
2024-04-07 | Rethinking Diffusion Model for Multi-Contrast MRI Super-Resolution | Guangyuan Li et.al. | 2404.04785 | link |
2024-04-06 | Collaborative Feedback Discriminative Propagation for Video Super-Resolution | Hao Li et.al. | 2404.04745 | link |
2024-04-06 | Power-Efficient Image Storage: Leveraging Super Resolution Generative Adversarial Network for Sustainable Compression and Reduced Carbon Footprint | Ashok Mondal et.al. | 2404.04642 | null |
2024-04-06 | PointSAGE: Mesh-independent superresolution approach to fluid flow predictions | Rajat Sarkar et.al. | 2404.04615 | null |
2024-04-03 | Translation-based Video-to-Video Synthesis | Pratim Saha et.al. | 2404.04283 | null |
2024-04-05 | Real-GDSR: Real-World Guided DSM Super-Resolution via Edge-Enhancing Residual Network | Daniel Panangian et.al. | 2404.03930 | null |
2024-04-05 | The ESPRIT algorithm under high noise: Optimal error scaling and noisy super-resolution | Zhiyan Ding et.al. | 2404.03885 | null |
2024-04-04 | AdaBM: On-the-Fly Adaptive Bit Mapping for Image Super-Resolution | Cheeun Hong et.al. | 2404.03296 | link |
2024-04-04 | CSR-dMRI: Continuous Super-Resolution of Diffusion MRI with Anatomical Structure-assisted Implicit Neural Representation Learning | Ruoyou Wu et.al. | 2404.03209 | null |
2024-04-04 | Quantum enhanced mechanical rotation sensing using wavefront photonic gears | Ofir Yesharim et.al. | 2404.02797 | null |
2024-04-03 | GenN2N: Generative NeRF2NeRF Translation | Xiangyue Liu et.al. | 2404.02788 | null |
2024-04-03 | Two-Stage Super-Resolution Simulation Method for Three-Dimensional Flow Fields Around Buildings for Real-Time Prediction of Urban Micrometeorology | Yuki Yasuda et.al. | 2404.02631 | link |
2024-04-03 | Knowledge Distillation with Multi-granularity Mixture of Priors for Image Super-Resolution | Simiao Li et.al. | 2404.02573 | null |
2024-04-02 | Super-Resolution Analysis for Landfill Waste Classification | Matias Molina et.al. | 2404.01790 | null |
2024-04-03 | AddSR: Accelerating Diffusion-based Blind Super-Resolution with Adversarial Diffusion Distillation | Rui Xie et.al. | 2404.01717 | null |
2024-04-04 | Beyond Image Super-Resolution for Image Recognition with Task-Driven Perceptual Loss | Jaeha Kim et.al. | 2404.01692 | link |
2024-04-02 | RefQSR: Reference-based Quantization for Image Super-Resolution Networks | Hongjae Lee et.al. | 2404.01690 | null |
2024-04-01 | Video Interpolation with Diffusion Models | Siddhant Jain et.al. | 2404.01203 | null |
2024-04-01 | DiSR-NeRF: Diffusion-Guided View-Consistent Super-Resolution NeRF | Jie Long Lee et.al. | 2404.00874 | link |
2024-04-02 | DRCT: Saving Image Super-resolution away from Information Bottleneck | Chih-Chung Hsu et.al. | 2404.00722 | link |
2024-03-31 | DeeDSR: Towards Real-World Image Super-Resolution via Degradation-Aware Stable Diffusion | Chunyang Bi et.al. | 2404.00661 | null |
2024-03-30 | SGDFormer: One-stage Transformer-based Architecture for Cross-Spectral Stereo Image Guided Denoising | Runmin Zhang et.al. | 2404.00349 | null |
2024-03-30 | Exploiting Self-Supervised Constraints in Image Super-Resolution | Gang Wu et.al. | 2404.00260 | link |
2024-04-03 | Burst Super-Resolution with Diffusion Models for Improving Perceptual Quality | Kyotaro Tokoro et.al. | 2403.19428 | link |
2024-03-28 | Structured illumination microscopy with extreme ultraviolet pulses | R. Mincigrucci et.al. | 2403.19382 | null |
2024-03-27 | Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D | Mukund Varma T et.al. | 2403.18922 | null |
2024-03-27 | Breaking the Limitations with Sparse Inputs by Variational Frameworks (BLIss) in Terahertz Super-Resolution 3D Reconstruction | Yiyao Zhang et.al. | 2403.18776 | link |
2024-03-27 | Ship in Sight: Diffusion Models for Ship-Image Super Resolution | Luigi Sigillo et.al. | 2403.18370 | link |
2024-03-27 | Super-Resolution of SOHO/MDI Magnetograms of Solar Active Regions Using SDO/HMI Data and an Attention-Aided Convolutional Neural Network | Chunhui Xu et.al. | 2403.18302 | null |
2024-03-26 | Climate Downscaling: A Deep-Learning Based Super-resolution Model of Precipitation Data with Attention Block and Skip Connections | Chia-Hao Chiang et.al. | 2403.17847 | null |
2024-03-26 | Algorithmic unfolding for image reconstruction and localization problems in fluorescence microscopy | Silvia Bonettini et.al. | 2403.17506 | link |
2024-03-26 | SeNM-VAE: Semi-Supervised Noise Modeling with Hierarchical Variational Autoencoder | Dihan Zheng et.al. | 2403.17502 | link |
2024-03-26 | Building Bridges across Spatial and Temporal Resolutions: Reference-Based Super-Resolution via Change Priors and Conditional Diffusion Model | Runmin Dong et.al. | 2403.17460 | link |
2024-03-25 | A Study in Dataset Pruning for Image Super-Resolution | Brian B. Moser et.al. | 2403.17083 | null |
2024-03-25 | Learning Spatial Adaptation and Temporal Coherence in Diffusion Models for Video Super-Resolution | Zhikai Chen et.al. | 2403.17000 | null |
2024-03-25 | Self-STORM: Deep Unrolled Self-Supervised Learning for Super-Resolution Microscopy | Yair Ben Sahel et.al. | 2403.16974 | link |
2024-03-25 | Self-Adaptive Reality-Guided Diffusion for Artifact-Free Super-Resolution | Qingping Zheng et.al. | 2403.16643 | null |
2024-03-25 | Residual Dense Swin Transformer for Continuous Depth-Independent Ultrasound Imaging | Jintong Hu et.al. | 2403.16384 | link |
2024-03-24 | CFAT: Unleashing TriangularWindows for Image Super-resolution | Abhisek Ray et.al. | 2403.16143 | link |
2024-03-23 | Adaptive Super Resolution For One-Shot Talking-Head Generation | Luchuan Song et.al. | 2403.15944 | link |
2024-03-23 | Time-series Initialization and Conditioning for Video-agnostic Stabilization of Video Super-Resolution using Recurrent Networks | Hiroshi Mori et.al. | 2403.15832 | null |
2024-03-20 | Using Super-Resolution Imaging for Recognition of Low-Resolution Blurred License Plates: A Comparative Study of Real-ESRGAN, A-ESRGAN, and StarSRGAN | Ching-Hsiang Wang et.al. | 2403.15466 | null |
2024-03-22 | Deep Generative Model based Rate-Distortion for Image Downscaling Assessment | Yuanbang Liang et.al. | 2403.15139 | link |
2024-03-21 | Hyperspectral Neural Radiance Fields | Gerry Chen et.al. | 2403.14839 | null |
2024-03-21 | QSMDiff: Unsupervised 3D Diffusion Models for Quantitative Susceptibility Mapping | Zhuang Xiong et.al. | 2403.14070 | null |
2024-03-20 | Multi-photon super-linear image scanning microscopy using upconversion nanoparticles | Yao Wang et.al. | 2403.13436 | null |
2024-03-20 | Efficient scene text image super-resolution with semantic guidance | LeoWu TomyEnrique et.al. | 2403.13330 | link |
2024-03-18 | Super-resolution of ultrafast pulses via spectral inversion | Michał Lipka et.al. | 2403.12746 | null |
2024-03-18 | A Wideband Distributed Massive MIMO Channel Sounder for Communication and Sensing | Michiel Sandra et.al. | 2403.11856 | null |
2024-03-18 | PAON: A New Neuron Model using Padé Approximants | Onur Keleş et.al. | 2403.11791 | null |
2024-03-18 | CasSR: Activating Image Power for Real-World Image Super-Resolution | Haolan Chen et.al. | 2403.11451 | null |
2024-03-18 | VmambaIR: Visual State Space Model for Image Restoration | Yuan Shi et.al. | 2403.11423 | link |
2024-03-17 | Adaptive Semantic-Enhanced Denoising Diffusion Probabilistic Model for Remote Sensing Image Super-Resolution | Jialu Sui et.al. | 2403.11078 | link |
2024-03-16 | Boosting Flow-based Generative Super-Resolution Models via Learned Prior | Li-Yuan Tsao et.al. | 2403.10988 | link |
2024-03-16 | Learning Dual-Level Deformable Implicit Representation for Real-World Scale Arbitrary Super-Resolution | Zhiheng Li et.al. | 2403.10925 | null |
2024-03-15 | A General Method to Incorporate Spatial Information into Loss Functions for GAN-based Super-resolution Models | Xijun Wang et.al. | 2403.10589 | null |
2024-03-15 | Solving General Noisy Inverse Problem via Posterior Sampling: A Policy Gradient Viewpoint | Haoyue Tang et.al. | 2403.10585 | null |
2024-03-15 | FeatUp: A Model-Agnostic Framework for Features at Any Resolution | Stephanie Fu et.al. | 2403.10516 | link |
2024-03-21 | Deep Bi-directional Attention Network for Image Super-Resolution Quality Assessment | Yixiao Li et.al. | 2403.10406 | null |
2024-03-15 | Arbitrary-Scale Image Generation and Upsampling using Latent Diffusion Model and Implicit Neural Decoder | Jinseok Kim et.al. | 2403.10255 | null |
2024-03-15 | BlindDiff: Empowering Degradation Modelling in Diffusion Models for Blind Image Super-Resolution | Feng Li et.al. | 2403.10211 | link |
2024-03-15 | SemanticHuman-HD: High-Resolution Semantic Disentangled 3D Human Generation | Peng Zheng et.al. | 2403.10166 | null |
2024-03-14 | Deep unfolding Network for Hyperspectral Image Super-Resolution with Automatic Exposure Correction | Yuan Fang et.al. | 2403.09096 | null |
2024-03-13 | PFStorer: Personalized Face Restoration and Super-Resolution | Tuomas Varanka et.al. | 2403.08436 | null |
2024-03-13 | Activating Wider Areas in Image Super-Resolution | Cheng Cheng et.al. | 2403.08330 | null |
2024-03-07 | Accelerating multigrid solver with generative super-resolution | Francisco Holguin et.al. | 2403.07936 | null |
2024-03-19 | Towards Model Extraction Attacks in GAN-Based Image Translation via Domain Shift Mitigation | Di Mi et.al. | 2403.07673 | null |
2024-03-12 | Learning Correction Errors via Frequency-Self Attention for Blind Image Super-Resolution | Haochen Sun et.al. | 2403.07390 | null |
2024-03-12 | Efficient Diffusion Model for Image Restoration by Residual Shifting | Zongsheng Yue et.al. | 2403.07319 | link |
2024-03-12 | Learning Hierarchical Color Guidance for Depth Map Super-Resolution | Runmin Cong et.al. | 2403.07290 | null |
2024-03-11 | Galaxy Morphologies Revealed with Subaru HSC and Super-Resolution Techniques II: Environmental Dependence of Galaxy Mergers at z~2-5 | Takatoshi Shibuya et.al. | 2403.06729 | null |
2024-03-11 | Breaking Abbe’s diffraction limit with harmonic deactivation microscopy | Kevin Murzyn et.al. | 2403.06617 | null |
2024-03-11 | Multi-Scale Implicit Transformer with Re-parameterize for Arbitrary-Scale Super-Resolution | Jinchen Zhu et.al. | 2403.06536 | null |
2024-03-10 | Implicit Image-to-Image Schrodinger Bridge for CT Super-Resolution and Denoising | Yuang Wang et.al. | 2403.06069 | null |
2024-03-12 | Decoupled Data Consistency with Diffusion Purification for Image Restoration | Xiang Li et.al. | 2403.06054 | link |
2024-03-15 | CoNFiLD: Conditional Neural Field Latent Diffusion Model Generating Spatiotemporal Turbulence | Pan Du et.al. | 2403.05940 | null |
2024-03-09 | Adaptive Multi-modal Fusion of Spatially Variant Kernel Refinement with Diffusion Model for Blind Image Super-Resolution | Junxiong Lin et.al. | 2403.05808 | null |
2024-03-08 | An End-to-End Pipeline Perspective on Video Streaming in Best-Effort Networks: A Survey and Tutorial | Leonardo Peroni et.al. | 2403.05192 | null |
2024-03-08 | CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion | Wendi Zheng et.al. | 2403.05121 | null |
2024-03-08 | XPSR: Cross-modal Priors for Diffusion-based Image Super-Resolution | Yunpeng Qu et.al. | 2403.05049 | link |
2024-03-07 | Super-resolution on network telemetry time series | Fengchen Gong et.al. | 2403.04165 | null |
2024-03-11 | Identifying Black Holes Through Space Telescopes and Deep Learning | Yeqi Fang et.al. | 2403.03821 | null |
2024-03-05 | Low-Res Leads the Way: Improving Generalization for Super-Resolution by Self-Supervised Learning | Haoyu Chen et.al. | 2403.02601 | null |
2024-03-04 | UB-FineNet: Urban Building Fine-grained Classification Network for Open-access Satellite Images | Zhiyi He et.al. | 2403.02132 | null |
2024-03-03 | APISR: Anime Production Inspired Real-World Anime Super-Resolution | Boyang Wang et.al. | 2403.01598 | link |
2024-03-02 | Extrapolated Plug-and-Play Three-Operator Splitting Methods for Nonconvex Optimization with Applications to Image Restoration | Zhongming Wu et.al. | 2403.01144 | link |
2024-03-02 | Text-guided Explorable Image Super-resolution | Kanchana Vaishnavi Gandikota et.al. | 2403.01124 | null |
2024-03-07 | ODVista: An Omnidirectional Video Dataset for super-resolution and Quality Enhancement Tasks | Ahmed Telili et.al. | 2403.00604 | link |
2024-02-29 | SeD: Semantic-Aware Discriminator for Image Super-Resolution | Bingchen Li et.al. | 2402.19387 | link |
2024-02-29 | 3D Super-resolution Optical Fluctuation Imaging with Temporal Focusing two-photon excitation | Pawel Szczypkowski et.al. | 2402.19338 | null |
2024-03-15 | CAMixerSR: Only Details Need More “Attention” | Yan Wang et.al. | 2402.19289 | link |
2024-02-29 | Training Generative Image Super-Resolution Models by Wavelet-Domain Losses Enables Better Control of Artifacts | Cansu Korkmaz et.al. | 2402.19215 | link |
2024-02-29 | Unsupervised Learning of High-resolution Light Field Imaging via Beam Splitter-based Hybrid Lenses | Jianxin Lei et.al. | 2402.19020 | null |
2024-03-01 | Navigating Beyond Dropout: An Intriguing Solution Towards Generalizable Image Super Resolution | Hongjun Wang et.al. | 2402.18929 | link |
2024-02-29 | LoLiSRFlow: Joint Single Image Low-light Enhancement and Super-resolution via Cross-scale Transformer-based Conditional Flow | Ziyu Yue et.al. | 2402.18871 | null |
2024-02-28 | Self-Supervised Learning in Electron Microscopy: Towards a Foundation Model for Advanced Image Analysis | Bashir Kazimi et.al. | 2402.18286 | null |
2024-02-28 | Misalignment-Robust Frequency Distribution Loss for Image Transformation | Zhangkai Ni et.al. | 2402.18192 | link |
2024-03-01 | Data-driven nonlinear turbulent flow scaling with Buckingham Pi variables | Kai Fukami et.al. | 2402.17990 | null |
2024-02-27 | Thermodynamics-informed super-resolution of scarce temporal dynamics data | Carlos Bermejo-Barbanoj et.al. | 2402.17506 | null |
2024-02-27 | Spatial super-resolution in nanosensing with blinking emitters | Alexander Mikhalychev et.al. | 2402.17391 | null |
2024-02-27 | Enhancing Hyperspectral Images via Diffusion Model and Group-Autoencoder Super-resolution Network | Zhaoyang Wang et.al. | 2402.17285 | link |
2024-02-27 | SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution | Chengcheng Wang et.al. | 2402.17133 | link |
2024-02-26 | Generative AI in Vision: A Survey on Models, Metrics and Applications | Gaurav Raut et.al. | 2402.16369 | null |
2024-02-25 | Photon-counting CT using a Conditional Diffusion Model for Super-resolution and Texture-preservation | Christopher Wiedeman et.al. | 2402.16212 | null |
2024-02-25 | ARIN: Adaptive Resampling and Instance Normalization for Robust Blind Inpainting of Dunhuang Cave Paintings | Alexander Schmidt et.al. | 2402.16188 | null |
2024-02-25 | XAI-based gait analysis of patients walking with Knee-Ankle-Foot orthosis using video cameras | Arnav Mishra et.al. | 2402.16175 | null |
2024-02-24 | HIR-Diff: Unsupervised Hyperspectral Image Restoration Via Improved Diffusion Models | Li Pang et.al. | 2402.15865 | link |
2024-02-24 | A Heterogeneous Dynamic Convolutional Neural Network for Image Super-resolution | Chunwei Tian et.al. | 2402.15704 | link |
2024-02-24 | DeepLight: Reconstructing High-Resolution Observations of Nighttime Light With Multi-Modal Remote Sensing Data | Lixian Zhang et.al. | 2402.15659 | link |
2024-02-23 | Towards complete all-optical emission control of high-harmonic generation from solids | Pieter J. van Essen et.al. | 2402.15375 | null |
2024-02-21 | Generative Adversarial Models for Extreme Downscaling of Climate Datasets | Guiye Li et.al. | 2402.14049 | null |
2024-02-23 | Scene Prior Filtering for Depth Map Super-Resolution | Zhengxue Wang et.al. | 2402.13876 | null |
2024-02-21 | Cas-DiffCom: Cascaded diffusion model for infant longitudinal super-resolution 3D medical image completion | Lianghu Guo et.al. | 2402.13776 | null |
2024-02-20 | Diffusion Posterior Sampling is Computationally Intractable | Shivam Gupta et.al. | 2402.12727 | null |
2024-02-19 | Image Super-resolution Inspired Electron Density Prediction | Chenghan Li et.al. | 2402.12335 | link |
2024-02-19 | Regularization by denoising: Bayesian model and Langevin-within-split Gibbs sampling | Elhadji C. Faye et.al. | 2402.12292 | null |
2024-02-19 | FOD-Swin-Net: angular super resolution of fiber orientation distribution using a transformer-based deep model | Mateus Oliveira da Silva et.al. | 2402.11775 | link |
2024-02-25 | Low-power SNN-based audio source localisation using a Hilbert Transform spike encoding scheme | Saeid Haghighatshoar et.al. | 2402.11748 | link |
2024-02-17 | Hierarchical Prior-based Super Resolution for Point Cloud Geometry Compression | Dingquan Li et.al. | 2402.11250 | link |
2024-02-16 | Optimizing Skin Lesion Classification via Multimodal Data and Auxiliary Task Integration | Mahapara Khurshid et.al. | 2402.10454 | null |
2024-02-08 | Multi-Center Fetal Brain Tissue Annotation (FeTA) Challenge 2022 Results | Kelly Payette et.al. | 2402.09463 | null |
2024-02-14 | Neural Operators Meet Energy-based Theory: Operator Learning for Hamiltonian and Dissipative PDEs | Yusuke Tanaka et.al. | 2402.09018 | null |
2024-02-12 | Cosmology at the Field Level with Probabilistic Machine Learning | Adam Rouhiainen et.al. | 2402.07694 | null |
2024-02-12 | Trustworthy SR: Resolving Ambiguity in Image Super-resolution via Diffusion Models and Human Feedback | Cansu Korkmaz et.al. | 2402.07597 | null |
2024-02-12 | High-resolution Cryogenic Spectroscopy of Single Molecules in Nanoprinted Crystals | Mohammad Musavinezhad et.al. | 2402.07474 | null |
2024-02-09 | Copper phosphate micro-flowers coated with indocyanine green and iron oxide nanoparticles for in vivo localization optoacoustic tomography and magnetic actuation | Daniil Nozdriukhin et.al. | 2402.06749 | null |
2024-02-05 | Hybrid Neural Representations for Spherical Data | Hyomin Kim et.al. | 2402.05965 | null |
2024-02-07 | Arbitrary Scale Super-Resolution Assisted Lunar Crater Detection in Satellite Images | Atal Tewari et.al. | 2402.05068 | null |
2024-02-07 | Device Activity Detection and Channel Estimation for Millimeter-Wave Massive MIMO | Yinchuan Li et.al. | 2402.04704 | null |
2024-02-06 | Elastic wave imaging with Maxwell’s fish-eye lens | Liuxian Zhao et.al. | 2402.04285 | null |
2024-02-06 | 3D Volumetric Super-Resolution in Radiology Using 3D RRDB-GAN | Juhyung Ha et.al. | 2402.04171 | null |
2024-02-05 | Video Super-Resolution for Optimized Bitrate and Green Online Streaming | Vignesh V Menon et.al. | 2402.03513 | null |
2024-02-05 | See More Details: Efficient Image Super-Resolution by Experts Mining | Eduard Zamfir et.al. | 2402.03412 | link |
2024-01-25 | When Geoscience Meets Generative AI and Large Language Models: Foundations, Trends, and Future Challenges | Abdenour Hadid et.al. | 2402.03349 | null |
2024-02-05 | Instant square lattice structured illumination microscopy: an optimal strategy towards photon-saving and real-time super-resolution observation | Tianyu Zhao et.al. | 2402.02775 | null |
2024-02-02 | A Robust Super-resolution Gridless Imaging Framework for UAV-borne SAR Tomography | Silin Gao et.al. | 2402.01194 | null |
2024-02-01 | Diffusion-based Light Field Synthesis | Ruisheng Gao et.al. | 2402.00575 | null |
2024-01-31 | Improving Object Detection Quality in Football Through Super-Resolution Techniques | Karolina Seweryn et.al. | 2402.00163 | null |
2024-01-31 | Fully Data-Driven Model for Increasing Sampling Rate Frequency of Seismic Data using Super-Resolution Generative Adversarial Networks | Navid Gholizadeh et.al. | 2402.00153 | null |
2024-01-31 | Spatial-and-Frequency-aware Restoration method for Images based on Diffusion Models | Kyungsung Lee et.al. | 2401.17629 | null |
2024-01-30 | You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation | Mehdi Noroozi et.al. | 2401.17258 | null |
2024-01-30 | Ptycho-endoscopy on a lensless ultrathin fiber bundle tip | Pengming Song et.al. | 2401.17213 | null |
2024-01-30 | Deep 3D World Models for Multi-Image Super-Resolution Beyond Optical Flow | Luca Savant Aira et.al. | 2401.16972 | null |
2024-01-29 | Reconfigurable AI Modules Aided Channel Estimation and MIMO Detection | Xiangzhao Qin et.al. | 2401.16141 | null |
2024-01-29 | Bridging the Domain Gap: A Simple Domain Matching Method for Reference-based Image Super-Resolution in Remote Sensing | Jeongho Min et.al. | 2401.15944 | null |
2024-01-29 | Vision-Informed Flow Image Super-Resolution with Quaternion Spatial Modeling and Dynamic Flow Convolution | Qinglong Cao et.al. | 2401.15913 | null |
2024-01-28 | Towards Arbitrary-Scale Histopathology Image Super-resolution: An Efficient Dual-branch Framework via Implicit Self-texture Enhancement | Minghong Duan et.al. | 2401.15613 | null |
2024-01-31 | Wind speed super-resolution and validation: from ERA5 to CERRA via diffusion models | Fabio Merizzi et.al. | 2401.15469 | link |
2024-01-27 | Face to Cartoon Incremental Super-Resolution using Knowledge Distillation | Trinetra Devkatte et.al. | 2401.15366 | null |
2024-01-26 | From Blurry to Brilliant Detection: YOLOv5-Based Aerial Object Detection with Super Resolution | Ragib Amin Nihal et.al. | 2401.14661 | null |
2024-01-26 | Super Efficient Neural Network for Compression Artifacts Reduction and Super Resolution | Wen Ma et.al. | 2401.14641 | null |
2024-01-25 | Combined Generative and Predictive Modeling for Speech Super-resolution | Heming Wang et.al. | 2401.14269 | null |
2024-01-25 | Conditional Neural Video Coding with Spatial-Temporal Super-Resolution | Henan Wang et.al. | 2401.13959 | null |
2024-02-05 | Lumiere: A Space-Time Diffusion Model for Video Generation | Omer Bar-Tal et.al. | 2401.12945 | null |
2024-01-22 | Observation-Guided Meteorological Field Downscaling at Station Scale: A Benchmark and a New Method | Zili Liu et.al. | 2401.11960 | link |
2024-01-24 | LKFormer: Large Kernel Transformer for Infrared Image Super-Resolution | Feiwei Qin et.al. | 2401.11859 | link |
2024-01-22 | Simultaneous Blind Demixing and Super-resolution via Vectorized Hankel Lift | Haifeng Wang et.al. | 2401.11805 | null |
2024-01-18 | Inflation with Diffusion: Efficient Temporal Adaptation for Text-to-Video Super-Resolution | Xin Yuan et.al. | 2401.10404 | null |
2024-01-22 | 3D orientation super-resolution spatial-frequency-shift microscopy | Xiaowei Liu et.al. | 2401.09085 | null |
2024-01-17 | Efficient Image Super-Resolution via Symmetric Visual Attention Network | Chengxu Wu et.al. | 2401.08913 | null |
2024-01-16 | Robust DOA estimation using deep acoustic imaging | Adrian S. Roman et.al. | 2401.08717 | link |
2024-01-20 | Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis | Zhenhui Ye et.al. | 2401.08503 | link |
2024-01-16 | Physics-informed Meta-instrument for eXperiments (PiMiX) with applications to fusion energy | Zhehui Wang et.al. | 2401.08390 | null |
2024-01-18 | Transcending the Limit of Local Window: Advanced Super-Resolution Transformer with Adaptive Token Dictionary | Leheng Zhang et.al. | 2401.08209 | link |
2024-01-16 | The Devil is in the Details: Boosting Guided Depth Super-Resolution via Rethinking Cross-Modal Alignment and Aggregation | Xinni Jiang et.al. | 2401.08123 | link |
2024-01-26 | No-Clean-Reference Image Super-Resolution: Application to Electron Microscopy | Mohammad Khateri et.al. | 2401.08115 | null |
2024-01-15 | Sparsity-based background removal for STORM super-resolution images | Patris Valera et.al. | 2401.07746 | link |
2024-01-15 | Time-varying k-domain modulation around a point sink in time reversal cavity | Xin Liu et.al. | 2401.07535 | null |
2024-01-14 | City Scene Super-Resolution via Geometric Error Minimization | Zhengyang Lu et.al. | 2401.07272 | link |
2024-01-13 | Deep Blind Super-Resolution for Satellite Video | Yi Xiao et.al. | 2401.07139 | link |
2024-01-12 | Broad Yet Narrow: Super-resolution techniques to simulate electronic spectra of large molecular systems | Matthias Kick et.al. | 2401.06929 | null |
2024-01-15 | Video Super-Resolution Transformer with Masked Inter&Intra-Frame Attention | Xingyu Zhou et.al. | 2401.06312 | link |
2024-01-11 | Frequency-Time Diffusion with Neural Cellular Automata | John Kalkhof et.al. | 2401.06291 | null |
2024-01-11 | TriNeRFLet: A Wavelet Based Multiscale Triplane NeRF Representation | Rajaei Khatib et.al. | 2401.06191 | null |
2024-01-11 | Transforming Image Super-Resolution: A ConvFormer-based Efficient Approach | Gang Wu et.al. | 2401.05633 | link |
2024-01-10 | Quantum Inspired Microwave Phase Super-Resolution at Room Temperature | Leonid Vidro et.al. | 2401.05026 | null |
2024-01-08 | AGG: Amortized Generative 3D Gaussians for Single Image to 3D | Dejia Xu et.al. | 2401.04099 | null |
2024-01-08 | Sub-Rayleigh ghost imaging via structured illumination | Liming Li et.al. | 2401.03829 | null |
2024-01-08 | FMA-Net: Flow-Guided Dynamic Filtering and Iterative Feature Refinement with Multi-Attention for Joint Video Super-Resolution and Deblurring | Geunhyuk Youk et.al. | 2401.03707 | null |
2024-01-07 | Nanofabrication beyond optical diffraction limit: Optical driven assembly enabled by superlubricity | Liu Jiang-tao et.al. | 2401.03486 | null |
2024-01-05 | Super-Resolution Multi-Contrast Unbiased Eye Atlases With Deep Probabilistic Refinement | Ho Hin Lee et.al. | 2401.03060 | null |
2024-01-04 | Predicting Future States with Spatial Point Processes in Single Molecule Resolution Spatial Transcriptomics | Parisa Boodaghi Malidarreh et.al. | 2401.02564 | null |
2024-01-04 | What You See is What You GAN: Rendering Every Pixel for High-Fidelity Geometry in 3D GANs | Alex Trevithick et.al. | 2401.02411 | null |
2024-01-02 | Efficient Hybrid Zoom using Camera Fusion on Mobile Phones | Xiaotong Wu et.al. | 2401.01461 | null |
2024-01-02 | Noise-NeRF: Hide Information in Neural Radiance Fields using Trainable Noise | Qinglong Huang et.al. | 2401.01216 | null |
2023-12-30 | Improving the Stability of Diffusion Models for Content Consistent Super-Resolution | Lingchen Sun et.al. | 2401.00877 | link |
2024-03-18 | Exposure Bracketing is All You Need for Unifying Image Restoration and Enhancement Tasks | Zhilu Zhang et.al. | 2401.00766 | link |
2024-01-01 | Beyond Subspace Isolation: Many-to-Many Transformer for Light Field Image Super-resolution | Zeke Zexi Hu et.al. | 2401.00740 | null |
2024-02-06 | Diffusion Models, Image Super-Resolution And Everything: A Survey | Brian B. Moser et.al. | 2401.00736 | null |
2024-02-21 | Compressing Deep Image Super-resolution Models | Yuxuan Jiang et.al. | 2401.00523 | null |
2023-12-31 | UGPNet: Universal Generative Prior for Image Restoration | Hwayoon Lee et.al. | 2401.00370 | null |
2023-12-30 | Robust fluctuation-based super-resolution microscopy in a confocal architecture | Alexander Krupinski-Ptaszek et.al. | 2401.00261 | null |
2024-03-13 | Image Super-resolution Reconstruction Network based on Enhanced Swin Transformer via Alternating Aggregation of Local-Global Features | Yuming Huang et.al. | 2401.00241 | null |
2023-12-29 | Noise-free Optimization in Early Training Steps for Image Super-Resolution | MinKyu Lee et.al. | 2312.17526 | link |
2023-12-28 | Single particle algorithms to reveal cellular nanodomain organization | Pierre Parutto et.al. | 2312.17191 | null |
2024-01-02 | KeDuSR: Real-World Dual-Lens Super-Resolution via Kernel-Free Matching | Huanjing Yue et.al. | 2312.17050 | link |
2023-12-27 | Learning from small data sets: Patch-based regularizers in inverse problems for image reconstruction | Moritz Piening et.al. | 2312.16611 | null |
2023-12-27 | Image Restoration by Denoising Diffusion Models with Iteratively Preconditioned Guidance | Tomer Garber et.al. | 2312.16519 | link |
2023-12-30 | A Survey on Super Resolution for video Enhancement Using GAN | Ankush Maity et.al. | 2312.16471 | null |
2023-12-27 | Learn From Orientation Prior for Radiograph Super-Resolution: Orientation Operator Transformer | Yongsong Huang et.al. | 2312.16455 | null |
2023-12-24 | BSRAW: Improving Blind RAW Image Super-Resolution | Marcos V. Conde et.al. | 2312.15487 | link |
2023-12-24 | Perception-Distortion Balanced Super-Resolution: A Multi-Objective Optimization Perspective | Lingchen Sun et.al. | 2312.15408 | link |
2023-12-22 | Spectrally Decomposed Diffusion Models for Generative Turbulence Recovery | Mohammed Sardar et.al. | 2312.15029 | null |
2023-12-22 | DDistill-SR: Reparameterized Dynamic Distillation Network for Lightweight Image Super-Resolution | Yan Wang et.al. | 2312.14551 | link |
2024-03-18 | HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models | Hayk Manukyan et.al. | 2312.14091 | link |
2023-12-21 | Super-resolution of THz time-domain images based on low-rank representation | Marina Ljubenovic et.al. | 2312.13820 | null |
2023-12-21 | BAE-Net: A Low complexity and high fidelity Bandwidth-Adaptive neural network for speech super-resolution | Guochen Yu et.al. | 2312.13722 | link |
2023-12-21 | A Comprehensive End-to-End Computer Vision Framework for Restoration and Recognition of Low-Quality Engineering Drawings | Lvyang Yang et.al. | 2312.13620 | link |
2023-12-20 | EPNet: An Efficient Pyramid Network for Enhanced Single-Image Super-Resolution with Reduced Computational Requirements | Xin Xu et.al. | 2312.13396 | null |
2024-03-19 | ECAMP: Entity-centered Context-aware Medical Vision Language Pre-training | Rongsheng Wang et.al. | 2312.13316 | link |
2023-12-20 | A 3D super-resolution of wind fields via physics-informed pixel-wise self-attention generative adversarial network | Takuya Kurihana et.al. | 2312.13212 | null |
2023-12-20 | Joint Range-Velocity-Azimuth Estimation for OFDM-Based Integrated Sensing and Communication | Zelin Hu et.al. | 2312.13154 | null |
2024-03-18 | Learning Exhaustive Correlation for Spectral Super-Resolution: Where Spatial-Spectral Attention Meets Linear Dependence | Hongyuan Wang et.al. | 2312.12833 | null |
2023-12-20 | How Good Are Deep Generative Models for Solving Inverse Problems? | Shichong Peng et.al. | 2312.12691 | null |
2023-12-19 | ZS-SRT: An Efficient Zero-Shot Super-Resolution Training Method for Neural Radiance Fields | Xiang Feng et.al. | 2312.12122 | null |
2023-12-19 | Neural operator-based super-fidelity: A warm-start approach for accelerating steady-state simulations | Xu-Hui Zhou et.al. | 2312.11842 | null |
2023-12-18 | TIP: Text-Driven Image Processing with Semantic and Restoration Instructions | Chenyang Qi et.al. | 2312.11595 | null |
2023-12-20 | FastSR-NeRF: Improving NeRF Efficiency on Consumer Devices with A Simple Super-Resolution Pipeline | Chien-Yu Lin et.al. | 2312.11537 | null |
2023-12-18 | Disentangling photon rings beyond General Relativity with future radio-telescope arrays | Raúl Carballo-Rubio et.al. | 2312.11351 | null |
2024-03-19 | Self-Supervised Learning for Image Super-Resolution and Deblurring | Jérémy Scanvic et.al. | 2312.11232 | link |
2023-12-18 | Experimental 3D super-localization with Laguerre-Gaussian modes | Chenyu Hu et.al. | 2312.11044 | null |
2023-12-16 | Image Restoration Through Generalized Ornstein-Uhlenbeck Bridge | Conghan Yue et.al. | 2312.10299 | link |
2023-12-18 | TMP: Temporal Motion Propagation for Online Video Super-Resolution | Zhengqiang Zhang et.al. | 2312.09909 | link |
2024-03-03 | Diffusion-based Blind Text Image Super-Resolution | Yuzhe Zhang et.al. | 2312.08886 | link |
2023-12-14 | Guided Image Restoration via Simultaneous Feature and Image Guided Fusion | Xinyi Liu et.al. | 2312.08853 | null |
2023-12-14 | CartoMark: a benchmark dataset for map pattern recognition and 1 map content retrieval with machine intelligence | Xiran Zhou et.al. | 2312.08600 | null |
2023-12-13 | EventAid: Benchmarking Event-aided Image/Video Enhancement Algorithms with Real-captured Hybrid Dataset | Peiqi Duan et.al. | 2312.08220 | null |
2023-12-13 | Toward Real World Stereo Image Super-Resolution via Hybrid Degradation Model and Discriminator for Implied Stereo Image Information | Yuanbo Zhou et.al. | 2312.07934 | link |
2023-12-20 | CoIE: Chain-of-Instruct Editing for Multi-Attribute Face Manipulation | Zhenduo Zhang et.al. | 2312.07879 | null |
2023-12-13 | Video Dynamics Prior: An Internal Learning Approach for Robust Video Enhancements | Gaurav Shrivastava et.al. | 2312.07835 | null |
2024-01-19 | Semantic Lens: Instance-Centric Semantic Alignment for Video Super-Resolution | Qi Tang et.al. | 2312.07823 | link |
2023-12-12 | Super-Resolution on Rotationally Scanned Photoacoustic Microscopy Images Incorporating Scanning Prior | Kai Pan et.al. | 2312.07226 | link |
2023-12-12 | Hyper-Restormer: A General Hyperspectral Image Restoration Transformer for Remote Sensing Imaging | Yo-Yu Lai et.al. | 2312.07016 | null |
2023-12-14 | TULIP: Transformer for Upsampling of LiDAR Point Cloud | Bin Yang et.al. | 2312.06733 | link |
2023-12-11 | Photorealistic Video Generation with Diffusion Models | Agrim Gupta et.al. | 2312.06662 | null |
2023-12-11 | Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution | Shangchen Zhou et.al. | 2312.06640 | null |
2023-12-11 | Non-iterative Methods in Inhomogeneous Background Inverse Scattering Imaging Problem Assisted by Swin Transformer Network | Naike Du et.al. | 2312.06302 | null |
2023-12-11 | Hundred-Kilobyte Lookup Tables for Efficient Single-Image Super-Resolution | Binxiao Huang et.al. | 2312.06101 | link |
2024-03-20 | Precipitation Downscaling with Spatiotemporal Video Diffusion | Prakhar Srivastava et.al. | 2312.06071 | null |
2023-12-10 | Study of Multiuser Multiple-Antenna Wireless Communications Systems Based on Super-Resolution Arrays | S. Pinto et.al. | 2312.06033 | null |
2023-12-10 | Transformer-based Selective Super-Resolution for Efficient Image Refinement | Tianyi Zhang et.al. | 2312.05803 | link |
2023-12-13 | SGNet: Structure Guided Network via Gradient-Frequency Awareness for Depth Map Super-Resolution | Zhengxue Wang et.al. | 2312.05799 | link |
2023-12-09 | Iterative Token Evaluation and Refinement for Real-World Super-Resolution | Chaofeng Chen et.al. | 2312.05616 | link |
2023-12-07 | AniRes2D: Anisotropic Residual-enhanced Diffusion for 2D MR Super-Resolution | Zejun Wu et.al. | 2312.04385 | null |
Remote Sensing
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-11-26 | Machine Learning and Multi-source Remote Sensing in Forest Carbon Stock Estimation: A Review | Autumn Nguyen et.al. | 2411.17624 | null |
2024-11-26 | MRIFE: A Mask-Recovering and Interactive-Feature-Enhancing Semantic Segmentation Network For Relic Landslide Detection | Juefei He et.al. | 2411.17167 | null |
2024-11-26 | ΩSFormer: Dual-Modal Ω-like Super-Resolution Transformer Network for Cross-scale and High-accuracy Terraced Field Vectorization Extraction | Chang Li et.al. | 2411.17088 | null |
2024-11-26 | SatVision-TOA: A Geospatial Foundation Model for Coarse-Resolution All-Sky Remote Sensing Imagery | Caleb S. Spradlin et.al. | 2411.17000 | link |
2024-11-25 | CMAViT: Integrating Climate, Managment, and Remote Sensing Data for Crop Yield Estimation with Multimodel Vision Transformers | Hamid Kamangir et.al. | 2411.16989 | null |
2024-11-23 | Gradient-Guided Parameter Mask for Multi-Scenario Image Restoration Under Adverse Weather | Jilong Guo et.al. | 2411.16739 | link |
2024-11-25 | GeoFormer: A Multi-Polygon Segmentation Transformer | Maxim Khomiakov et.al. | 2411.16616 | link |
2024-11-25 | Coronal hole picoflare jets are the progenitors of both the fast and the Alfvénic slow solar wind | L. P. Chitta et.al. | 2411.16513 | null |
2024-11-24 | DRIVE: Dual-Robustness via Information Variability and Entropic Consistency in Source-Free Unsupervised Domain Adaptation | Ruiqiang Xiao et.al. | 2411.15976 | null |
2024-11-24 | PromptHSI: Universal Hyperspectral Image Restoration Framework for Composite Degradation | Chia-Ming Lee et.al. | 2411.15922 | null |
2024-11-26 | LRSAA: Large-scale Remote Sensing Image Target Recognition and Automatic Annotation | Wuzheng Dong et.al. | 2411.15808 | link |
2024-11-24 | Text-Guided Coarse-to-Fine Fusion Network for Robust Remote Sensing Visual Question Answering | Zhicheng Zhao et.al. | 2411.15770 | null |
2024-11-26 | AeroGen: Enhancing Remote Sensing Object Detection with Diffusion-Driven Data Generation | Datao Tang et.al. | 2411.15497 | null |
2024-11-22 | Improved Background Estimation for Gas Plume Identification in Hyperspectral Images | Scout Jarman et.al. | 2411.15378 | null |
2024-11-20 | Multimodal large language model for wheat breeding: a new exploration of smart breeding | Guofeng Yang et.al. | 2411.15203 | null |
2024-11-22 | Resolution-Adaptive Micro-Doppler Spectrogram for Human Activity Recognition | Do-Hyun Park et.al. | 2411.15057 | null |
2024-11-22 | Reconciling Semantic Controllability and Diversity for Remote Sensing Image Synthesis with Hybrid Semantic Embedding | Junde Liu et.al. | 2411.14781 | null |
2024-11-22 | Cross-Modal Pre-Aligned Method with Global and Local Information for Remote-Sensing Image and Text Retrieval | Zengbao Sun et.al. | 2411.14704 | null |
2024-11-21 | Uncertainty-Aware Regression for Socio-Economic Estimation via Multi-View Remote Sensing | Fan Yang et.al. | 2411.14119 | link |
2024-11-21 | Multitask Learning for SAR Ship Detection with Gaussian-Mask Joint Segmentation | Ming Zhao et.al. | 2411.13847 | null |
2024-11-23 | Adapting Vision Foundation Models for Robust Cloud Segmentation in Remote Sensing Images | Xuechao Zou et.al. | 2411.13127 | link |
2024-11-20 | Automatic marker-free registration based on similar tetrahedras for single-tree point clouds | Jing Ren et.al. | 2411.13069 | null |
2024-11-20 | Attentive Contextual Attention for Cloud Removal | Wenli Huang et.al. | 2411.13042 | link |
2024-11-23 | Machine learned reconstruction of tsunami dynamics from sparse observations | Edward McDugald et.al. | 2411.12948 | null |
2024-11-19 | Characterization of sea ice kinematics over oceanic eddies | Minki Kim et.al. | 2411.12926 | null |
2024-11-19 | Tree Species Classification using Machine Learning and 3D Tomographic SAR – a case study in Northern Europe | Colverd Grace et.al. | 2411.12897 | null |
2024-11-19 | Machine Learning Approaches on Crop Pattern Recognition a Comparative Analysis | Kazi Hasibul Kabir et.al. | 2411.12667 | null |
2024-11-18 | Terahertz Photonics on a Chip: Monolithically Integrated Terahertz Optoelectronics based on Quantum Well Structures | Yifan Zhao et.al. | 2411.12046 | null |
2024-11-16 | GeoGround: A Unified Large Vision-Language Model. for Remote Sensing Visual Grounding | Yue Zhou et.al. | 2411.11904 | link |
2024-11-18 | CCExpert: Advancing MLLM Capability in Remote Sensing Change Captioning with Difference-Aware Integration and a Foundational Dataset | Zhiming Wang et.al. | 2411.11360 | link |
2024-11-18 | Cuvis.Ai: An Open-Source, Low-Code Software Ecosystem for Hyperspectral Processing and Classification | Nathaniel Hanson et.al. | 2411.11324 | link |
2024-11-17 | Program Evaluation with Remotely Sensed Outcomes | Ashesh Rambachan et.al. | 2411.10959 | null |
2024-11-16 | Decentralized Localization of Distributed Antenna Array Elements Using an Evolutionary Algorithm | Matthew J. Dula et.al. | 2411.10907 | null |
2024-11-16 | Large Vision-Language Models for Remote Sensing Visual Question Answering | Surasakdi Siripong et.al. | 2411.10857 | null |
2024-11-15 | Remote-sensing based control of 3D magnetic fields using machine learning for in-operando applications | Miguel A. Cascales Sandoval et.al. | 2411.10374 | null |
2024-11-15 | DaYu: Data-Driven Model for Geostationary Satellite Observed Cloud Images Forecasting | Xujun Wei et.al. | 2411.10144 | null |
2024-11-15 | A Polarization Image Dehazing Method Based on the Principle of Physical Diffusion | Zhenjun Zhang et.al. | 2411.09924 | null |
2024-11-14 | Comparative Study of InGaAs and GaAsSb Nanowires for Room Temperature Operation of Avalanche Photodiodes at 1.55 μm | Shrivatch Sankar et.al. | 2411.09795 | null |
2024-11-14 | Adaptively Augmented Consistency Learning: A Semi-supervised Segmentation Framework for Remote Sensing | Hui Ye et.al. | 2411.09344 | null |
2024-11-14 | LHRS-Bot-Nova: Improved Multimodal Large Language Model for Remote Sensing Vision-Language Interpretation | Zhenshi Li et.al. | 2411.09301 | link |
2024-11-14 | Heuristical Comparison of Vision Transformers Against Convolutional Neural Networks for Semantic Segmentation on Remote Sensing Imagery | Ashim Dahal et.al. | 2411.09101 | link |
2024-11-13 | High-resolution optical and acoustic remote sensing datasets of the Puck Lagoon, Southern Baltic | Łukasz Janowski et.al. | 2411.08712 | null |
2024-11-13 | Slender Object Scene Segmentation in Remote Sensing Image Based on Learnable Morphological Skeleton with Segment Anything Model | Jun Xie et.al. | 2411.08592 | null |
2024-11-13 | Restoration algorithms and system performance evaluation for active imagers | Jerome Gilles et.al. | 2411.08291 | null |
2024-11-12 | CDXFormer: Boosting Remote Sensing Change Detection with Extended Long Short-Term Memory | Zhenkai Wu et.al. | 2411.07863 | link |
2024-11-12 | Towards Vision Mixture of Experts for Wildlife Monitoring on the Edge | Emmanuel Azuh Mensah et.al. | 2411.07834 | null |
2024-11-12 | Large-scale Remote Sensing Image Target Recognition and Automatic Annotation | Wuzheng Dong et.al. | 2411.07802 | link |
2024-11-12 | Kernel-based retrieval models for hyperspectral image data optimized with Kernel Flows | Zina-Sabrina Duma et.al. | 2411.07800 | null |
2024-11-12 | AdaSemiCD: An Adaptive Semi-Supervised Change Detection Method Based on Pseudo-Label Evaluation | Ran Lingyan et.al. | 2411.07758 | null |
2024-11-12 | ALANINE: A Novel Decentralized Personalized Federated Learning For Heterogeneous LEO Satellite Constellation | Liang Zhao et.al. | 2411.07752 | null |
2024-11-12 | Enhancing Ultra High Resolution Remote Sensing Imagery Analysis with ImageRAG | Zilun Zhang et.al. | 2411.07688 | null |
2024-11-12 | Quantum Information-Empowered Graph Neural Network for Hyperspectral Change Detection | Chia-Hsiang Lin et.al. | 2411.07608 | null |
2024-11-12 | Semantic segmentation on multi-resolution optical and microwave data using deep learning | Jai G Singla et.al. | 2411.07581 | null |
2024-11-11 | United Domain Cognition Network for Salient Object Detection in Optical Remote Sensing Images | Yanguang Sun et.al. | 2411.06703 | link |
2024-11-11 | Flight Demonstration and Model Validation of a Prototype Variable-Altitude Venus Aerobot | Jacob S. Izraelevitz et.al. | 2411.06643 | null |
2024-11-09 | Aquila-plus: Prompt-Driven Visual-Language Models for Pixel-Level Remote Sensing Image Understanding | Kaixuan Lu et.al. | 2411.06142 | null |
2024-11-09 | Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing | Kaixuan Lu et.al. | 2411.06091 | null |
2024-11-09 | Aquila: A Hierarchically Aligned Visual-Language Model for Enhanced Remote Sensing Image Comprehension | Kaixuan Lu et.al. | 2411.06074 | null |
2024-11-09 | Movable Antennas in Wireless Systems: A Tool for Connectivity or a New Security Threat? | Youssef Maghrebi et.al. | 2411.06028 | null |
2024-11-08 | Joint-Optimized Unsupervised Adversarial Domain Adaptation in Remote Sensing Segmentation with Prompted Foundation Model | Shuchang Lyu et.al. | 2411.05878 | link |
2024-11-05 | From Pixels to Prose: Advancing Multi-Modal Language Models for Remote Sensing | Xintian Sun et.al. | 2411.05826 | null |
2024-11-08 | STARS: Sensor-agnostic Transformer Architecture for Remote Sensing | Ethan King et.al. | 2411.05714 | null |
2024-11-08 | A Nerf-Based Color Consistency Method for Remote Sensing Images | Zongcheng Zuo et.al. | 2411.05557 | null |
2024-11-11 | Anticipatory Understanding of Resilient Agriculture to Climate | David Willmes et.al. | 2411.05219 | null |
2024-11-07 | In the Era of Prompt Learning with Vision-Language Models | Ankit Jha et.al. | 2411.04892 | null |
2024-11-07 | ESC-MISR: Enhancing Spatial Correlations for Multi-Image Super-Resolution in Remote Sensing | Zhihui Zhang et.al. | 2411.04706 | null |
2024-11-07 | DNN-based 3D Cloud Retrieval for Variable Solar Illumination and Multiview Spaceborne Imaging | Tamar Klein et.al. | 2411.04682 | null |
2024-11-07 | Population estimation using 3D city modelling and Carto2S datasets – A case study | Jai G Singla et.al. | 2411.04612 | null |
2024-11-07 | Uncertainty Prediction Neural Network (UpNet): Embedding Artificial Neural Network in Bayesian Inversion Framework to Quantify the Uncertainty of Remote Sensing Retrieval | Dasheng Fan et.al. | 2411.04556 | null |
2024-11-07 | Remote Sensing-Based Assessment of Economic Development | Yijian Pan et.al. | 2411.04396 | link |
2024-11-06 | Urban Flood Mapping Using Satellite Synthetic Aperture Radar Data: A Review of Characteristics, Approaches and Datasets | Jie Zhao et.al. | 2411.04153 | null |
2024-11-06 | An Edge Computing-Based Solution for Real-Time Leaf Disease Classification using Thermal Imaging | Públio Elon Correa da Silva et.al. | 2411.03835 | link |
2024-11-06 | Beyond Grid Data: Exploring Graph Neural Networks for Earth Observation | Shan Zhao et.al. | 2411.03223 | null |
2024-11-05 | EcoCropsAID: Economic Crops Aerial Image Dataset for Land Use Classification | Sangdaow Noppitak et.al. | 2411.02762 | null |
2024-11-05 | DDFAV: Remote Sensing Large Vision Language Models Dataset and Evaluation Benchmark | Haodong Li et.al. | 2411.02733 | link |
2024-11-05 | Super-resolution generalized eigenvalue method with truly sub-Nyquist sampling | Baoguo Liu et.al. | 2411.02700 | null |
2024-11-04 | Estimating the Number and Locations of Boundaries in Reverberant Environments with Deep Learning | Toros Arikan et.al. | 2411.02609 | null |
2024-11-04 | Theoretical performance limitations and filter selection based on Fisher information of a computational photonic crystal spectrometer for trace-gas retrieval | Marijn Siemons et.al. | 2411.02048 | null |
2024-11-04 | Shrinking the Giant : Quasi-Weightless Transformers for Low Energy Inference | Shashank Nag et.al. | 2411.01818 | null |
2024-11-06 | AIWR: Aerial Image Water Resource Dataset for Segmentation Analysis | Sangdaow Noppitak et.al. | 2411.01797 | null |
2024-11-03 | PreCM: The Padding-based Rotation Equivariant Convolution Mode for Semantic Segmentation | Xinyu Xu et.al. | 2411.01624 | null |
2024-11-03 | RS-MoE: Mixture of Experts for Remote Sensing Image Captioning and Visual Question Answering | Hui Lin et.al. | 2411.01595 | null |
2024-11-03 | Reconstructing MODIS Normalized Difference Snow Index Product on Greenland Ice Sheet Using Spatiotemporal Extreme Gradient Boosting Model | Fan Ye et.al. | 2411.01450 | null |
2024-11-01 | A Robust Super-Resolution Classifier by Nonlinear Optics | Ishan Darji et.al. | 2411.00953 | link |
2024-10-31 | Identifying Spatio-Temporal Drivers of Extreme Events | Mohamad Hakam Shams Eddin et.al. | 2410.24075 | link |
2024-10-31 | Localization, balance and affinity: a stronger multifaceted collaborative salient object detector in remote sensing images | Yakun Xie et.al. | 2410.23991 | null |
2024-10-31 | MV-CC: Mask Enhanced Video Model for Remote Sensing Change Caption | Ruixun Liu et.al. | 2410.23946 | link |
2024-10-31 | AllClear: A Comprehensive Dataset and Benchmark for Cloud Removal in Satellite Imagery | Hangyu Zhou et.al. | 2410.23891 | link |
2024-10-31 | Show Me What and Where has Changed? Question Answering and Grounding for Remote Sensing Change Detection | Ke Li et.al. | 2410.23828 | link |
2024-10-30 | Multilingual Vision-Language Pre-training for the Remote Sensing Domain | João Daniel Silva et.al. | 2410.23370 | link |
2024-10-30 | Enhancing Image Resolution: A Simulation Study and Sensitivity Analysis of System Parameters for Resourcesat-3S/3SA | Ankur Garg et.al. | 2410.23319 | null |
2024-11-03 | RSNet: A Light Framework for The Detection of Multi-scale Remote Sensing Targets | Hongyu Chen et.al. | 2410.23073 | null |
2024-10-30 | Bio-optical characterization using Ocean Colour Monitor (OCM) on board EOS-06 in coastal region | Anurag Gupta et.al. | 2410.22833 | null |
2024-10-30 | Latent Diffusion, Implicit Amplification: Efficient Continuous-Scale Super-Resolution for Remote Sensing Images | Hanlin Wu et.al. | 2410.22830 | null |
2024-10-31 | CrossEarth: Geospatial Vision Foundation Model for Domain Generalizable Remote Sensing Semantic Segmentation | Ziyang Gong et.al. | 2410.22629 | link |
2024-10-29 | Remote Sensing for Weed Detection and Control | Ishita Bansal et.al. | 2410.22554 | null |
2024-10-26 | MMM-RS: A Multi-modal, Multi-GSD, Multi-scene Remote Sensing Dataset and Benchmark for Text-to-Image Generation | Jialin Luo et.al. | 2410.22362 | null |
2024-10-29 | Shining a Light on Hurricane Damage Estimation via Nighttime Light Data: Pre-processing Matters | Nancy Thomas et.al. | 2410.22150 | null |
2024-10-29 | Hyperspectral Imaging-Based Perception in Autonomous Driving Scenarios: Benchmarking Baseline Semantic Segmentation Models | Imad Ali Shah et.al. | 2410.22101 | link |
2024-10-29 | Micro-Structures Graph-Based Point Cloud Registration for Balancing Efficiency and Accuracy | Rongling Zhang et.al. | 2410.21857 | null |
2024-10-28 | Mapping the Sun’s coronal magnetic field using the Zeeman effect | Thomas A. Schad et.al. | 2410.21568 | null |
2024-10-28 | A minimal model of the deep-convection lifecycle and its verification in remote-sensing observations | Tobias Bölle et.al. | 2410.20887 | null |
2024-10-25 | OReole-FM: successes and challenges toward billion-parameter foundation models for high-resolution satellite imagery | Philipe Dias et.al. | 2410.19965 | null |
2024-10-25 | GeoLLaVA: Efficient Fine-Tuned Vision-Language Models for Temporal Change Detection in Remote Sensing | Hosam Elgendy et.al. | 2410.19552 | link |
2024-10-25 | Spatioformer: A Geo-encoded Transformer for Large-Scale Plant Species Richness Prediction | Yiqing Guo et.al. | 2410.19256 | null |
2024-10-24 | CAMEL-Bench: A Comprehensive Arabic LMM Benchmark | Sara Ghaboura et.al. | 2410.18976 | link |
2024-10-24 | Advancements in Image Resolution: Super-Resolution Algorithm for Enhanced EOS-06 OCM-3 Data | Ankur Garg et.al. | 2410.18690 | null |
2024-10-24 | Precision Soil Quality Analysis Using Transformer-based Data Fusion Strategies: A Systematic Review | Mahdi Saki et.al. | 2410.18353 | null |
2024-10-23 | Nested active regions anchor the heliospheric current sheet and stall the reversal of the coronal magnetic field | Adam J. Finley et.al. | 2410.18244 | null |
2024-10-22 | Marine Microplastics and Infant Health | Xinming Du et.al. | 2410.17391 | null |
2024-10-15 | Advancements in Visual Language Models for Remote Sensing: Datasets, Capabilities, and Enhancement Techniques | Lijie Tao et.al. | 2410.17283 | link |
2024-10-24 | HyperspectralViTs: General Hyperspectral Models for On-board Remote Sensing | Vít Růžička et.al. | 2410.17248 | null |
2024-10-22 | PGCS: Physical Law embedded Generative Cloud Synthesis in Remote Sensing Images | Liying Xu et.al. | 2410.16955 | link |
2024-10-26 | Foundation Models for Remote Sensing and Earth Observation: A Survey | Aoran Xiao et.al. | 2410.16602 | link |
2024-10-22 | Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance | Zhangwei Gao et.al. | 2410.16261 | link |
2024-10-19 | Wave (from) Polarized Light Learning (WPLL) method: high resolution spatio-temporal measurements of water surface waves in laboratory setups | Noam Ginio et.al. | 2410.14988 | null |
2024-10-18 | Automated Road Extraction from Satellite Imagery Integrating Dense Depthwise Dilated Separable Spatial Pyramid Pooling with DeepLabV3+ | Arpan Mahara et.al. | 2410.14836 | null |
2024-10-17 | RemoteDet-Mamba: A Hybrid Mamba-CNN Network for Multi-modal Object Detection in Remote Sensing Images | Kejun Ren et.al. | 2410.13532 | null |
2024-10-17 | SAda-Net: A Self-Supervised Adaptive Stereo Estimation CNN For Remote Sensing Image Data | Dominik Hirner et.al. | 2410.13500 | link |
2024-10-26 | SiamSeg: Self-Training with Contrastive Learning for Unsupervised Domain Adaptation Semantic Segmentation in Remote Sensing | Bin Wang et.al. | 2410.13471 | link |
2024-10-17 | RescueADI: Adaptive Disaster Interpretation in Remote Sensing Images with Autonomous Agents | Zhuoran Liu et.al. | 2410.13384 | null |
2024-10-16 | Revisiting the hydrodynamic modulation of short surface waves by longer waves | Milan Curcic et.al. | 2410.12960 | link |
2024-10-14 | LCD-Net: A Lightweight Remote Sensing Change Detection Network Combining Feature Fusion and Gating Mechanism | Wenyu Liu et.al. | 2410.11580 | link |
2024-10-15 | MANet: Fine-Tuning Segment Anything Model for Multimodal Remote Sensing Semantic Segmentation | Xianping Ma et.al. | 2410.11160 | link |
2024-10-14 | Real-Time Localization and Bimodal Point Pattern Analysis of Palms Using UAV Imagery | Kangning Cui et.al. | 2410.11124 | link |
2024-10-11 | Advancements in Ship Detection: Comparative Analysis of Optical and Hyperspectral Sensors | Alyazia Al Shamsi et.al. | 2410.10888 | null |
2024-10-14 | Regression Model for Speckled Data with Extremely Variability | A. D. C. Nascimento et.al. | 2410.10482 | null |
2024-10-14 | LKASeg:Remote-Sensing Image Semantic Segmentation with Large Kernel Attention and Full-Scale Skip Connections | Xuezhi Xiang et.al. | 2410.10433 | null |
2024-10-14 | A Surface Adaptive First-Look Inspection Planner for Autonomous Remote Sensing of Open-Pit Mines | Vignesh Kottayam Viswanathan et.al. | 2410.10256 | null |
2024-10-15 | ChangeMinds: Multi-task Framework for Detecting and Describing Changes in Remote Sensing | Yuduo Wang et.al. | 2410.10047 | link |
2024-10-12 | Bi-temporal Gaussian Feature Dependency Guided Change Detection in Remote Sensing Images | Yi Xiao et.al. | 2410.09539 | null |
2024-10-11 | Underutilized land and sustainable development: effects on employment, economic output, and mitigation of CO2 emissions | Seymur Garibov et.al. | 2410.09136 | null |
2024-10-11 | Alberta Wells Dataset: Pinpointing Oil and Gas Wells from Satellite Imagery | Pratinav Seth et.al. | 2410.09032 | null |
2024-10-11 | Data-Driven Neural Estimation of Indirect Rate-Distortion Function | Zichao Yu et.al. | 2410.09018 | null |
2024-10-11 | Cross-Modal Bidirectional Interaction Model for Referring Remote Sensing Image Segmentation | Zhe Dong et.al. | 2410.08613 | link |
2024-10-10 | Exploring Foundation Models in Remote Sensing Image Change Detection: A Comprehensive Survey | Zihan Yu et.al. | 2410.07824 | null |
2024-10-10 | Enhancing Hyperspectral Image Prediction with Contrastive Learning in Low-Label Regime | Salma Haidar et.al. | 2410.07790 | null |
2024-10-11 | Self-Supervised Learning for Real-World Object Detection: a Survey | Alina Ciocarlan et.al. | 2410.07442 | null |
2024-10-09 | Segmenting objects with Bayesian fusion of active contour models and convnet priors | Przemyslaw Polewski et.al. | 2410.07421 | null |
2024-10-09 | Quantum Frequency Combs with Path Identity for Quantum Remote Sensing | D. A. R. Dalvit et.al. | 2410.07044 | null |
2024-10-08 | Prompting DirectSAM for Semantic Contour Extraction in Remote Sensing Images | Shiyu Miao et.al. | 2410.06194 | link |
2024-10-08 | GLRT-Based Metric Learning for Remote Sensing Object Retrieval | Linping Zhang et.al. | 2410.05773 | null |
2024-10-08 | Advancements in Road Lane Mapping: Comparative Fine-Tuning Analysis of Deep Learning-based Semantic Segmentation Methods Using Aerial Imagery | Xuanchen et.al. | 2410.05717 | null |
2024-10-08 | Remote Sensing Image Segmentation Using Vision Mamba and Multi-Scale Multi-Frequency Feature Fusion | Yice Cao et.al. | 2410.05624 | null |
2024-10-07 | A Deep Learning-Based Approach for Mangrove Monitoring | Lucas José Velôso de Souza et.al. | 2410.05443 | link |
2024-10-07 | IGroupSS-Mamba: Interval Group Spatial-Spectral Mamba for Hyperspectral Image Classification | Yan He et.al. | 2410.05100 | null |
2024-10-06 | Learning De-Biased Representations for Remote-Sensing Imagery | Zichen Tian et.al. | 2410.04546 | link |
2024-10-05 | Molecular Hydrogen Line Identifications in Solar Flares Observed by IRIS: Lower Atmospheric Structure from Radiometric Analysis | Sarah A. Jaeggli et.al. | 2410.04267 | null |
2024-10-04 | SpecSAR-Former: A Lightweight Transformer-based Network for Global LULC Mapping Using Integrated Sentinel-1 and Sentinel-2 | Hao Yu et.al. | 2410.03962 | null |
2024-10-07 | Machine Learning for Asymptomatic Ratoon Stunting Disease Detection With Freely Available Satellite Based Multispectral Imaging | Ethan Kane Waters et.al. | 2410.03141 | null |
2024-10-03 | Multiscale Multi-Type Spatial Bayesian Analysis of Wildfires and Population Change That Avoids MCMC and Approximating the Posterior Distribution | Shijie Zhou et.al. | 2410.02905 | null |
2024-10-03 | Exact Bayesian Inference for Multivariate Spatial Data of Any Size with Application to Air Pollution Monitoring | Madelyn Clinch et.al. | 2410.02655 | null |
2024-10-02 | SegEarth-OV: Towards Traning-Free Open-Vocabulary Segmentation for Remote Sensing Images | Kaiyu Li et.al. | 2410.01768 | link |
2024-10-02 | SinkSAM: A Monocular Depth-Guided SAM Framework for Automatic Sinkhole Segmentation | Osher Rafaeli et.al. | 2410.01473 | link |
2024-10-01 | Semantic Segmentation of Unmanned Aerial Vehicle Remote Sensing Images using SegFormer | Vlatko Spasev et.al. | 2410.01092 | null |
2024-10-01 | Deep Multimodal Fusion for Semantic Segmentation of Remote Sensing Earth Observation Data | Ivica Dimitrovski et.al. | 2410.00469 | null |
2024-09-30 | GrokLST: Towards High-Resolution Benchmark and Toolkit for Land Surface Temperature Downscaling | Qun Dai et.al. | 2409.19835 | link |
2024-09-29 | OrientedFormer: An End-to-End Transformer-Based Oriented Object Detector in Remote Sensing Images | Jiaqi Zhao et.al. | 2409.19648 | link |
2024-09-29 | Perspectives and challenges in bolide infrasound processing and interpretation: A focused review with case studies | Elizabeth A. Silber et.al. | 2409.19537 | null |
2024-09-28 | Contrastive ground-level image and remote sensing pre-training improves representation learning for natural world imagery | Andy V. Huynh et.al. | 2409.19439 | null |
2024-09-27 | Deep Learning Enhanced Quantum Holography with Undetected Photons | Weiru Fan et.al. | 2409.18887 | null |
2024-09-27 | Seeing the Invisible through Speckle Images | Weiru Fan et.al. | 2409.18815 | null |
2024-10-01 | A TextGCN-Based Decoding Approach for Improving Remote Sensing Image Captioning | Swadhin Das et.al. | 2409.18467 | null |
2024-09-26 | Find Rhinos without Finding Rhinos: Active Learning with Multimodal Imagery of South African Rhino Habitats | Lucia Gordon et.al. | 2409.18104 | link |
2024-09-26 | LDA-MIG Detectors for Maritime Targets in Nonhomogeneous Sea Clutter | Xiaoqiang Hua et.al. | 2409.17911 | null |
2024-09-26 | AgMTR: Agent Mining Transformer for Few-shot Segmentation in Remote Sensing | Hanbo Bi et.al. | 2409.17453 | link |
2024-09-30 | Improving satellite imagery segmentation using multiple Sentinel-2 revisits | Kartik Jindgar et.al. | 2409.17363 | link |
2024-09-25 | Sparsity, Regularization and Causality in Agricultural Yield: The Case of Paddy Rice in Peru | Rita Rocio Guzman-Lopez et.al. | 2409.17298 | null |
2024-09-25 | SEN12-WATER: A New Dataset for Hydrological Applications and its Benchmarking | Luigi Russo et.al. | 2409.17087 | null |
2024-09-25 | Sub-Meter Remote Sensing of Soil Moisture Using Portable L-band Microwave Radiometer | Runze Zhang et.al. | 2409.17024 | null |
2024-09-24 | Optical multi-beam steering and communication using integrated acousto-optics arrays | Qixuan Lin et.al. | 2409.16511 | null |
2024-09-24 | CDChat: A Large Multimodal Model for Remote Sensing Change Description | Mubashir Noman et.al. | 2409.16261 | link |
2024-09-24 | Fields of The World: A Machine Learning Benchmark Dataset For Global Agricultural Field Boundary Segmentation | Hannah Kerner et.al. | 2409.16252 | link |
2024-09-23 | Estimating the total energy content in escaping accelerated solar electron beams | Alexander W. James et.al. | 2409.15091 | null |
2024-09-23 | Functional control of anomalous reflection via engineered metagratings without polarization limitations | Jingwen Li et.al. | 2409.14663 | null |
2024-09-21 | Cloud Adversarial Example Generation for Remote Sensing Image Classification | Fei Ma et.al. | 2409.14240 | null |
2024-09-21 | A Sinkhorn Regularized Adversarial Network for Image Guided DEM Super-resolution using Frequency Selective Hybrid Graph Transformer | Subhajit Paul et.al. | 2409.14198 | null |
2024-09-20 | Exploring Fine-Grained Image-Text Alignment for Referring Remote Sensing Image Segmentation | Sen Lei et.al. | 2409.13637 | link |
2024-09-20 | Tackling fluffy clouds: field boundaries detection using time series of S2 and/or S1 imagery | Foivos I. Diakogiannis et.al. | 2409.13568 | link |
2024-09-20 | PointSAM: Pointly-Supervised Segment Anything Model for Remote Sensing Images | Nanqing Liu et.al. | 2409.13401 | link |
2024-09-20 | RingMo-Aerial: An Aerial Remote Sensing Foundation Model With A Affine Transformation Contrastive Learning | Wenhui Diao et.al. | 2409.13366 | null |
2024-09-20 | A Novel Adaptive Fine-Tuning Algorithm for Multimodal Models: Self-Optimizing Classification and Selection of High-Quality Datasets in Remote Sensing | Yi Ren et.al. | 2409.13345 | null |
2024-09-20 | Learning Visual Information Utility with PIXER | Yash Turkar et.al. | 2409.13151 | null |
2024-09-19 | Enhancing Perception of Key Changes in Remote Sensing Image Change Captioning | Cong Yang et.al. | 2409.12612 | link |
2024-09-18 | Applications of Knowledge Distillation in Remote Sensing: A Survey | Yassine Himeur et.al. | 2409.12111 | null |
2024-09-18 | Multi-Sensor Deep Learning for Glacier Mapping | Codruţ-Andrei Diaconu et.al. | 2409.12034 | null |
2024-09-25 | BRDF-NeRF: Neural Radiance Fields with Optical Satellite Images and BRDF Modelling | Lulin Zhang et.al. | 2409.12014 | link |
2024-09-18 | Photothermal Spectroscopy for Planetary Sciences: Mid-IR Absorption Made Easy | Christopher Cox et.al. | 2409.11626 | null |
2024-09-17 | Generalized Few-Shot Semantic Segmentation in Remote Sensing: Challenge and Benchmark | Clifford Broni-Bediako et.al. | 2409.11227 | link |
2024-09-17 | On-policy Actor-Critic Reinforcement Learning for Multi-UAV Exploration | Ali Moltajaei Farid et.al. | 2409.11058 | null |
2024-09-16 | Prompt-and-Transfer: Dynamic Class-aware Enhancement for Few-shot Segmentation | Hanbo Bi et.al. | 2409.10389 | null |
2024-09-22 | Performance of Human Annotators in Object Detection and Segmentation of Remotely Sensed Data | Roni Blushtein-Livnon et.al. | 2409.10272 | null |
2024-09-16 | BAFNet: Bilateral Attention Fusion Network for Lightweight Semantic Segmentation of Urban Remote Sensing Images | Wentao Wang et.al. | 2409.10269 | null |
2024-09-15 | Fuzzy logic for reconstructing arbitrary moments of multiplicity distributions | Anar Rustamov et.al. | 2409.09814 | null |
2024-09-15 | SITSMamba for Crop Classification based on Satellite Image Time Series | Xiaolei Qin et.al. | 2409.09673 | link |
2024-09-19 | Unsupervised Hyperspectral and Multispectral Image Blind Fusion Based on Deep Tucker Decomposition Network with Spatial-Spectral Manifold Learning | He Wang et.al. | 2409.09670 | link |
2024-09-14 | Detecting Looted Archaeological Sites from Satellite Image Time Series | Elliot Vincent et.al. | 2409.09432 | link |
2024-09-14 | NBBOX: Noisy Bounding Box Improves Remote Sensing Object Detection | Yechan Kim et.al. | 2409.09424 | null |
2024-09-14 | Investigation of Hierarchical Spectral Vision Transformer Architecture for Classification of Hyperspectral Imagery | Wei Liu et.al. | 2409.09244 | null |
2024-09-13 | Interactive Masked Image Modeling for Multimodal Object Detection in Remote Sensing | Minh-Duc Vu et.al. | 2409.08885 | null |
2024-09-13 | ChangeChat: An Interactive Model for Remote Sensing Change Analysis via Multimodal Instruction Tuning | Pei Deng et.al. | 2409.08582 | null |
2024-09-13 | VistaFormer: Scalable Vision Transformers for Satellite Image Time Series Segmentation | Ezra MacDonald et.al. | 2409.08461 | link |
2024-09-12 | Ultra-wideband integrated microwave photonic multi-parameter measurement system on thin-film lithium niobate | Yong Zheng et.al. | 2409.07817 | null |
2024-09-12 | Open-Vocabulary Remote Sensing Image Semantic Segmentation | Qinglong Cao et.al. | 2409.07683 | link |
2024-09-11 | The Mismeasure of Weather: Using Remotely Sensed Earth Observation Data in Economic Context | Anna Josephson et.al. | 2409.07506 | null |
2024-09-11 | Pushing the Limits of Vision-Language Models in Remote Sensing without Human Annotations | Keumgang Cha et.al. | 2409.07048 | null |
2024-09-11 | Insight Any Instance: Promptable Instance Segmentation for Remote Sensing Images | Xuexue Li et.al. | 2409.07022 | null |
2024-09-10 | PPMamba: A Pyramid Pooling Local Auxiliary SSM-Based Model for Remote Sensing Image Semantic Segmentation | Yin Hu et.al. | 2409.06309 | null |
2024-09-09 | Real-time optical gas sensing with two-dimensional materials | Gia Quyet Ngo et.al. | 2409.05693 | null |
2024-09-09 | AnomalyCD: A benchmark for Earth anomaly change detection with high-resolution and time-series observations | Jingtao Li et.al. | 2409.05679 | null |
2024-09-09 | Renormalized Connection for Scale-preferred Object Detection in Satellite Imagery | Fan Zhang et.al. | 2409.05624 | null |
2024-09-09 | Localization of macroscopic sources of magnetic field using optical fibers doped with NV-rich sub-micron diamonds and zero-field resonance | Mariusz Mrózek et.al. | 2409.05452 | null |
2024-09-06 | Ab initio quantum dynamics as a scalable solution to the exoplanet opacity challenge: A case study of CO $_2$ in hydrogen atmosphere | Laurent Wiesenfeld et.al. | 2409.04439 | null |
2024-09-06 | How to Identify Good Superpixels for Deforestation Detection on Tropical Rainforests | Isabela Borlido et.al. | 2409.04330 | null |
2024-09-06 | An OpenMetBuoy dataset of Marginal Ice Zone dynamics collected around Svalbard in 2022 and 2023 | Jean Rabault et.al. | 2409.04151 | link |
2024-09-05 | Deep Clustering of Remote Sensing Scenes through Heterogeneous Transfer Learning | Isaac Ray et.al. | 2409.03938 | null |
2024-09-05 | On-board Satellite Image Classification for Earth Observation: A Comparative Study of Pre-Trained Vision Transformer Models | Thanh-Dung Le et.al. | 2409.03901 | link |
2024-09-09 | UV-Mamba: A DCN-Enhanced State Space Model for Urban Village Boundary Identification in High-Resolution Remote Sensing Images | Lulin Li et.al. | 2409.03431 | link |
2024-09-04 | A General Albedo Recovery Approach for Aerial Photogrammetric Images through Inverse Rendering | Shuang Song et.al. | 2409.03032 | link |
2024-09-03 | Impact Evaluations in Data Poor Settings: The Case of Stress-Tolerant Rice Varieties in Bangladesh | Jeffrey D. Michler et.al. | 2409.02201 | null |
2024-09-03 | Brain-Inspired Online Adaptation for Remote Sensing with Spiking Neural Network | Dexin Duan et.al. | 2409.02146 | null |
2024-09-03 | Compressed learning based onboard semantic compression for remote sensing platforms | Protim Bhattacharjee et.al. | 2409.01988 | link |
2024-09-03 | Map-Assisted Remote-Sensing Image Compression at Extremely Low Bitrates | Yixuan Ye et.al. | 2409.01935 | link |
2024-09-01 | Enhancing Remote Sensing Vision-Language Models for Zero-Shot Scene Classification | Karim El Khoury et.al. | 2409.00698 | link |
2024-08-31 | Incremental Open-set Domain Adaptation | Sayan Rakshit et.al. | 2409.00530 | null |
2024-08-31 | Mapping earth mounds from space | Baki Uzun et.al. | 2409.00518 | null |
2024-08-31 | Plant detection from ultra high resolution remote sensing images: A Semantic Segmentation approach based on fuzzy loss | Shivam Pande et.al. | 2409.00513 | null |
2024-08-31 | Geospatial foundation models for image analysis: evaluating and enhancing NASA-IBM Prithvi’s domain adaptability | Chia-Yu Hsu et.al. | 2409.00489 | null |
2024-08-31 | Self-supervised Fusarium Head Blight Detection with Hyperspectral Image and Feature Mining | Yu-Fan Lin et.al. | 2409.00395 | null |
2024-08-30 | FissionVAE: Federated Non-IID Image Generation with Latent Space and Decoder Decomposition | Chen Hu et.al. | 2408.17090 | link |
2024-08-29 | Low Saturation Confidence Distribution-based Test-Time Adaptation for Cross-Domain Remote Sensing Image Classification | Yu Liang et.al. | 2408.16265 | null |
2024-08-28 | A Survey on Evaluation of Multimodal Large Language Models | Jiaxing Huang et.al. | 2408.15769 | null |
2024-08-28 | Can SAR improve RSVQA performance? | Lucrezia Tosato et.al. | 2408.15642 | null |
2024-08-27 | RSTeller: Scaling Up Visual Language Modeling in Remote Sensing with Rich Linguistic Semantics from Openly Available Data and Large Language Models | Junyao Ge et.al. | 2408.14744 | link |
2024-08-26 | MSFMamba: Multi-Scale Feature Fusion State Space Model for Multi-Source Remote Sensing Image Classification | Feng Gao et.al. | 2408.14255 | link |
2024-08-27 | Improving Water Quality Time-Series Prediction in Hong Kong using Sentinel-2 MSI Data and Google Earth Engine Cloud Computing | Rohin Sood et.al. | 2408.14010 | null |
2024-08-25 | GeoPlant: Spatial Plant Species Prediction Dataset | Lukas Picek et.al. | 2408.13928 | null |
2024-08-23 | ResSR: A Residual Approach to Super-Resolving Multispectral Images | Haley Duba-Sullivan et.al. | 2408.13225 | null |
2024-08-23 | A plug-and-play framework for curvilinear structure segmentation based on a learned reconnecting regularization | Sophie Carneiro-Esteves et.al. | 2408.12943 | null |
2024-08-22 | Hierarchical Attention and Parallel Filter Fusion Network for Multi-Source Data Classification | Han Luo et.al. | 2408.12760 | null |
2024-08-22 | Research on Improved U-net Based Remote Sensing Image Segmentation Algorithm | Qiming Yang et.al. | 2408.12672 | null |
2024-08-26 | UNetMamba: An Efficient UNet-Like Mamba for Semantic Segmentation of High-Resolution Remote Sensing Images | Enze Zhu et.al. | 2408.11545 | link |
2024-08-21 | High Performance Simulation of Spaceborne Radar for Remote-Sensing Oceanography: Application to an Altimetry Scenario | Goulven Monnier et.al. | 2408.11472 | null |
2024-08-21 | Near-Field Signal Processing: Unleashing the Power of Proximity | Ahmet M. Elbir et.al. | 2408.11434 | null |
2024-08-20 | Unified Deep Learning Model for Global Prediction of Aboveground Biomass, Canopy Height and Cover from High-Resolution, Multi-Sensor Satellite Imagery | Manuel Weber et.al. | 2408.11234 | null |
2024-08-20 | Reactive molecular dynamics simulations of micrometeoroid bombardment for space weathering of asteroid (162173) Ryugu | Daigo Shoji et.al. | 2408.10959 | null |
2024-08-20 | Novel Change Detection Framework in Remote Sensing Imagery Using Diffusion Models and Structural Similarity Index (SSIM) | Andrew Kiruluta et.al. | 2408.10619 | null |
2024-08-19 | Assessment of Spectral based Solutions for the Detection of Floating Marine Debris | Muhammad Alì et.al. | 2408.10187 | null |
2024-08-17 | Pursuing Truth: Improving Retrievals on Mid-Infrared Exo-Earth Spectra with Physically Motivated Water Abundance Profiles and Cloud Models | Björn S. Konrad et.al. | 2408.09129 | null |
2024-08-17 | Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community | Jiancheng Pan et.al. | 2408.09110 | link |
2024-08-16 | Speckle Noise Analysis for Synthetic Aperture Radar (SAR) Space Data | Sanjjushri Varshini R et.al. | 2408.08774 | null |
2024-08-16 | Tuning a SAM-Based Model with Multi-Cognitive Visual Adapter to Remote Sensing Instance Segmentation | Linghao Zheng et.al. | 2408.08576 | null |
2024-08-16 | Improving the measurement of air-water flow properties using remote distance sensing technology | Matthias Kramer et.al. | 2408.08466 | null |
2024-08-15 | SpectralEarth: Training Hyperspectral Foundation Models at Scale | Nassim Ait Ali Braham et.al. | 2408.08447 | null |
2024-08-15 | The Dawn of KAN in Image-to-Image (I2I) Translation: Integrating Kolmogorov-Arnold Networks with GANs for Unpaired I2I Translation | Arpan Mahara et.al. | 2408.08216 | link |
2024-08-15 | The Effect of Horizontal Shear on Extracting Water Currents From Surface Wave Data | Stefan Weichert et.al. | 2408.08197 | null |
2024-08-15 | Treat Stillness with Movement: Remote Sensing Change Detection via Coarse-grained Temporal Foregrounds Mining | Xixi Wang et.al. | 2408.08078 | link |
2024-08-14 | Rethinking the Key Factors for the Generalization of Remote Sensing Stereo Matching Networks | Liting Jiang et.al. | 2408.07613 | null |
2024-08-14 | Unsupervised Stereo Matching Network For VHR Remote Sensing Images Based On Error Prediction | Liting Jiang et.al. | 2408.07419 | link |
2024-08-15 | Prompt-Based Segmentation at Multiple Resolutions and Lighting Conditions using Segment Anything Model 2 | Osher Rafaeli et.al. | 2408.06970 | null |
2024-08-14 | A Comprehensive Survey on Synthetic Infrared Image synthesis | Avinash Upadhyay et.al. | 2408.06868 | null |
2024-08-13 | IFShip: A Large Vision-Language Model for Interpretable Fine-grained Ship Classification via Domain Knowledge-Enhanced Instruction Tuning | Mingning Guo et.al. | 2408.06631 | null |
2024-08-12 | On the Peril of Inferring Phytoplankton Properties from Remote-Sensing Observations | J. Xavier Prochaska et.al. | 2408.06149 | null |
2024-08-11 | Seg-CycleGAN : SAR-to-optical image translation guided by a downstream task | Hannuo Zhang et.al. | 2408.05777 | null |
2024-08-09 | Modeling and Analysis of Downlink Communications in a Heterogeneous LEO Satellite Network | Chang-Sik Choi et.al. | 2408.05070 | null |
2024-08-08 | AI for operational methane emitter monitoring from space | Anna Vaughan et.al. | 2408.04745 | null |
2024-08-08 | Depth Any Canopy: Leveraging Depth Foundation Models for Canopy Height Estimation | Daniele Rege Cambrin et.al. | 2408.04523 | link |
2024-08-08 | Dual-branch PolSAR Image Classification Based on GraphMAE and Local Feature Extraction | Yuchen Wang et.al. | 2408.04294 | null |
2024-08-08 | Quantum-Enhanced Polarimetric Imaging | Meng-Yu Xie et.al. | 2408.04183 | null |
2024-08-08 | Integrated Dynamic Phenological Feature for Remote Sensing Image Land Cover Change Detection | Yi Liu et.al. | 2408.04144 | null |
2024-08-07 | Prospects for using drones to test formation-flying CubeSat concepts, and other astronomical applications | John D. Monnier et.al. | 2408.03911 | null |
2024-08-07 | Monitoring of Hermit Crabs Using drone-captured imagery and Deep Learning based Super-Resolution Reconstruction and Improved YOLOv8 | Fan Zhao et.al. | 2408.03559 | null |
2024-08-06 | AI Foundation Models in Remote Sensing: A Survey | Siqi Lu et.al. | 2408.03464 | null |
2024-08-04 | Masked Angle-Aware Autoencoder for Remote Sensing Images | Zhihao Li et.al. | 2408.01946 | link |
2024-08-03 | Quantum Lotka-Volterra dynamics | Yuechun Jiao et.al. | 2408.01726 | null |
2024-08-02 | Deep Learning Meets OBIA: Tasks, Challenges, Strategies, and Perspectives | Lei Ma et.al. | 2408.01607 | null |
2024-07-30 | SSPA: Split-and-Synthesize Prompting with Gated Alignments for Multi-Label Image Recognition | Hao Tan et.al. | 2407.20920 | null |
2024-07-29 | Urban Traffic Accident Risk Prediction Revisited: Regionality, Proximity, Similarity and Sparsity | Minxiao Chen et.al. | 2407.19668 | link |
2024-07-29 | Towards a Knowledge guided Multimodal Foundation Model for Spatio-Temporal Remote Sensing Applications | Praveen Ravirathinam et.al. | 2407.19660 | null |
2024-07-25 | HAMSTER: Hyperspectral Albedo Maps dataset with high Spatial and TEmporal Resolution | Giulia Roccetti et.al. | 2407.18030 | null |
2024-07-24 | An Energy-Efficient Artefact Detection Accelerator on FPGAs for Hyper-Spectral Satellite Imagery | Cornell Castelino et.al. | 2407.17647 | null |
2024-07-24 | EuroCropsML: A Time Series Benchmark Dataset For Few-Shot Crop Type Classification | Joana Reuss et.al. | 2407.17458 | null |
2024-07-24 | CDDIP: Constrained Diffusion-Driven Deep Image Prior for Seismic Image Reconstruction | Paul Goyes-Peñafiel et.al. | 2407.17402 | link |
2024-07-24 | Quanv4EO: Empowering Earth Observation by means of Quanvolutional Neural Networks | Alessandro Sebastianelli et.al. | 2407.17108 | null |
2024-07-23 | Integrating Biological Data into Autonomous Remote Sensing Systems for In Situ Imageomics: A Case Study for Kenyan Animal Behavior Sensing with Unmanned Aerial Vehicles (UAVs) | Jenna M. Kline et.al. | 2407.16864 | null |
2024-07-23 | A Multitask Deep Learning Model for Classification and Regression of Hyperspectral Images: Application to the large-scale dataset | Koushikey Chhapariya et.al. | 2407.16384 | null |
2024-07-23 | Sizey: Memory-Efficient Execution of Scientific Workflow Tasks | Jonathan Bader et.al. | 2407.16353 | null |
2024-07-23 | HyTAS: A Hyperspectral Image Transformer Architecture Search Benchmark and Analysis | Fangqin Zhou et.al. | 2407.16269 | link |
2024-07-23 | Cross-Domain Separable Translation Network for Multimodal Image Change Detection | Tao Zhan et.al. | 2407.16158 | link |
2024-07-24 | Self-driving lab discovers principles for steering spontaneous emission | Saaketh Desai et.al. | 2407.16083 | null |
2024-07-22 | EfficientCD: A New Strategy For Change Detection Based With Bi-temporal Layers Exchanged | Sijun Dong et.al. | 2407.15999 | link |
2024-07-22 | PRIME: Blind Multispectral Unmixing Using Virtual Quantum Prism and Convex Geometry | Chia-Hsiang Lin et.al. | 2407.15358 | null |
2024-07-22 | Fever Detection with Infrared Thermography: Enhancing Accuracy through Machine Learning Techniques | Parsa Razmara et.al. | 2407.15302 | null |
2024-07-21 | Rethinking Feature Backbone Fine-tuning for Remote Sensing Object Detection | Yechan Kim et.al. | 2407.15143 | null |
2024-07-20 | PolyR-CNN: R-CNN for end-to-end polygonal building outline extraction | Weiqin Jiao et.al. | 2407.14912 | link |
2024-07-20 | CrossDehaze: Scaling Up Image Dehazing with Cross-Data Vision Alignment and Augmentation | Yukai Shi et.al. | 2407.14823 | link |
2024-07-20 | Representing Topological Self-Similarity Using Fractal Feature Maps for Accurate Segmentation of Tubular Structures | Jiaxing Huang et.al. | 2407.14754 | link |
2024-07-20 | $\infty$ -Brush: Controllable Large Image Synthesis with Diffusion Models in Infinite Dimensions | Minh-Quan Le et.al. | 2407.14709 | null |
2024-07-25 | Continual Panoptic Perception: Towards Multi-modal Incremental Interpretation of Remote Sensing Images | Bo Yuan et.al. | 2407.14242 | link |
2024-07-19 | The Cardinality of Identifying Code Sets for Soccer Ball Graph with Application to Remote Sensing | Anna L. D. Latour et.al. | 2407.14120 | link |
2024-07-19 | Semantic-CC: Boosting Remote Sensing Image Change Captioning via Foundational Knowledge and Semantic Guidance | Yongshuo Zhu et.al. | 2407.14032 | null |
2024-07-18 | Quantifying uncertainty in area and regression coefficient estimation from remote sensing maps | Kerri Lu et.al. | 2407.13659 | link |
2024-07-20 | EarthMarker: Visual Prompt Learning for Region-level and Point-level Remote Sensing Imagery Comprehension | Wei Zhang et.al. | 2407.13596 | link |
2024-07-18 | Wavelet-based Bi-dimensional Aggregation Network for SAR Image Change Detection | Jiangwei Xie et.al. | 2407.13151 | link |
2024-07-17 | Exploiting Inter-Image Similarity Prior for Low-Bitrate Remote Sensing Image Compression | Junhui Li et.al. | 2407.12295 | null |
2024-07-17 | UP-Diff: Latent Diffusion Model for Remote Sensing Urban Prediction | Zeyu Wang et.al. | 2407.11578 | link |
2024-07-17 | RIMformer: An End-to-End Transformer for FMCW Radar Interference Mitigation | Ziang Zhang et.al. | 2407.11459 | null |
2024-07-16 | Mapping savannah woody vegetation at the species level with multispecral drone and hyperspectral EnMAP data | Christina Karakizi et.al. | 2407.11404 | null |
2024-07-14 | Harnessing Feature Clustering For Enhanced Anomaly Detection With Variational Autoencoder And Dynamic Threshold | Tolulope Ale et.al. | 2407.10042 | null |
2024-07-13 | MutDet: Mutually Optimizing Pre-training for Remote Sensing Object Detection | Ziyue Huang et.al. | 2407.09920 | link |
2024-07-11 | Segmentation-guided Attention for Visual Question Answering from Remote Sensing Images | Lucrezia Tosato et.al. | 2407.08669 | null |
2024-07-11 | Haar Nuclear Norms with Applications to Remote Sensing Imagery Restoration | Shuang Xu et.al. | 2407.08509 | null |
2024-07-11 | Paving the way toward foundation models for irregular and unaligned Satellite Image Time Series | Iris Dumeur et.al. | 2407.08448 | null |
2024-07-11 | XAI-Guided Enhancement of Vegetation Indices for Crop Mapping | Hiba Najjar et.al. | 2407.08298 | null |
2024-07-11 | Explainability of Sub-Field Level Crop Yield Prediction using Remote Sensing | Hiba Najjar et.al. | 2407.08274 | null |
2024-07-11 | DMM: Disparity-guided Multispectral Mamba for Oriented Object Detection in Remote Sensing | Minghang Zhou et.al. | 2407.08132 | null |
2024-07-10 | PaliGemma: A versatile 3B VLM for transfer | Lucas Beyer et.al. | 2407.07726 | link |
2024-07-10 | The deep oxygen abundance in Solar System Giant Planets, with a new derivation for Saturn | Thibault Cavalié et.al. | 2407.07515 | null |
2024-07-10 | Bayesian weighted time-lapse full-waveform inversion using a receiver-extension strategy | Sergio Luiz E. F. da Silva et.al. | 2407.07467 | null |
2024-07-13 | Dual-stage Hyperspectral Image Classification Model with Spectral Supertoken | Peifu Liu et.al. | 2407.07307 | link |
2024-07-10 | Identity-enabled CDMA LiDAR for massively parallel ranging with a single-element receiver | Yixiu Shen et.al. | 2407.06918 | null |
2024-07-08 | A Mamba-based Siamese Network for Remote Sensing Change Detection | Jay N. Paranjape et.al. | 2407.06839 | link |
2024-07-08 | Tile Compression and Embeddings for Multi-Label Classification in GeoLifeCLEF 2024 | Anthony Miyaguchi et.al. | 2407.06326 | link |
2024-07-07 | Addressing single object tracking in satellite imagery through prompt-engineered solutions | Athena Psalta et.al. | 2407.05518 | null |
2024-07-07 | HyperKAN: Kolmogorov-Arnold Networks make Hyperspectral Image Classificators Smarter | Valeriy Lobanov et.al. | 2407.05278 | link |
2024-07-07 | Estimation of the Area and Precipitation Associated with a Tropical Cyclone Biparjoy by using Image Processing | Shikha Verma et.al. | 2407.05255 | null |
2024-07-06 | BlessemFlood21: Advancing Flood Analysis with a High-Resolution Georeferenced Dataset for Humanitarian Aid Support | Vladyslav Polushko et.al. | 2407.05007 | null |
2024-07-04 | MineNetCD: A Benchmark for Global Mining Change Detection on Remote Sensing Imagery | Weikang Yu et.al. | 2407.03971 | link |
2024-07-04 | High-Frequency Radar observation of strong and contrasted currents: the Alderney race paradigm | Dylan Dumas et.al. | 2407.03827 | null |
2024-07-04 | reBEN: Refined BigEarthNet Dataset for Remote Sensing Image Analysis | Kai Norman Clasen et.al. | 2407.03653 | link |
2024-07-03 | Relating CNN-Transformer Fusion Network for Change Detection | Yuhao Gao et.al. | 2407.03178 | link |
2024-07-03 | ISWSST: Index-space-wave State Superposition Transformers for Multispectral Remotely Sensed Imagery Semantic Segmentation | Chang Li et.al. | 2407.03033 | null |
2024-07-03 | Style Alignment based Dynamic Observation Method for UAV-View Geo-localization | Jie Shao et.al. | 2407.02832 | null |
2024-07-08 | Holistically-Nested Structure-Aware Graph Neural Network for Road Extraction | Tinghuai Wang et.al. | 2407.02639 | null |
2024-07-02 | Efficient Stochastic Differential Equation for DEM Super Resolution with Void Filling | Tongtong Zhang et.al. | 2407.01908 | null |
2024-06-26 | Graph Neural Network as Computationally Efficient Emulator of Ice-sheet and Sea-level System Model (ISSM) | Younghyun Koo et.al. | 2407.01464 | null |
2024-07-01 | Hyperspectral Pansharpening: Critical Review, Tools and Future Perspectives | Matteo Ciotola et.al. | 2407.01355 | link |
2024-07-01 | Small Aerial Target Detection for Airborne Infrared Detection Systems using LightGBM and Trajectory Constraints | Xiaoliang Sun et.al. | 2407.01278 | null |
2024-07-01 | FALCON: Frequency Adjoint Link with CONtinuous Density Mask for Fast Single Image Dehazing | Donghyun Kim et.al. | 2407.00972 | null |
2024-07-01 | Optical turbulence vertical distribution at the Peak Terskol Observatory and Mt. Kurapdag | A. Y. Shikhovtsev et.al. | 2407.00960 | null |
2024-06-30 | Prediction of Sentinel-2 multi-band imagery with attention BiLSTM for continuous earth surface monitoring | Weiying Zhao et.al. | 2407.00834 | null |
2024-06-30 | Fully invertible hyperbolic neural networks for segmenting large-scale surface and sub-surface data | Bas Peters et.al. | 2407.00595 | null |
2024-06-29 | SolarSAM: Building-scale Photovoltaic Potential Assessment Based on Segment Anything Model (SAM) and Remote Sensing for Emerging City | Guohao Wang et.al. | 2407.00296 | link |
2024-06-28 | Monolithic lithium niobate photonic chip for efficient terahertz-optic modulation and terahertz generation | Yiwen Zhang et.al. | 2406.19620 | null |
2024-06-27 | Cost-efficient Active Illumination Camera For Hyper-spectral Reconstruction | Yuxuan Zhang et.al. | 2406.19560 | null |
2024-06-27 | Secure quantum-enhanced measurements on a network of sensors | Sean William Moore et.al. | 2406.19285 | null |
2024-06-27 | Simultaneous determination of the dielectric relaxation behavior and soilwater characteristic curve of undisturbed soil samples | Norman Wagner et.al. | 2406.18909 | null |
2024-06-26 | Evaluating and Benchmarking Foundation Models for Earth Observation and Geospatial AI | Nikolaos Dionelis et.al. | 2406.18295 | null |
2024-06-26 | CAS: Confidence Assessments of classification algorithms for Semantic segmentation of EO data | Nikolaos Dionelis et.al. | 2406.18279 | null |
2024-06-26 | SynRS3D: A Synthetic Dataset for Global 3D Semantic Understanding from Monocular Remote Sensing Imagery | Jian Song et.al. | 2406.18151 | link |
2024-06-26 | Changen2: Multi-Temporal Remote Sensing Generative Change Foundation Model | Zhuo Zheng et.al. | 2406.17998 | link |
2024-06-25 | Cross-Modal Spherical Aggregation for Weakly Supervised Remote Sensing Shadow Removal | Kaichen Chi et.al. | 2406.17469 | null |
2024-06-25 | Continuous Urban Change Detection from Satellite Image Time Series with Temporal Feature Refinement and Multi-Task Integration | Sebastian Hafner et.al. | 2406.17458 | link |
2024-06-24 | Quantifying Heterogeneous Ecosystem Services With Multi-Label Soft Classification | Zhihui Tian et.al. | 2406.17147 | null |
2024-06-19 | Generative Data Assimilation of Sparse Weather Station Observations at Kilometer Scales | Peter Manshausen et.al. | 2406.16947 | link |
2024-06-24 | Multi-Modal Vision Transformers for Crop Mapping from Satellite Image Time Series | Theresa Follath et.al. | 2406.16513 | null |
2024-07-02 | LOGCAN++: Adaptive Local-global class-aware network for semantic segmentation of remote sensing imagery | Xiaowen Ma et.al. | 2406.16502 | link |
2024-06-23 | UDHF2-Net: An Uncertainty-diffusion-model-based High-Frequency TransFormer Network for High-accuracy Interpretation of Remotely Sensed Imagery | Pengfei Zhang et.al. | 2406.16129 | null |
2024-06-22 | Single-Temporal Supervised Learning for Universal Remote Sensing Change Detection | Zhuo Zheng et.al. | 2406.15694 | link |
2024-06-21 | Miniature fluorescence sensor for quantitative detection of brain tumour | Jean Pierre Ndabakuranye et.al. | 2406.15520 | null |
2024-06-21 | Rethinking Remote Sensing Change Detection With A Mask View | Xiaowen Ma et.al. | 2406.15320 | link |
2024-06-21 | Understanding the variability of helium abundance in the solar corona using three-fluid modeling and UV observations | Leon Ofman et.al. | 2406.14897 | null |
2024-07-01 | Evaluation of Deep Learning Semantic Segmentation for Land Cover Mapping on Multispectral, Hyperspectral and High Spatial Aerial Imagery | Ilham Adi Panuntun et.al. | 2406.14220 | null |
2024-06-20 | Semi Supervised Heterogeneous Domain Adaptation via Disentanglement and Pseudo-Labelling | Cassio F. Dantas et.al. | 2406.14087 | link |
2024-06-20 | Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images | Qinfeng Zhu et.al. | 2406.14086 | link |
2024-06-21 | CMTNet: Convolutional Meets Transformer Network for Hyperspectral Images Classification | Faxu Guo et.al. | 2406.14080 | null |
2024-06-19 | Locating and measuring marine aquaculture production from space: a computer vision approach in the French Mediterranean | Sebastian Quaade et.al. | 2406.13847 | null |
2024-06-22 | Velocity Analysis of Moving Objects in Earth Observation Satellite Images Using Multi-Spectral Push Broom Scanning | Eric Keto et.al. | 2406.13710 | null |
2024-06-19 | DDLNet: Boosting Remote Sensing Change Detection with Dual-Domain Learning | Xiaowen Ma et.al. | 2406.13606 | link |
2024-06-19 | Formation of a Magnetic Cloud from the Merging of Two Successive Coronal Mass Ejections | Chong Chen et.al. | 2406.13603 | null |
2024-06-19 | Towards a multimodal framework for remote sensing image change retrieval and captioning | Roger Ferrod et.al. | 2406.13424 | link |
2024-06-19 | Multi-scale Restoration of Missing Data in Optical Time-series Images with Masked Spatial-Temporal Attention Network | Zaiyan Zhang et.al. | 2406.13358 | link |
2024-06-18 | Scale-Translation Equivariant Network for Oceanic Internal Solitary Wave Localization | Zhang Wan et.al. | 2406.13060 | link |
2024-06-18 | ChangeViT: Unleashing Plain Vision Transformers for Change Detection | Duowang Zhu et.al. | 2406.12847 | link |
2024-06-21 | Windows Into Other Worlds: Pitfalls in the physical interpretation of exoplanet atmospheric spectroscopy | Darius Modirrousta-Galian et.al. | 2406.12765 | null |
2024-06-18 | RS-GPT4V: A Unified Multimodal Instruction-Following Dataset for Remote Sensing Image Understanding | Linrui Xu et.al. | 2406.12479 | link |
2024-06-18 | VRSBench: A Versatile Vision-Language Benchmark Dataset for Remote Sensing Image Understanding | Xiang Li et.al. | 2406.12384 | link |
2024-06-17 | Scaling Efficient Masked Autoencoder Learning on Large Remote Sensing Dataset | Fengxiang Wang et.al. | 2406.11933 | link |
2024-06-17 | HyperSIGMA: Hyperspectral Intelligence Comprehension Foundation Model | Di Wang et.al. | 2406.11519 | link |
2024-06-17 | Diffusion Models in Low-Level Vision: A Survey | Chunming He et.al. | 2406.11138 | link |
2024-06-16 | ALPS: An Auto-Labeling and Pre-training Scheme for Remote Sensing Segmentation With Segment Anything Model | Song Zhang et.al. | 2406.10855 | link |
2024-06-16 | PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery | Libo Wang et.al. | 2406.10828 | link |
2024-06-15 | Beyond the Visible: Jointly Attending to Spectral and Spatial Dimensions with HSI-Diffusion for the FINCH Spacecraft | Ian Vyse et.al. | 2406.10724 | link |
2024-06-14 | Annotation Cost-Efficient Active Learning for Deep Metric Learning Driven Remote Sensing Image Retrieval | Genc Hoxha et.al. | 2406.10107 | null |
2024-06-14 | SkySenseGPT: A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding | Junwei Luo et.al. | 2406.10100 | link |
2024-06-14 | Soil nitrogen forecasting from environmental variables provided by multisensor remote sensing images | Weiying Zhao et.al. | 2406.09812 | null |
2024-06-13 | Modelling the magnetic vectors of ICMEs at different heliocentric distances with INFROS | Ranadeep Sarkar et.al. | 2406.09247 | null |
2024-06-16 | A $^{2}$ -MAE: A spatial-temporal-spectral unified remote sensing pre-training method based on anchor-aware masked autoencoder | Lixian Zhang et.al. | 2406.08079 | null |
2024-06-12 | Deep Learning for Slum Mapping in Remote Sensing Images: A Meta-analysis and Review | Anjali Raj et.al. | 2406.08031 | null |
2024-06-12 | Real-time, chirped-pulse heterodyne detection at room-temperature with 100GHz 3dB-bandwidth mid-infrared quantum-well photodetectors | Quyang Lin et.al. | 2406.08027 | null |
2024-06-11 | Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery | Biplov Bhandari et.al. | 2406.07482 | link |
2024-06-11 | Characterizing GPROF Regional Bias Using Radar-Derived Hydrometeor Information | Eric Goldenstern et.al. | 2406.07344 | null |
2024-06-11 | Grapevine Disease Prediction Using Climate Variables from Multi-Sensor Remote Sensing Imagery via a Transformer Model | Weiying Zhao et.al. | 2406.07094 | null |
2024-06-11 | RS-Agent: Automating Remote Sensing Tasks through Intelligent Agents | Wenjia Xu et.al. | 2406.07089 | null |
2024-06-11 | RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks | Zhechao Wang et.al. | 2406.07032 | null |
2024-06-10 | An Elliptic Kernel Unsupervised Autoencoder-Graph Convolutional Network Ensemble Model for Hyperspectral Unmixing | Estefania Alfaro-Mejia et.al. | 2406.06742 | null |
2024-06-10 | ReCon1M:A Large-scale Benchmark Dataset for Relation Comprehension in Remote Sensing Imagery | Xian Sun et.al. | 2406.06028 | null |
2024-06-09 | BOSC: A toolbox for aerial imagery mapping | Ricard Durall et.al. | 2406.05833 | link |
2024-06-09 | Gentle-CLIP: Exploring Aligned Semantic In Low-Quality Multimodal Data With Soft Alignment | Zijia Song et.al. | 2406.05766 | null |
2024-06-15 | A DeNoising FPN With Transformer R-CNN for Tiny Object Detection | Hou-I Liu et.al. | 2406.05755 | link |
2024-06-09 | HDMba: Hyperspectral Remote Sensing Imagery Dehazing with State Space Model | Hang Fu et.al. | 2406.05700 | link |
2024-06-09 | SRC-Net: Bi-Temporal Spatial Relationship Concerned Network for Change Detection | Hongjia Chen et.al. | 2406.05668 | link |
2024-06-09 | Which Backbone to Use: A Resource-efficient Domain Specific Comparison for Computer Vision | Pranav Jeevan et.al. | 2406.05612 | link |
2024-06-08 | A Deep Learning-Augmented Stand-off Radar Scheme for Rapidly Detecting Tree Defects | Jiwei Qian et.al. | 2406.05389 | null |
2024-06-07 | Nacala-Roof-Material: Drone Imagery for Roof Detection, Classification, and Segmentation to Support Mosquito-borne Disease Risk Assessment | Venkanna Babu Guthula et.al. | 2406.04949 | null |
2024-06-07 | MGIMM: Multi-Granularity Instruction Multimodal Model for Attribute-Guided Remote Sensing Image Detailed Description | Cong Yang et.al. | 2406.04716 | link |
2024-06-07 | UCDNet: Multi-UAV Collaborative 3D Object Detection Network by Reliable Feature Mapping | Pengju Tian et.al. | 2406.04648 | null |
2024-06-06 | SpectralZoom: Efficient Segmentation with an Adaptive Hyperspectral Camera | Jackson Arnold et.al. | 2406.04287 | null |
2024-06-06 | M3LEO: A Multi-Modal, Multi-Label Earth Observation Dataset Integrating Interferometric SAR and RGB Data | Matthew J Allen et.al. | 2406.04230 | link |
2024-06-06 | CDMamba: Remote Sensing Image Change Detection with Mamba | Haotian Zhang et.al. | 2406.04207 | link |
2024-06-06 | LDM-RSIC: Exploring Distortion Prior with Latent Diffusion Models for Remote Sensing Image Compression | Junhui Li et.al. | 2406.03961 | link |
2024-06-09 | Partial Label Learning with Focal Loss for Sea Ice Classification Based on Ice Charts | Behzad Vahedi et.al. | 2406.03645 | link |
2024-06-05 | Foundation Models for Geophysics: Reviews and Perspectives | Qi Liu et.al. | 2406.03163 | null |
2024-06-05 | P2PFormer: A Primitive-to-polygon Method for Regular Building Contour Extraction from Remote Sensing Images | Tao Zhang et.al. | 2406.02930 | null |
2024-06-04 | Low-Rank Adaption on Transformer-based Oriented Object Detector for Satellite Onboard Processing of Remote Sensing Images | Xinyang Pu et.al. | 2406.02385 | link |
2024-06-03 | Sparse Focus Network for Multi-Source Remote Sensing Data Classification | Xuepeng Jin et.al. | 2406.01245 | null |
2024-06-03 | LSKSANet: A Novel Architecture for Remote Sensing Image Semantic Segmentation Leveraging Large Selective Kernel and Sparse Attention Mechanism | Miao Fu et.al. | 2406.01228 | null |
2024-06-02 | Kolmogorov-Arnold Network for Satellite Image Classification in Remote Sensing | Minjong Cheon et.al. | 2406.00600 | link |
2024-06-04 | Analyzing trends for agricultural decision support system using twitter data | Sneha Jha et.al. | 2406.00577 | null |
2024-06-01 | A Review of Pulse-Coupled Neural Network Applications in Computer Vision and Image Processing | Nurul Rafi et.al. | 2406.00239 | null |
2024-05-31 | ImplicitTerrain: a Continuous Surface Model for Terrain Data Analysis | Haoan Feng et.al. | 2406.00227 | null |
2024-05-31 | Responsible AI for Earth Observation | Pedram Ghamisi et.al. | 2405.20868 | null |
2024-05-31 | Maximum Temperature Prediction Using Remote Sensing Data Via Convolutional Neural Network | Lorenzo Innocenti et.al. | 2405.20731 | null |
2024-05-30 | P-MSDiff: Parallel Multi-Scale Diffusion for Remote Sensing Image Segmentation | Qi Zhang et.al. | 2405.20443 | link |
2024-05-30 | FMARS: Annotating Remote Sensing Images for Disaster Management using Foundation Models | Edoardo Arnaudo et.al. | 2405.20109 | link |
2024-05-30 | Rapid Wildfire Hotspot Detection Using Self-Supervised Learning on Temporal Remote Sensing Data | Luca Barco et.al. | 2405.20093 | link |
2024-05-30 | Recipes for forming a carbon-rich giant planet | Olivier Mousis et.al. | 2405.19748 | null |
2024-05-30 | Twin Deformable Point Convolutions for Point Cloud Semantic Segmentation in Remote Sensing Scenes | Yong-Qiang Mao et.al. | 2405.19735 | null |
2024-05-30 | Research on Foundation Model for Spatial Data Intelligence: China’s 2024 White Paper on Strategic Development of Spatial Data Intelligence | Shaohua Wang et.al. | 2405.19730 | null |
2024-06-02 | Large-scale DSM registration via motion averaging | Ningli Xu et.al. | 2405.19442 | null |
2024-06-05 | FUSU: A Multi-temporal-source Land Use Change Segmentation Dataset for Fine-grained Urban Semantic Understanding | Shuai Yuan et.al. | 2405.19055 | link |
2024-05-29 | Transcending Fusion: A Multi-Scale Alignment Method for Remote Sensing Image-Text Retrieval | Rui Yang et.al. | 2405.18959 | link |
2024-05-29 | MAGIC: Modular Auto-encoder for Generalisable Model Inversion with Bias Corrections | Yihang She et.al. | 2405.18953 | link |
2024-05-29 | Spectral Fidelity and Spatial Enhancement: An Assessment and Cascading of Pan-Sharpening Techniques for Satellite Imagery | Abdul Aziz A. B et.al. | 2405.18900 | null |
2024-05-29 | Refinement of global coronal and interplanetary magnetic field extrapolations constrained by remote-sensing and in-situ observations at the solar minimum | Guanglu Shi et.al. | 2405.18665 | null |
2024-05-28 | Probing the Information Theoretical Roots of Spatial Dependence Measures | Zhangyu Wang et.al. | 2405.18459 | link |
2024-05-28 | SSLChange: A Self-supervised Change Detection Framework Based on Domain Adaptation | Yitao Zhao et.al. | 2405.18224 | link |
2024-05-28 | Near-Infrared and Low-Rank Adaptation of Vision Transformers in Remote Sensing | Irem Ulku et.al. | 2405.17901 | null |
2024-05-28 | Towards Efficient Disaster Response via Cost-effective Unbiased Class Rate Estimation through Neyman Allocation Stratified Sampling Active Learning | Yanbing Bai et.al. | 2405.17734 | null |
2024-05-27 | Robust Perception and Navigation of Autonomous Surface Vehicles in Challenging Environments | Mingi Jeong et.al. | 2405.17657 | null |
2024-05-27 | Refraction FWI of a circular shot OBN acquisition in the Brazilian pre-salt region | Sérgio Luiz E. F. da Silva et.al. | 2405.17330 | null |
2024-05-27 | Deep Feature Gaussian Processes for Single-Scene Aerosol Optical Depth Reconstruction | Shengjie Liu et.al. | 2405.17262 | null |
2024-05-27 | SDL-MVS: View Space and Depth Deformable Learning Paradigm for Multi-View Stereo Reconstruction in Remote Sensing | Yong-Qiang Mao et.al. | 2405.17140 | null |
2024-05-27 | Superpixelwise Low-rank Approximation based Partial Label Learning for Hyperspectral Image Classification | Shujun Yang et.al. | 2405.17110 | link |
2024-05-27 | Evaluation of Resource-Efficient Crater Detectors on Embedded Systems | Simon Vellas et.al. | 2405.16953 | link |
2024-05-24 | Multimodal Object Detection via Probabilistic a priori Information Integration | Hafsa El Hafyani et.al. | 2405.15596 | link |
2024-05-29 | Composed Image Retrieval for Remote Sensing | Bill Psomas et.al. | 2405.15587 | link |
2024-05-24 | MagicBathyNet: A Multimodal Remote Sensing Dataset for Bathymetry Prediction and Pixel-based Classification in Shallow Waters | Panagiotis Agrafiotis et.al. | 2405.15477 | link |
2024-05-24 | Comparing remote sensing-based forest biomass mapping approaches using new forest inventory plots in contrasting forests in northeastern and southwestern China | Wenquan Dong et.al. | 2405.15438 | null |
2024-05-24 | Transformer-based Federated Learning for Multi-Label Remote Sensing Image Classification | Barış Büyüktaş et.al. | 2405.15405 | null |
2024-05-24 | Leveraging knowledge distillation for partial multi-task learning from multiple remote sensing datasets | Hoàng-Ân Lê et.al. | 2405.15394 | link |
2024-05-23 | Dual-comb correlation spectroscopy of thermal light | Eugene J. Tsao et.al. | 2405.14842 | null |
2024-05-23 | Multi-view Remote Sensing Image Segmentation With SAM priors | Zipeng Qi et.al. | 2405.14171 | null |
2024-05-23 | Hyperspectral Image Dataset for Individual Penguin Identification | Youta Noboru et.al. | 2405.14146 | null |
2024-05-22 | AutoLCZ: Towards Automatized Local Climate Zone Mapping from Rule-Based Remote Sensing | Chenying Liu et.al. | 2405.13993 | null |
2024-05-22 | Embedding Generalized Semantic Knowledge into Few-Shot Remote Sensing Segmentation | Yuyu Jia et.al. | 2405.13686 | null |
2024-05-22 | MetaEarth: A Generative Foundation Model for Global-Scale Remote Sensing Image Generation | Zhiping Yu et.al. | 2405.13570 | null |
2024-05-22 | Euclid. I. Overview of the Euclid mission | Euclid Collaboration et.al. | 2405.13491 | null |
2024-05-22 | A Label Propagation Strategy for CutMix in Multi-Label Remote Sensing Image Classification | Tom Burgert et.al. | 2405.13451 | null |
2024-05-21 | Global-Local Detail Guided Transformer for Sea Ice Recognition in Optical Remote Sensing Images | Zhanchao Huang et.al. | 2405.13197 | null |
2024-05-21 | Diffusion-RSCC: Diffusion Probabilistic Model for Change Captioning in Remote Sensing Images | Xiaofei Yu et.al. | 2405.12875 | link |
2024-05-21 | 3DSS-Mamba: 3D-Spectral-Spatial Mamba for Hyperspectral Image Classification | Yan He et.al. | 2405.12487 | null |
2024-05-25 | Mamba-in-Mamba: Centralized Mamba-Cross-Scan in Tokenized Mamba Model for Hyperspectral Image Classification | Weilian Zhou et.al. | 2405.12003 | link |
2024-05-20 | Climatic & Anthropogenic Hazards to the Nasca World Heritage: Application of Remote Sensing, AI, and Flood Modelling | Masato Sakai et.al. | 2405.11814 | null |
2024-05-18 | InfRS: Incremental Few-Shot Object Detection in Remote Sensing Images | Wuzhou Li et.al. | 2405.11293 | link |
2024-05-17 | Ptychographic non-line-of-sight imaging for depth-resolved visualization of hidden objects | Pengming Song et.al. | 2405.11115 | null |
2024-05-17 | Photorealistic 3D Urban Scene Reconstruction and Point Cloud Extraction using Google Earth Imagery and Gaussian Splatting | Kyle Gao et.al. | 2405.11021 | null |
2024-05-17 | CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation | Mushui Liu et.al. | 2405.10530 | link |
2024-05-17 | Enhancing Perception Quality in Remote Sensing Image Compression via Invertible Neural Network | Junhui Li et.al. | 2405.10518 | null |
2024-05-16 | Region-level labels in ice charts can produce pixel-level segmentation for Sea Ice types | Muhammed Patel et.al. | 2405.10456 | null |
2024-05-16 | PIR: Remote Sensing Image-Text Retrieval with Prior Instruction Representation Learning | Jiancheng Pan et.al. | 2405.10160 | link |
2024-05-16 | RSDehamba: Lightweight Vision Mamba for Remote Sensing Satellite Image Dehazing | Huiling Zhou et.al. | 2405.10030 | null |
2024-05-16 | Cross-sensor self-supervised training and alignment for remote sensing | Valerio Marsocci et.al. | 2405.09922 | null |
2024-05-16 | Many-Shot In-Context Learning in Multimodal Foundation Models | Yixing Jiang et.al. | 2405.09798 | link |
2024-05-16 | LeMeViT: Efficient Vision Transformer with Learnable Meta Tokens for Remote Sensing Image Interpretation | Wentao Jiang et.al. | 2405.09789 | link |
2024-05-15 | SARATR-X: A Foundation Model for Synthetic Aperture Radar Images Target Recognition | Weijie L et.al. | 2405.09365 | link |
2024-05-15 | RSHazeDiff: A Unified Fourier-aware Diffusion Model for Remote Sensing Image Dehazing | Jiamei Xiong et.al. | 2405.09083 | link |
2024-05-15 | Dim Small Target Detection and Tracking: A Novel Method Based on Temporal Energy Selective Scaling and Trajectory Association | Weihua Gao et.al. | 2405.09054 | null |
2024-05-15 | Dynamic Loss Decay based Robust Oriented Object Detection on Remote Sensing Images with Noisy Labels | Guozhang Liu et.al. | 2405.09024 | null |
2024-05-14 | Promoting AI Equity in Science: Generalized Domain Prompt Learning for Accessible VLM Research | Qinglong Cao et.al. | 2405.08668 | link |
2024-05-14 | Rethinking Scanning Strategies with Vision Mamba in Semantic Segmentation of Remote Sensing Imagery: An Experimental Study | Qinfeng Zhu et.al. | 2405.08493 | null |
2024-05-13 | IMAFD: An Interpretable Multi-stage Approach to Flood Detection from time series Multispectral Data | Ziyang Zhang et.al. | 2405.07916 | null |
2024-05-13 | Sub-percent Characterization and Polarimetric Performance Analysis of Commercial Micro-polarizer Array Detectors | Thijs Stockmans et.al. | 2405.07864 | null |
2024-05-13 | Dehazing Remote Sensing and UAV Imagery: A Review of Deep Learning, Prior-based, and Hybrid Approaches | Gao Yu Lee et.al. | 2405.07520 | null |
2024-05-11 | Semantic Guided Large Scale Factor Remote Sensing Image Super-resolution with Generative Diffusion Prior | Ce Wang et.al. | 2405.07044 | link |
2024-05-10 | Ocean-DC: An analysis ready data cube framework for environmental and climate change monitoring over the port areas | Ioannis Kavouras et.al. | 2405.06730 | null |
2024-05-10 | A Lightweight Transformer for Remote Sensing Image Change Captioning | Dongwei Sun et.al. | 2405.06598 | link |
2024-05-10 | Comparative Analysis of Advanced Feature Matching Algorithms in Challenging High Spatial Resolution Optical Satellite Stereo Scenarios | Qiyan Luo et.al. | 2405.06246 | null |
2024-05-09 | UnSegGNet: Unsupervised Image Segmentation using Graph Neural Networks | Kovvuri Sai Gopal Reddy et.al. | 2405.06057 | link |
2024-05-09 | Exploring Text-Guided Single Image Editing for Remote Sensing Images | Fangzhou Han et.al. | 2405.05769 | link |
2024-05-08 | EarthMatch: Iterative Coregistration for Fine-grained Localization of Astronaut Photography | Gabriele Berton et.al. | 2405.05422 | link |
2024-05-08 | Identifying every building’s function in large-scale urban areas with multi-modality remote-sensing data | Zhuohong Li et.al. | 2405.05133 | link |
2024-05-08 | Frequency-Assisted Mamba for Remote Sensing Image Super-Resolution | Yi Xiao et.al. | 2405.04964 | link |
2024-05-07 | Remote Diffusion | Kunal Sunil Kasodekar et.al. | 2405.04717 | null |
2024-05-07 | An Advanced Features Extraction Module for Remote Sensing Image Super-Resolution | Naveed Sultan et.al. | 2405.04595 | null |
2024-05-07 | New allometric models for the USA create a step-change in forest carbon estimation, modeling, and mapping | Lucas K. Johnson et.al. | 2405.04507 | null |
2024-05-07 | Vision Mamba: A Comprehensive Survey and Taxonomy | Xiao Liu et.al. | 2405.04404 | link |
2024-05-07 | Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications | Markus Hillemann et.al. | 2405.04345 | null |
2024-05-07 | Bidirectional cascaded superfluorescent lasing in air enabled by resonant third harmonic photon exchange from nitrogen to argon | Zan Nie et.al. | 2405.04089 | null |
2024-05-08 | Leafy Spurge Dataset: Real-world Weed Classification Within Aerial Drone Imagery | Kyle Doherty et.al. | 2405.03702 | null |
2024-05-06 | Knowledge-aware Text-Image Retrieval for Remote Sensing Images | Li Mi et.al. | 2405.03373 | null |
2024-05-05 | Spectro-photometry of Phobos simulants: I. Detectability of hydrated minerals and organic bands | Antonin Wargnier et.al. | 2405.02999 | null |
2024-05-04 | Onboard Out-of-Calibration Detection of Deep Learning Models using Conformal Prediction | Protim Bhattacharjee et.al. | 2405.02634 | null |
2024-05-03 | Solution for Authenticity Identification of Typical Target Remote Sensing Images | Yipeng Lin et.al. | 2405.02362 | null |
2024-05-03 | Analysing PolSAR data from vegetation by using the subaperture decomposition approach | J. David Ballester-Berman et.al. | 2405.02007 | null |
2024-05-03 | HoloGS: Instant Depth-based 3D Gaussian Splatting with Microsoft HoloLens 2 | Miriam Jäger et.al. | 2405.02005 | null |
2024-05-03 | SFFNet: A Wavelet-Based Spatial and Frequency Domain Fusion Network for Remote Sensing Segmentation | Yunsong Yang et.al. | 2405.01992 | link |
2024-05-03 | Lightweight Change Detection in Heterogeneous Remote Sensing Images with Online All-Integer Pruning Training | Chengyang Zhang et.al. | 2405.01920 | null |
2024-05-06 | SOAR: Advancements in Small Body Object Detection for Aerial Imagery Using State Space Models and Programmable Gradients | Tushar Verma et.al. | 2405.01699 | null |
2024-05-02 | CromSS: Cross-modal pre-training with noisy labels for remote sensing image segmentation | Chenying Liu et.al. | 2405.01217 | null |
2024-05-02 | MFDS-Net: Multi-Scale Feature Depth-Supervised Network for Remote Sensing Change Detection with Global Semantic and Detail Information | Zhenyang Huang et.al. | 2405.01065 | link |
2024-05-02 | A text-based, generative deep learning model for soil reflectance spectrum simulation in the VIS-NIR (400-2499 nm) bands | Tong Lei et.al. | 2405.01060 | link |
2024-05-02 | Hyperspectral Band Selection based on Generalized 3DTV and Tensor CUR Decomposition | Katherine Henneberger et.al. | 2405.00951 | null |
2024-05-01 | Remote Sensing Data Assimilation with a Chained Hydrologic-hydraulic Model for Flood Forecasting | Thanh Huy Nguyen et.al. | 2405.00567 | null |
2024-05-01 | Get Your Embedding Space in Order: Domain-Adaptive Regression for Forest Monitoring | Sizhuo Li et.al. | 2405.00514 | link |
2024-04-30 | Context-Aware Mobile Network Performance Prediction Using Network & Remote Sensing Data | Ali Shibli et.al. | 2405.00220 | null |
2024-04-30 | Analysis and Enhancement of Lossless Image Compression in JPEG-XL | Rustam Mamedov et.al. | 2404.19755 | null |
2024-04-30 | Data-Driven Invertible Neural Surrogates of Atmospheric Transmission | James Koch et.al. | 2404.19605 | null |
2024-04-30 | AI techniques for near real-time monitoring of contaminants in coastal waters on board future Phisat-2 mission | Francesca Razzano et.al. | 2404.19586 | null |
2024-04-30 | A Light-weight Transformer-based Self-supervised Matching Network for Heterogeneous Images | Wang Zhang et.al. | 2404.19311 | null |
2024-04-29 | Improving Interpretability of Deep Active Learning for Flood Inundation Mapping Through Class Ambiguity Indices Using Multi-spectral Satellite Imagery | Hyunho Lee et.al. | 2404.19043 | null |
2024-04-27 | Remote Sensing Image Enhancement through Spatiotemporal Filtering | Hessah Albanwan et.al. | 2404.18950 | null |
2024-04-29 | Swin2-MoSE: A New Single Image Super-Resolution Model for Remote Sensing | Leonardo Rossi et.al. | 2404.18924 | link |
2024-05-02 | RSCaMa: Remote Sensing Image Change Captioning with State Space Model | Chenyang Liu et.al. | 2404.18895 | link |
2024-04-29 | Context Matters: Leveraging Spatiotemporal Metadata for Semi-Supervised Learning on Remote Sensing Images | Maximilian Bernhard et.al. | 2404.18583 | link |
2024-04-29 | Multisensor Data Fusion for Automatized Insect Monitoring (KInsecta) | Martin Tschaikner et.al. | 2404.18504 | null |
2024-04-29 | Coupling in situ and remote sensing data to assess $α$- and $β$ -diversity over biogeographic gradients | Maxime Lenormand et.al. | 2404.18485 | link |
2024-04-29 | Efficient Meta-Learning Enabled Lightweight Multiscale Few-Shot Object Detection in Remote Sensing Images | Wenbin Guan et.al. | 2404.18426 | null |
2024-05-02 | Efficient Remote Sensing with Harmonized Transfer Learning and Modality Alignment | Tengjun Huang et.al. | 2404.18253 | link |
2024-04-28 | Flood Data Analysis on SpaceNet 8 Using Apache Sedona | Yanbing Bai et.al. | 2404.18235 | null |
2024-04-28 | Event-scale Internal Tide Variability via X-band Marine Radar | Alexandra J. Simpson et.al. | 2404.18218 | null |
2024-04-27 | Spatial, Temporal, and Geometric Fusion for Remote Sensing Images | Hessah Albanwan et.al. | 2404.17851 | null |
2024-04-27 | RFL-CDNet: Towards Accurate Change Detection via Richer Feature Learning | Yuhang Gan et.al. | 2404.17765 | link |
2024-04-26 | ChangeBind: A Hybrid Change Encoder for Remote Sensing Change Detection | Mubashir Noman et.al. | 2404.17565 | link |
2024-04-26 | Spatial-frequency Dual-Domain Feature Fusion Network for Low-Light Remote Sensing Image Enhancement | Zishu Yao et.al. | 2404.17400 | link |
2024-04-26 | MCSDNet: Mesoscale Convective System Detection Network via Multi-scale Spatiotemporal Information | Jiajun Liang et.al. | 2404.17186 | link |
2024-04-25 | Cross-sensor super-resolution of irregularly sampled Sentinel-2 time series | Aimi Okabayashi et.al. | 2404.16409 | link |
2024-04-26 | A Survey on Visual Mamba | Hanwei Zhang et.al. | 2404.15956 | null |
2024-04-23 | GeoLLM-Engine: A Realistic Environment for Building Geospatial Copilots | Simranjit Singh et.al. | 2404.15500 | null |
2024-04-23 | Lidar-based gas analyzer for remote sensing of atmospheric methane | Viacheslav Meshcherinov et.al. | 2404.15464 | null |
2024-04-23 | Transiting Exoplanet Atmospheres in the Era of JWST | Eliza M. -R. Kempton et.al. | 2404.15430 | null |
2024-04-23 | Spectropolarimetric Radio Imaging of Faint Gyrosynchrotron Emission from a CME : A Possible Indication of the Insufficiency of Homogeneous Models | Devojyoti Kansabanik et.al. | 2404.14714 | null |
2024-04-23 | Unsupervised Domain Adaptation Architecture Search with Self-Training for Land Cover Mapping | Clifford Broni-Bediako et.al. | 2404.14704 | link |
2024-04-22 | PointDifformer: Robust Point Cloud Registration With Neural Diffusion and Transformer | Rui She et.al. | 2404.14034 | null |
2024-04-22 | C2F-SemiCD: A Coarse-to-Fine Semi-Supervised Change Detection Method Based on Consistency Regularization in High-Resolution Remote Sensing Images | Chengxi Han et.al. | 2404.13838 | link |
2024-04-21 | LMFNet: An Efficient Multimodal Fusion Approach for Semantic Segmentation in High-Resolution Remote Sensing | Tong Wang et.al. | 2404.13659 | null |
2024-04-20 | AMMUNet: Multi-Scale Attention Map Merging for Remote Sensing Image Segmentation | Yang Yang et.al. | 2404.13408 | link |
2024-04-20 | StrideNET: Swin Transformer for Terrain Recognition with Dynamic Roughness Extraction | Maitreya Shelare et.al. | 2404.13270 | null |
2024-04-19 | Equivariant Imaging for Self-supervised Hyperspectral Image Inpainting | Shuo Li et.al. | 2404.13159 | null |
2024-04-19 | Recurrent Neural Networks for Modelling Gross Primary Production | David Montero et.al. | 2404.12745 | null |
2024-04-19 | Generalized Few-Shot Meets Remote Sensing: Discovering Novel Classes in Land Cover Mapping via Hybrid Semantic Segmentation Framework | Zhuohong Li et.al. | 2404.12721 | link |
2024-04-19 | Detecting Out-Of-Distribution Earth Observation Images with Diffusion Models | Georges Le Bellier et.al. | 2404.12667 | null |
2024-04-18 | Asteroid (101955) Bennu in the Laboratory: Properties of the Sample Collected by OSIRIS-REx | Dante S. Lauretta et.al. | 2404.12536 | null |
2024-04-18 | Advancing Applications of Satellite Photogrammetry: Novel Approaches for Built-up Area Modeling and Natural Environment Monitoring using Stereo/Multi-view Satellite Image-derived 3D Data | Shengxi Gui et.al. | 2404.12487 | null |
2024-04-18 | Radio Observations as an Extrasolar Planet Discovery and Characterization: Interior Structure and Habitability | T. Joseph W. Lazio et.al. | 2404.12348 | null |
2024-04-18 | MaskCD: A Remote Sensing Change Detection Network Based on Mask Classification | Weikang Yu et.al. | 2404.12081 | link |
2024-04-18 | Directional intense terahertz radiation driven by abruptly autofocusing lasers in air | Xiao-Ran Zheng et.al. | 2404.11846 | null |
2024-04-17 | When are Foundation Models Effective? Understanding the Suitability for Pixel-Level Classification Using Multispectral Imagery | Yiqun Xie et.al. | 2404.11797 | null |
2024-04-17 | IrrNet: Advancing Irrigation Mapping with Incremental Patch Size Training on Remote Sensing Imagery | Oishee Bintey Hoque et.al. | 2404.11762 | null |
2024-04-17 | GEOBIND: Binding Text, Image, and Audio through Satellite Images | Aayush Dhakal et.al. | 2404.11720 | null |
2024-04-17 | SSDiff: Spatial-spectral Integrated Diffusion Model for Remote Sensing Pansharpening | Yu Zhong et.al. | 2404.11537 | null |
2024-04-23 | Single-temporal Supervised Remote Change Detection for Domain Generalization | Qiangang Du et.al. | 2404.11326 | null |
2024-04-23 | Leveraging Fine-Grained Information and Noise Decoupling for Remote Sensing Change Detection | Qiangang Du et.al. | 2404.11318 | null |
2024-04-17 | Achieving Rotation Invariance in Convolution Operations: Shifting from Data-Driven to Mechanism-Assured | Hanlin Mo et.al. | 2404.11309 | null |
2024-04-17 | Optical Image-to-Image Translation Using Denoising Diffusion Models: Heterogeneous Change Detection as a Use Case | João Gabriel Vinholi et.al. | 2404.11243 | null |
2024-04-17 | Reuse out-of-year data to enhance land cover mappingvia feature disentanglement and contrastive learning | Cassio F. Dantas et.al. | 2404.11114 | null |
2024-04-17 | Integrated Communication, Navigation, and Remote Sensing in LEO Networks with Vehicular Applications | Min Sheng et.al. | 2404.10969 | null |
2024-04-16 | A Concise Tiling Strategy for Preserving Spatial Context in Earth Observation Imagery | Ellianna Abrahams et.al. | 2404.10927 | link |
2024-04-16 | Exploring selective image matching methods for zero-shot and few-sample unsupervised domain adaptation of urban canopy prediction | John Francis et.al. | 2404.10626 | null |
2024-04-16 | Polarized Adding Method of Discrete Ordinate Approximation for Ultraviolet-Visible and Near-Infrared Radiative Transfer | Kun Wu et.al. | 2404.10587 | null |
2024-04-16 | Learnable Prompt for Few-Shot Semantic Segmentation in Remote Sensing Domain | Steve Andreas Immanuel et.al. | 2404.10307 | link |
2024-04-16 | Using Multi-Source Data to Identify High-Emitting Heavy-Duty Diesel Vehicles | Zhuoqian Yang et.al. | 2404.10243 | null |
2024-04-17 | Ultra-Wide Dual-band Rydberg Atomic Receiver Based on Space Division Multiplexing RF-Chip Modules | Li-Hua Zhang et.al. | 2404.09757 | null |
2024-04-15 | On-chip Real-time Hyperspectral Imager with Full CMOS Resolution Enabled by Massively Parallel Neural Network | Junren Wen et.al. | 2404.09500 | null |
2024-04-14 | Bridging Data Islands: Geographic Heterogeneity-Aware Federated Learning for Collaborative Remote Sensing Semantic Segmentation | Jieyi Tan et.al. | 2404.09292 | null |
2024-04-14 | RoofDiffusion: Constructing Roofs from Severely Corrupted Point Data via Diffusion | Kyle Shih-Huang Lo et.al. | 2404.09290 | link |
2024-04-14 | SyntStereo2Real: Edge-Aware GAN for Remote Sensing Image-to-Image Translation while Maintaining Stereo Constraint | Vasudha Venkatesan et.al. | 2404.09277 | null |
2024-04-14 | Change Guiding Network: Incorporating Change Prior to Guide Change Detection in Remote Sensing Imagery | Chengxi Han et.al. | 2404.09179 | link |
2024-04-14 | HANet: A Hierarchical Attention Network for Change Detection With Bitemporal Very-High-Resolution Remote Sensing Images | Chengxi Han et.al. | 2404.09178 | link |
2024-04-17 | Diffusion Models Meet Remote Sensing: Principles, Methods, and Perspectives | Yidan Liu et.al. | 2404.08926 | null |
2024-04-13 | ChangeAnywhere: Sample Generation for Remote Sensing Change Detection via Semantic Latent Diffusion Model | Kai Tang et.al. | 2404.08892 | link |
2024-04-12 | SpectralMamba: Efficient Mamba for Hyperspectral Image Classification | Jing Yao et.al. | 2404.08489 | link |
2024-04-12 | Mitigating Challenges of the Space Environment for Onboard Artificial Intelligence: Design Overview of the Imaging Payload on SpIRIT | Miguel Ortiz del Castillo et.al. | 2404.08399 | null |
2024-04-12 | Quantum integrated sensing and communication via entanglement | Yu-Chen Liu et.al. | 2404.08342 | null |
2024-04-11 | Generating Synthetic Satellite Imagery With Deep-Learning Text-to-Image Models – Technical Challenges and Implications for Monitoring and Verification | Tuong Vy Nguyen et.al. | 2404.07754 | null |
2024-04-11 | Automatic Detection of Dark Ship-to-Ship Transfers using Deep Learning and Satellite Imagery | Ollie Ballinger et.al. | 2404.07607 | null |
2024-04-11 | Content-Adaptive Non-Local Convolution for Remote Sensing Pansharpening | Yule Duan et.al. | 2404.07543 | link |
2024-04-11 | Floquet engineering Rydberg sub-THz frequency comb spectroscopy | Li-Hua Zhang et.al. | 2404.07433 | null |
2024-04-11 | Simplifying Two-Stage Detectors for On-Device Inference in Remote Sensing | Jaemin Kang et.al. | 2404.07405 | null |
2024-04-10 | Impact of far-side structures observed by Solar Orbiter on coronal and heliospheric wind simulations | Barbara Perri et.al. | 2404.06794 | null |
2024-04-10 | YOLO based Ocean Eddy Localization with AWS SageMaker | Seraj Al Mahmud Mostafa et.al. | 2404.06744 | null |
2024-04-10 | Deep Generative Data Assimilation in Multimodal Setting | Yongquan Qu et.al. | 2404.06665 | link |
2024-04-09 | FlameFinder: Illuminating Obscured Fire through Smoke with Attentive Deep Metric Learning | Hossein Rajoli et.al. | 2404.06653 | null |
2024-04-09 | Raster Forge: Interactive Raster Manipulation Library and GUI for Python | Afonso Oliveira et.al. | 2404.06389 | link |
2024-04-08 | Evaluating the Efficacy of Cut-and-Paste Data Augmentation in Semantic Segmentation for Satellite Imagery | Ionut M. Motoi et.al. | 2404.05693 | link |
2024-04-08 | Impact of LiDAR visualisations on semantic segmentation of archaeological objects | Raveerat Jaturapitpornchai et.al. | 2404.05512 | null |
2024-04-08 | Pansharpening of PRISMA products for archaeological prospection | Gregory Sech et.al. | 2404.05447 | null |
2024-04-08 | In-Flight Estimation of Instrument Spectral Response Functions Using Sparse Representations | Jihanne El Haouari et.al. | 2404.05298 | null |
2024-04-08 | Empirical Upscaling of Point-scale Soil Moisture Measurements for Spatial Evaluation of Model Simulations and Satellite Retrievals | Yi Yu et.al. | 2404.05229 | null |
2024-04-07 | LRNet: Change detection of high-resolution remote sensing imagery via strategy of localization-then-refinement | Huan Zhong et.al. | 2404.04884 | null |
2024-04-07 | 3D Building Reconstruction from Monocular Remote Sensing Images with Multi-level Supervisions | Weijia Li et.al. | 2404.04823 | link |
2024-04-06 | Panoptic Perception: A Novel Task and Fine-grained Dataset for Universal Remote Sensing Image Interpretation | Danpei Zhao et.al. | 2404.04608 | null |
2024-04-06 | Frequency Decomposition-Driven Unsupervised Domain Adaptation for Remote Sensing Image Semantic Segmentation | Xianping Ma et.al. | 2404.04531 | link |
2024-04-06 | VTR: An Optimized Vision Transformer for SAR ATR Acceleration on FPGA | Sachini Wickramasinghe et.al. | 2404.04527 | null |
2024-04-05 | Sen2Chain: An Open-Source Toolbox for Processing Sentinel-2 Satellite Images and Producing Time-Series of Spectral Indices | Christophe Revillion et.al. | 2404.04305 | null |
2024-04-05 | Deep Learning for Satellite Image Time Series Analysis: A Review | Lynn Miller et.al. | 2404.03936 | null |
2024-04-05 | Real-GDSR: Real-World Guided DSM Super-Resolution via Edge-Enhancing Residual Network | Daniel Panangian et.al. | 2404.03930 | null |
2024-04-03 | Convolutional variational autoencoders for secure lossy image compression in remote sensing | Alessandro Giuliano et.al. | 2404.03696 | null |
2024-04-11 | ChangeMamba: Remote Sensing Change Detection with Spatio-Temporal State Space Model | Hongruixuan Chen et.al. | 2404.03425 | link |
2024-04-04 | Spatio-Spectral Structure Tensor Total Variation for Hyperspectral Image Denoising and Destriping | Shingo Takemoto et.al. | 2404.03313 | link |
2024-04-07 | Linear Anchored Gaussian Mixture Model for Location and Width Computation of Objects in Thick Line Shape | Nafaa Nacereddine et.al. | 2404.03043 | null |
2024-04-03 | FlightScope: A Deep Comprehensive Assessment of Aircraft Detection Algorithms in Satellite Imagery | Safouane El Ghazouali et.al. | 2404.02877 | link |
2024-04-10 | RS-Mamba for Large Remote Sensing Image Dense Prediction | Sijie Zhao et.al. | 2404.02668 | link |
2024-04-03 | RS3Mamba: Visual State Space Model for Remote Sensing Images Semantic Segmentation | Xianping Ma et.al. | 2404.02457 | link |
2024-04-02 | Remote sensing framework for geological mapping via stacked autoencoders and clustering | Sandeep Nagar et.al. | 2404.02180 | link |
2024-04-03 | ResNet with Integrated Convolutional Block Attention Module for Ship Classification Using Transfer Learning on Optical Satellite Imagery | Ryan Donghan Kwon et.al. | 2404.02135 | null |
2024-04-02 | Satellite Federated Edge Learning: Architecture Design and Convergence Analysis | Yuanming Shi et.al. | 2404.01875 | null |
2024-04-02 | Global Mapping of Exposure and Physical Vulnerability Dynamics in Least Developed Countries using Remote Sensing and Machine Learning | Joshua Dimasaka et.al. | 2404.01748 | null |
2024-04-02 | Samba: Semantic Segmentation of Remotely Sensed Images with State Space Model | Qinfeng Zhu et.al. | 2404.01705 | link |
2024-04-02 | LR-FPN: Enhancing Remote Sensing Object Detection with Location Refined Feature Pyramid Network | Hanqian Li et.al. | 2404.01614 | null |
2024-04-01 | CMT: Cross Modulation Transformer with Hybrid Loss for Pansharpening | Wen-Jie Shu et.al. | 2404.01121 | null |
2024-04-01 | S2RC-GCN: A Spatial-Spectral Reliable Contrastive Graph Convolutional Network for Complex Land Cover Classification Using Hyperspectral Images | Renxiang Guan et.al. | 2404.00964 | null |
2024-04-01 | A Novel Algorithm for Digital Lithological Mapping-Case Studies in Sri Lanka’s Mineral Exploration | R. M. L. S. Ramanayake et.al. | 2404.00896 | null |
2024-03-31 | Denoising Low-dose Images Using Deep Learning of Time Series Images | Yang Shao et.al. | 2404.00510 | null |
2024-03-30 | HSIMamba: Hyperpsectral Imaging Efficient Feature Learning with Bidirectional State Space for Classification | Judy X Yang et.al. | 2404.00272 | link |
2024-03-29 | Multi-Region Transfer Learning for Segmentation of Crop Field Boundaries in Satellite Images with Limited Labels | Hannah Kerner et.al. | 2404.00179 | null |
2024-03-29 | H2RSVLM: Towards Helpful and Honest Remote Sensing Large Vision Language Model | Chao Pang et.al. | 2403.20213 | link |
2024-03-28 | Dual-Frequency Radar Wave-Inversion for Sub-Surface Material Characterization | Ishfaq Aziz et.al. | 2403.19853 | null |
2024-03-28 | RSMamba: Remote Sensing Image Classification with State Space Model | Keyan Chen et.al. | 2403.19654 | link |
2024-04-01 | Change-Agent: Towards Interactive Comprehensive Remote Sensing Change Interpretation and Analysis | Chenyang Liu et.al. | 2403.19646 | link |
2024-03-27 | SAT-NGP : Unleashing Neural Graphics Primitives for Fast Relightable Transient-Free 3D reconstruction from Satellite Imagery | Camille Billouard et.al. | 2403.18711 | link |
2024-03-27 | Energy-ordered resource stratification as an agnostic signature of life | Akshit Goyal et.al. | 2403.18614 | null |
2024-03-27 | Homogeneous Tokenizer Matters: Homogeneous Visual Tokenizer for Remote Sensing Image Understanding | Run Shao et.al. | 2403.18593 | link |
2024-03-27 | TAFormer: A Unified Target-Aware Transformer for Video and Motion Joint Prediction in Aerial Scenes | Liangyu Xu et.al. | 2403.18238 | null |
2024-03-26 | ELGC-Net: Efficient Local-Global Context Aggregation for Remote Sensing Change Detection | Mubashir Noman et.al. | 2403.17909 | link |
2024-03-26 | Sen2Fire: A Challenging Benchmark Dataset for Wildfire Detection using Sentinel Data | Yonghao Xu et.al. | 2403.17884 | null |
2024-03-26 | Building Bridges across Spatial and Temporal Resolutions: Reference-Based Super-Resolution via Change Priors and Conditional Diffusion Model | Runmin Dong et.al. | 2403.17460 | link |
2024-03-25 | In the Search for Optimal Multi-view Learning Models for Crop Classification with Global Remote Sensing Data | Francisco Mena et.al. | 2403.16582 | link |
2024-03-22 | Federated Bayesian Deep Learning: The Application of Statistical Aggregation Methods to Bayesian Models | John Fischer et.al. | 2403.15263 | null |
2024-03-22 | An Integrated Neighborhood and Scale Information Network for Open-Pit Mine Change Detection in High-Resolution Remote Sensing Images | Zilin Xie et.al. | 2403.15032 | null |
2024-03-15 | A2CI: A Cloud-based, Service-oriented Geospatial Cyberinfrastructure to Support Atmospheric Research | Wenwen Li et.al. | 2403.14693 | null |
2024-03-21 | Global, robust and comparable digital carbon assets | Sadiq Jaffer et.al. | 2403.14581 | null |
2024-03-21 | Estimating Physical Information Consistency of Channel Data Augmentation for Remote Sensing Images | Tom Burgert et.al. | 2403.14547 | null |
2024-03-21 | Early Flood Warning Using Satellite-Derived Convective System and Precipitation Data – A Retrospective Case Study of Central Vietnam | Tran-Vu La et.al. | 2403.14395 | null |
2024-03-21 | Assimilation of SWOT Altimetry and Sentinel-1 Flood Extent Observations for Flood Reanalysis – A Proof-of-Concept | Thanh Huy Nguyen et.al. | 2403.14394 | null |
2024-03-21 | Impact Assessment of Missing Data in Model Predictions for Earth Observation Applications | Francisco Mena et.al. | 2403.14297 | link |
2024-03-21 | HySim: An Efficient Hybrid Similarity Measure for Patch Matching in Image Inpainting | Saad Noufel et.al. | 2403.14292 | null |
2024-03-21 | Training point-based deep learning networks for forest segmentation with synthetic data | Francisco Raverta Capua et.al. | 2403.14115 | link |
2024-03-20 | Leveraging feature communication in federated learning for remote sensing image classification | Anh-Kiet Duong et.al. | 2403.13575 | null |
2024-03-20 | MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining | Di Wang et.al. | 2403.13430 | link |
2024-03-20 | Few-shot Oriented Object Detection with Memorable Contrastive Learning in Remote Sensing Images | Jiawei Zhou et.al. | 2403.13375 | null |
2024-03-18 | IDF-CR: Iterative Diffusion Process for Divide-and-Conquer Cloud Removal in Remote-sensing Images | Meilin Wang et.al. | 2403.11870 | link |
2024-03-22 | LSKNet: A Foundation Lightweight Backbone for Remote Sensing | Yuxuan Li et.al. | 2403.11735 | link |
2024-03-25 | CRS-Diff: Controllable Generative Remote Sensing Foundation Model | Datao Tang et.al. | 2403.11614 | link |
2024-03-17 | Adaptive Semantic-Enhanced Denoising Diffusion Probabilistic Model for Remote Sensing Image Super-Resolution | Jialu Sui et.al. | 2403.11078 | link |
2024-03-16 | LuoJiaHOG: A Hierarchy Oriented Geo-aware Image Caption Dataset for Remote Sensing Image-Text Retrival | Yuanxin Zhao et.al. | 2403.10887 | null |
2024-03-14 | Uncertainty estimation in spatial interpolation of satellite precipitation with ensemble learning | Georgia Papacharalampous et.al. | 2403.10567 | null |
2024-03-14 | DF4LCZ: A SAM-Empowered Data Fusion Framework for Scene-Level Local Climate Zone Classification | Qianqian Wu et.al. | 2403.09367 | link |
2024-03-14 | Perspective-Equivariant Imaging: an Unsupervised Framework for Multispectral Pansharpening | Andrew Wang et.al. | 2403.09327 | link |
2024-03-14 | Randomized Principal Component Analysis for Hyperspectral Image Classification | Mustafa Ustuner et.al. | 2403.09117 | null |
2024-03-13 | Local Binary and Multiclass SVMs Trained on a Quantum Annealer | Enrico Zardini et.al. | 2403.08584 | link |
2024-03-13 | Causal Graph Neural Networks for Wildfire Danger Prediction | Shan Zhao et.al. | 2403.08414 | null |
2024-03-13 | Efficient Prompt Tuning of Large Vision-Language Model for Fine-Grained Ship Classification | Long Lan et.al. | 2403.08271 | null |
2024-03-14 | Red Teaming Models for Hyperspectral Image Analysis Using Explainable AI | Vladimir Zaigrajew et.al. | 2403.08017 | null |
2024-03-12 | Feasibility of machine learning-based rice yield prediction in India at the district level using climate reanalysis data | Djavan De Clercq et.al. | 2403.07967 | null |
2024-03-12 | RSBuilding: Towards General Remote Sensing Image Building Extraction and Change Detection with Foundation Model | Mingze Wang et.al. | 2403.07564 | link |
2024-03-12 | Automated Discovery of Anomalous Features in Ultra-Large Planetary Remote Sensing Datasets using Variational Autoencoders | Adam Lesnikowski et.al. | 2403.07424 | link |
2024-03-12 | ACMI: An index for exposed coal mapping using Landsat imagery | Zhen Yang et.al. | 2403.07220 | null |
2024-03-21 | A Geospatial Approach to Predicting Desert Locust Breeding Grounds in Africa | Ibrahim Salihu Yusuf et.al. | 2403.06860 | null |
2024-03-13 | Koopman Ensembles for Probabilistic Time Series Forecasting | Anthony Frion et.al. | 2403.06757 | link |
2024-03-20 | Poly Kernel Inception Network for Remote Sensing Detection | Xinhao Cai et.al. | 2403.06258 | link |
2024-03-09 | Learned 3D volumetric recovery of clouds and its uncertainty for climate analysis | Roi Ronen et.al. | 2403.05932 | null |
2024-03-09 | Room temperature single-photon terahertz detection with thermal Rydberg atoms | Danyang Li et.al. | 2403.05833 | null |
2024-03-09 | Weakly Supervised Change Detection via Knowledge Distillation and Multiscale Sigmoid Inference | Binghao Lu et.al. | 2403.05796 | link |
2024-03-08 | Probabilistic Image-Driven Traffic Modeling via Remote Sensing | Scott Workman et.al. | 2403.05521 | null |
2024-03-08 | EVD4UAV: An Altitude-Sensitive Benchmark to Evade Vehicle Detection in UAV | Huiming Sun et.al. | 2403.05422 | link |
2024-03-08 | Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery | Mubashir Noman et.al. | 2403.05419 | link |
2024-03-08 | Exploring Robust Features for Few-Shot Object Detection in Satellite Imagery | Xavier Bou et.al. | 2403.05381 | link |
2024-03-11 | Self-Supervision in Time for Satellite Images(S3-TSS): A novel method of SSL technique in Satellite images | Akansh Maurya et.al. | 2403.04859 | link |
2024-03-07 | Impacts of Color and Texture Distortions on Earth Observation Data in Deep Learning | Martin Willbo et.al. | 2403.04385 | null |
2024-03-10 | Photon Absorption Remote Sensing (PARS): A Comprehensive Approach to Label-free Absorption Microscopy Across Biological Scales | Ben Ecclestone et.al. | 2403.04229 | null |
2024-03-06 | Popeye: A Unified Visual-Language Model for Multi-Source Ship Detection from Remote Sensing Imagery | Wei Zhang et.al. | 2403.03790 | null |
2024-03-06 | Causal Prototype-inspired Contrast Adaptation for Unsupervised Domain Adaptive Semantic Segmentation of High-resolution Remote Sensing Imagery | Jingru Zhu et.al. | 2403.03704 | null |
2024-03-06 | Portraying the Need for Temporal Data in Flood Detection via Sentinel-1 | Xavier Bou et.al. | 2403.03671 | null |
2024-03-05 | Remote sensing of soil moisture using Rydberg atoms and satellite signals of opportunity | Darmindra Arumugam et.al. | 2403.03175 | null |
2024-03-05 | From Spectra to Biophysical Insights: End-to-End Learning with a Biased Radiative Transfer Model | Yihang She et.al. | 2403.02922 | link |
2024-03-05 | DDF: A Novel Dual-Domain Image Fusion Strategy for Remote Sensing Image Semantic Segmentation with Unsupervised Domain Adaptation | Lingyan Ran et.al. | 2403.02784 | null |
2024-03-04 | UB-FineNet: Urban Building Fine-grained Classification Network for Open-access Satellite Images | Zhiyi He et.al. | 2403.02132 | null |
2024-03-04 | Multi-Spectral Remote Sensing Image Retrieval Using Geospatial Foundation Models | Benedikt Blumenstiel et.al. | 2403.02059 | link |
2024-03-12 | Tree Counting by Bridging 3D Point Clouds with Imagery | Lei Li et.al. | 2403.01932 | null |
2024-03-04 | Semi-Supervised Semantic Segmentation Based on Pseudo-Labels: A Survey | Lingyan Ran et.al. | 2403.01909 | null |
2024-03-03 | AIO2: Online Correction of Object Labels for Deep Learning with Incomplete Annotation in Remote Sensing Image Segmentation | Chenying Liu et.al. | 2403.01641 | link |
2024-03-03 | SA-MixNet: Structure-aware Mixup and Invariance Learning for Scribble-supervised Road Extraction in Remote Sensing Images | Jie Feng et.al. | 2403.01381 | null |
2024-03-01 | Fractal interpolation in the context of prediction accuracy optimization | Alexandra Baicoianu et.al. | 2403.00403 | null |
2024-02-29 | Towards localized accuracy assessment of remote-sensing derived built-up land layers across the rural-urban continuum | Johannes H. Uhl et.al. | 2403.00166 | null |
2024-02-29 | Thematic agreement assessment of gridded, multi-modal geospatial datasets of different semantics and spatial granularities | Johannes H. Uhl et.al. | 2403.00161 | null |
2024-02-29 | RSAM-Seg: A SAM-based Approach with Prior Knowledge Integration for Remote Sensing Image Semantic Segmentation | Jie Zhang et.al. | 2402.19004 | null |
2024-02-29 | Boosting Semi-Supervised Object Detection in Remote Sensing Images With Active Teaching | Boxuan Zhang et.al. | 2402.18958 | null |
2024-02-28 | Urban Green Index estimation based on data collected by remote sensing for Romanian cities | Marian Necula et.al. | 2402.18618 | null |
2024-02-27 | Cartographie de l’habitat de reproduction du tétras-lyre (Lyrurus tetrix) dans les Alpes françaises | Alexandre Defossez et.al. | 2402.18597 | null |
2024-02-28 | Time-efficient filtering of polarimetric data by checking physical realizability of experimental Mueller matrices | Tatiana Novikova et.al. | 2402.18555 | link |
2024-02-28 | SD-SLAM: A Semantic SLAM Approach for Dynamic Scenes Based on LiDAR Point Clouds | Feiya Li et.al. | 2402.18318 | null |
2024-02-28 | Infrared Small Target Detection via tensor $L_{2,1}$ norm minimization and ASSTV regularization: A Novel Tensor Recovery Approach | Jiqian Zhao et.al. | 2402.18003 | null |
2024-02-27 | Physics-Informed Machine Learning for the Inverse Design of Wave Scattering Clusters | Joshua R. Tempelman et.al. | 2402.17816 | null |
2024-02-27 | Enhancing Hyperspectral Images via Diffusion Model and Group-Autoencoder Super-resolution Network | Zhaoyang Wang et.al. | 2402.17285 | link |
2024-02-27 | Fibre-integrated van der Waals quantum sensor with an optimal cavity interface | Jong Sung Moon et.al. | 2402.17095 | null |
2024-02-26 | Automated Floodwater Depth Estimation Using Large Multimodal Model for Rapid Flood Mapping | Temitope Akinboyewa et.al. | 2402.16684 | null |
2024-02-26 | Quick unsupervised hyperspectral dimensionality reduction for earth observation: a comparison | Daniela Lupu et.al. | 2402.16566 | null |
2024-02-26 | Intelligent Known and Novel Aircraft Recognition – A Shift from Classification to Similarity Learning for Combat Identification | Ahmad Saeed et.al. | 2402.16486 | null |
2024-02-26 | HSONet:A Siamese foreground association-driven hard case sample optimization network for high-resolution remote sensing image change detection | Chao Tao et.al. | 2402.16242 | null |
2024-02-25 | Task Specific Pretraining with Noisy Labels for Remote sensing Image Segmentation | Chenying Liu et.al. | 2402.16164 | null |
2024-02-25 | Semi-supervised Open-World Object Detection | Sahal Shaji Mullappilly et.al. | 2402.16013 | link |
2024-03-06 | Cross-Resolution Land Cover Classification Using Outdated Products and Transformers | Huan Ni et.al. | 2402.16001 | link |
2024-02-24 | DeepLight: Reconstructing High-Resolution Observations of Nighttime Light With Multi-Modal Remote Sensing Data | Lixian Zhang et.al. | 2402.15659 | link |
2024-02-22 | AuroraMag: Twin Explorer of Asymmetry in Aurora and Solar Wind-Magnetosphere Coupling | Ankush Bhaskar et.al. | 2402.14325 | null |
2024-03-01 | BenchCloudVision: A Benchmark Analysis of Deep Learning Approaches for Cloud Detection and Segmentation in Remote Sensing Imagery | Loddo Fabio et.al. | 2402.13918 | link |
2024-02-21 | Opening the Black-Box: A Systematic Review on Explainable AI in Remote Sensing | Adrian Höhl et.al. | 2402.13791 | null |
2024-02-21 | Robustness analysis and station-keeping control of an interferometer formation flying mission in low Earth orbit | Cristina Erbeia et.al. | 2402.13702 | null |
2024-02-20 | On the Origin of the sudden Heliospheric Open Magnetic Flux Enhancement during the 2014 Pole Reversal | Stephan G. Heinemann et.al. | 2402.12805 | null |
2024-02-19 | Nonlocality enhanced precision in quantum polarimetry via entangled photons | Ali Pedram et.al. | 2402.11932 | null |
2024-02-26 | ChatEarthNet: A Global-Scale Image-Text Dataset Empowering Vision-Language Geo-Foundation Models | Zhenghang Yuan et.al. | 2402.11325 | link |
2024-02-16 | Model-assisted estimation of domain totals, areas, and densities in two-stage sample survey designs | Hans-Erik Andersen et.al. | 2402.11029 | null |
2024-02-15 | The affect of Some Meteorological Parameters on Particulate Matters Concentration Over Iraq using Remote Sensing dataset | Sabah Hussein Ali et.al. | 2402.10285 | null |
2024-02-15 | ViGEO: an Assessment of Vision GNNs in Earth Observation | Luca Colomba et.al. | 2402.09962 | link |
2024-02-15 | Mind the Modality Gap: Towards a Remote Sensing Vision-Language Model via Cross-modal Alignment | Angelos Zavras et.al. | 2402.09816 | null |
2024-02-14 | Solid Waste Detection in Remote Sensing Images: A Survey | Piero Fraternali et.al. | 2402.09066 | null |
2024-02-13 | Direct numerical simulation of a thermal turbulent boundary layer: an analogy to simulate bushfires and a testbed for artificial intelligence remote sensing of bushfire propagation | Julio Soria et.al. | 2402.08157 | null |
2024-02-12 | Spectrum Coexistence of Satellite-borne Passive Radiometry and Terrestrial Next-G Networks | Mohammad Koosha et.al. | 2402.08002 | null |
2024-02-10 | A Change Detection Reality Check | Isaac Corley et.al. | 2402.06994 | link |
2024-02-08 | Ai4Fapar: How artificial intelligence can help to forecast the seasonal earth observation signal | Filip Sabo et.al. | 2402.06684 | null |
2024-02-04 | Using remotely sensed data for air pollution assessment | Teresa Bernardino et.al. | 2402.06653 | null |
2024-02-09 | Large Language Models for Captioning and Retrieving Remote Sensing Images | João Daniel Silva et.al. | 2402.06475 | null |
2024-02-08 | 3D-2D Neural Nets for Phase Retrieval in Noisy Interferometric Imaging | Andrew H. Proppe et.al. | 2402.06063 | null |
2024-02-08 | On Convolutional Vision Transformers for Yield Prediction | Alvin Inderka et.al. | 2402.05557 | null |
2024-02-07 | Knowledge Distillation for Road Detection based on cross-model Semi-Supervised Learning | Wanli Ma et.al. | 2402.05305 | null |
2024-02-07 | Efficient Multi-Resolution Fusion for Remote Sensing Data with Label Uncertainty | Hersh Vakharia et.al. | 2402.05045 | link |
2024-02-07 | Accurate Zernike-Corrected Phase Screens for Arbitrary Power Spectra | David Bachmann et.al. | 2402.04826 | null |
2024-02-05 | Small area estimation of forest biomass via a two-stage model for continuous zero-inflated data | Grayson W. White et.al. | 2402.03263 | null |
2024-02-05 | AdaTreeFormer: Few Shot Domain Adaptation for Tree Counting from a Single High-Resolution Image | Hamed Amini Amirkolaee et.al. | 2402.02956 | link |
2024-03-18 | LHRS-Bot: Empowering Remote Sensing with VGI-Enhanced Large Multimodal Language Model | Dilxat Muhtar et.al. | 2402.02544 | link |
2024-02-03 | A 3-D Full-Wave Model to Study the Impact of Soybean Components and Structure on L-Band Backscatter | Kaiser Niknam et.al. | 2402.02292 | null |
2024-03-05 | Zero-shot sketch-based remote sensing image retrieval based on multi-level and attention-guided tokenization | Bo Yang et.al. | 2402.02141 | link |
2024-02-03 | Enhancing crop classification accuracy by synthetic SAR-Optical data generation using deep learning | Ali Mirzaei et.al. | 2402.02121 | null |
2024-02-03 | Deep Semantic-Visual Alignment for Zero-Shot Remote Sensing Image Scene Classification | Wenjia Xu et.al. | 2402.02094 | link |
2024-02-01 | Seismic Traveltime Tomography with Label-free Learning | Feng Wang et.al. | 2402.00310 | link |
2024-01-08 | Kronecker Product Feature Fusion for Convolutional Neural Network in Remote Sensing Scene Classification | Yinzhu Cheng et.al. | 2402.00036 | null |
2024-01-31 | Shrub of a thousand faces: an individual segmentation from satellite images using deep learning | Rohaifa Khaldi et.al. | 2401.17985 | null |
2024-01-31 | Source-free Domain Adaptive Object Detection in Remote Sensing Images | Weixing Liu et.al. | 2401.17916 | null |
2024-01-31 | Convolution Meets LoRA: Parameter Efficient Finetuning for Segment Anything Model | Zihan Zhong et.al. | 2401.17868 | link |
2024-02-01 | Tiered approach for rapid damage characterisation of infrastructure enabled by remote sensing and deep learning technologies | Nadiia Kopiika et.al. | 2401.17759 | null |
2024-03-11 | Local Feature Matching Using Deep Learning: A Survey | Shibiao Xu et.al. | 2401.17592 | link |
2024-01-30 | Frequency-domain multiplexing of SNSPDs with tunable superconducting resonators | Sasha Sypkens et.al. | 2401.17454 | null |
2024-01-30 | Towards Assessing the Synthetic-to-Measured Adversarial Vulnerability of SAR ATR | Bowen Peng et.al. | 2401.17038 | link |
2024-03-08 | EarthGPT: A Universal Multi-modal Large Language Model for Multi-sensor Image Comprehension in Remote Sensing Domain | Wei Zhang et.al. | 2401.16822 | link |
2024-01-29 | SAT-CEP-monitor: An air quality monitoring software architecture combining complex event processing with satellite remote sensing | Badr-Eddine Boudriki Semlali et.al. | 2401.16339 | null |
2024-01-29 | Bridging the Domain Gap: A Simple Domain Matching Method for Reference-based Image Super-Resolution in Remote Sensing | Jeongho Min et.al. | 2401.15944 | null |
2024-01-29 | Assessment of the area measurement on Cartosat-1 image | Joanna Pluto-Kossakowska et.al. | 2401.15932 | null |
2024-01-29 | Cross-Scale MAE: A Tale of Multi-Scale Exploitation in Remote Sensing | Maofeng Tang et.al. | 2401.15855 | link |
2024-01-28 | Improvements of readout signal integrity in mid-infrared superconducting nanowire single photon detectors | Sahil R. Patel et.al. | 2401.15764 | null |
2024-01-26 | Biological Valuation Map of Flanders: A Sentinel-2 Imagery Analysis | Mingshi Li et.al. | 2401.15223 | null |
2024-01-26 | Metabolic light absorption, scattering and emission (MetaLASE) microscopy | Brendon S. Restall et.al. | 2401.15135 | null |
2024-01-25 | Diffusion Enhancement for Cloud Removal in Ultra-Resolution Remote Sensing Imagery | Jialu Sui et.al. | 2401.15105 | link |
2024-01-26 | Learning Neural Radiance Fields of Forest Structure for Scalable and Fine Monitoring | Juan Castorena et.al. | 2401.15029 | null |
2024-01-26 | Towards Robust Hyperspectral Anomaly Detection: Decomposing Background, Anomaly, and Mixed Noise via Convex Optimization | Koyo Sato et.al. | 2401.14814 | null |
2024-01-26 | Study of the gOMP Algorithm for Recovery of Compressed Sensed Hyperspectral Images | Jon Alvarez Justo et.al. | 2401.14786 | null |
2024-01-25 | Efficient stripe artefact removal by a variational method: application to light-sheet microscopy, FIB-SEM and remote sensing images | Niklas Rottmayer et.al. | 2401.14220 | null |
2024-01-23 | Local Background Estimation for Improved Gas Plume Identification in Hyperspectral Images | Scout Jarman et.al. | 2401.13068 | null |
2024-01-23 | Unlocking the Potential: Multi-task Deep Learning for Spaceborne Quantitative Monitoring of Fugitive Methane Plumes | Guoxin Si et.al. | 2401.12870 | null |
2024-01-22 | Semi-supervised segmentation of land cover images using nonlinear canonical correlation analysis with multiple features and t-SNE | Hong Wei et.al. | 2401.12164 | null |
2024-01-22 | Secure Multi-hop Telemetry Broadcasts for UAV Swarm Communication | Randolf Rotta et.al. | 2401.11915 | null |
2024-01-22 | Adaptive Fusion of Multi-view Remote Sensing data for Optimal Sub-field Crop Yield Prediction | Francisco Mena et.al. | 2401.11844 | link |
2024-01-22 | MetaSeg: Content-Aware Meta-Net for Omni-Supervised Semantic Segmentation | Shenwang Jiang et.al. | 2401.11738 | null |
2024-01-21 | CaBuAr: California Burned Areas dataset for delineation | Daniele Rege Cambrin et.al. | 2401.11519 | link |
2024-01-21 | Embedded Hyperspectral Band Selection with Adaptive Optimization for Image Semantic Segmentation | Yaniv Zimmer et.al. | 2401.11420 | null |
2024-01-18 | Enhanced Automated Quality Assessment Network for Interactive Building Segmentation in High-Resolution Remote Sensing Imagery | Zhili Zhang et.al. | 2401.09828 | link |
2024-01-18 | SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model | Yang Zhan et.al. | 2401.09712 | link |
2024-01-17 | Exact Analytical Solution of the One-Dimensional Time-Dependent Radiative Transfer Equation with Linear Scattering | Vladimir Allaxwerdian et.al. | 2401.09511 | null |
2024-01-15 | 3DMASC: Accessible, explainable 3D point clouds classification. Application to Bi-spectral Topo-bathymetric lidar data | Mathilde Letard et.al. | 2401.09481 | link |
2024-01-17 | Siamese Meets Diffusion Network: SMDNet for Enhanced Change Detection in High-Resolution RS Imagery | Jia Jia et.al. | 2401.09325 | null |
2024-01-17 | PixelDINO: Semi-Supervised Semantic Segmentation for Detecting Permafrost Disturbances | Konrad Heidler et.al. | 2401.09271 | link |
2024-01-17 | Remote Sensing ChatGPT: Solving Remote Sensing Tasks with ChatGPT and Visual Models | Haonan Guo et.al. | 2401.09083 | link |
2024-01-17 | Change Detection Between Optical Remote Sensing Imagery and Map Data via Segment Anything Model (SAM) | Hongruixuan Chen et.al. | 2401.09019 | null |
2024-01-17 | Learning to detect cloud and snow in remote sensing images from noisy labels | Zili Liu et.al. | 2401.08932 | null |
2024-02-12 | Remote sensing of a levitated superconductor with a flux-tunable microwave cavity | Philip Schmidt et.al. | 2401.08854 | null |
2024-01-16 | Image Fusion in Remote Sensing: An Overview and Meta Analysis | Hessah Albanwan et.al. | 2401.08837 | null |
2024-01-16 | Deep Linear Array Pushbroom Image Restoration: A Degradation Pipeline and Jitter-Aware Restoration Network | Zida Chen et.al. | 2401.08171 | link |
2024-01-16 | Robust Tiny Object Detection in Aerial Images amidst Label Noise | Haoran Zhu et.al. | 2401.08056 | link |
2024-01-15 | Exploring Masked Autoencoders for Sensor-Agnostic Image Retrieval in Remote Sensing | Jakob Hackstein et.al. | 2401.07782 | link |
2024-01-15 | One for All: Toward Unified Foundation Models for Earth Vision | Zhitong Xiong et.al. | 2401.07527 | null |
2024-01-15 | PolMERLIN: Self-Supervised Polarimetric Complex SAR Image Despeckling with Masked Networks | Shunya Kato et.al. | 2401.07503 | null |
2024-01-13 | Domain Adaptation for Sustainable Soil Management using Causal and Contrastive Constraint Minimization | Somya Sharma et.al. | 2401.07175 | null |
2024-01-13 | Deep Blind Super-Resolution for Satellite Video | Yi Xiao et.al. | 2401.07139 | link |
2024-02-08 | Multimodal Urban Areas of Interest Generation via Remote Sensing Imagery and Geographical Prior | Chuanji Shi et.al. | 2401.06550 | null |
2024-01-12 | PCB-Vision: A Multiscene RGB-Hyperspectral Benchmark Dataset of Printed Circuit Boards | Elias Arbash et.al. | 2401.06528 | link |
2024-01-10 | Toward distortion-aware change detection in realistic scenarios | Yitao Zhao et.al. | 2401.05157 | null |
2024-01-10 | SwiMDiff: Scene-wide Matching Contrastive Learning with Diffusion Constraint for Remote Sensing Image | Jiayuan Tian et.al. | 2401.05093 | null |
2024-01-10 | Mapping Information in Feature Extraction Transformation for Chirp Signal | Shuyi Gu et.al. | 2401.05000 | null |
2024-01-09 | Spatio-temporal data fusion for the analysis of in situ and remote sensing data using the INLA-SPDE approach | Shiyu He et.al. | 2401.04723 | null |
2024-01-21 | Generic Knowledge Boosted Pre-training For Remote Sensing Images | Ziyue Huang et.al. | 2401.04614 | link |
2024-01-15 | PhilEO Bench: Evaluating Geo-Spatial Foundation Models | Casper Fibaek et.al. | 2401.04464 | link |
2024-01-09 | Deep Covariance Alignment for Domain Adaptive Remote Sensing Image Segmentation | Linshan Wu et.al. | 2401.04412 | link |
2024-03-03 | BD-MSA: Body decouple VHR Remote Sensing Image Change Detection method guided by multi-scale feature information aggregation | Yonghui Tan et.al. | 2401.04330 | null |
2024-01-06 | Real Time Human Detection by Unmanned Aerial Vehicles | Walid Guettala et.al. | 2401.03275 | null |
2024-01-05 | Fus-MAE: A cross-attention-based data fusion approach for Masked Autoencoders in remote sensing | Hugo Chan-To-Hing et.al. | 2401.02764 | null |
2024-01-05 | VoxelNextFusion: A Simple, Unified and Effective Voxel Fusion Framework for Multi-Modal 3D Object Detection | Ziying Song et.al. | 2401.02702 | null |
2024-01-04 | A comprehensive survey of research towards AI-enabled unmanned aerial systems in pre-, active-, and post-wildfire management | Sayed Pedram Haeri Boroujeni et.al. | 2401.02456 | null |
2024-01-04 | A Generalized Variable Projection Algorithm for Least Squares Problems in Atmospheric Remote Sensing | Adelina Bärligea et.al. | 2401.02301 | null |
2024-01-04 | Frequency-Adaptive Pan-Sharpening with Mixture of Experts | Xuanhua He et.al. | 2401.02151 | link |
2024-01-03 | On the Mesoscale Structure of CMEs at Mercury’s Orbit: BepiColombo and Parker Solar Probe Observations | Erika Palmerio et.al. | 2401.01875 | null |
2024-01-03 | A spatial mixture model for spaceborne lidar observations over mixed forest and non-forest land types | Paul B. May et.al. | 2401.01848 | null |
2024-01-04 | Few-shot Adaptation of Multi-modal Foundation Models: A Survey | Fan Liu et.al. | 2401.01736 | null |
2024-01-03 | Quantum sensing of microwave electric fields based on Rydberg atoms | Jinpeng Yuan et.al. | 2401.01655 | null |
2024-01-03 | Free Lunch for Federated Remote Sensing Target Fine-Grained Classification: A Parameter-Efficient Framework | Shengchao Chen et.al. | 2401.01493 | null |
2024-01-10 | Mapping Walnut Water Stress with High Resolution Multispectral UAV Imagery and Machine Learning | Kaitlyn Wang et.al. | 2401.01375 | null |
2024-01-02 | GBSS:a global building semantic segmentation dataset for large-scale remote sensing building extraction | Yuping Hu et.al. | 2401.01178 | null |
2024-01-02 | AI-FLARES: Artificial Intelligence for the Analysis of Solar Flares Data | Michele Piana et.al. | 2401.01104 | null |
2023-12-29 | RS-DGC: Exploring Neighborhood Statistics for Dynamic Gradient Compression on Remote Sensing Image Interpretation | Weiying Xie et.al. | 2312.17530 | null |
2023-12-28 | Extended Aerosol Optical Depth (AOD) time series analysis in an Alpine Valley: A Comparative Study from 2007 to 2023 | Jochen Wagner et.al. | 2312.17362 | null |
2023-12-23 | On the Promises and Challenges of Multimodal Foundation Models for Geographical, Environmental, Agricultural, and Urban Planning Applications | Chenjiao Tan et.al. | 2312.17016 | null |
2023-12-27 | Landslide Detection and Segmentation Using Remote Sensing Images and Deep Neural Network | Cam Le et.al. | 2312.16717 | null |
2023-12-27 | Segment Change Model (SCM) for Unsupervised Change detection in VHR Remote Sensing Images: a Case Study of Buildings | Xiaoliang Tan et.al. | 2312.16410 | link |
2023-12-27 | Analytical Insight of Earth: A Cloud-Platform of Intelligent Computing for Geospatial Big Data | Hao Xu et.al. | 2312.16385 | null |
2023-12-24 | SUNDIAL: 3D Satellite Understanding through Direct, Ambient, and Complex Lighting Decomposition | Nikhil Behari et.al. | 2312.16215 | null |
2023-12-23 | Time Travelling Pixels: Bitemporal Features Integration with Foundation Model for Remote Sensing Image Change Detection | Keyan Chen et.al. | 2312.16202 | link |
2023-12-24 | Superpixel-based and Spatially-regularized Diffusion Learning for Unsupervised Hyperspectral Image Clustering | Kangning Cui et.al. | 2312.15447 | link |
2023-12-24 | Debiased Learning for Remote Sensing Data | Chun-Hsiao Yeh et.al. | 2312.15393 | null |
2023-12-24 | Hyperspectral shadow removal with Iterative Logistic Regression and latent Parametric Linear Combination of Gaussians | Core Francisco Park et.al. | 2312.15386 | null |
2023-12-23 | Pixel-Level Change Detection Pseudo-Label Learning for Remote Sensing Change Captioning | Chenyang Liu et.al. | 2312.15311 | null |
2023-12-20 | SkyScript: A Large and Semantically Diverse Vision-Language Dataset for Remote Sensing | Zhecheng Wang et.al. | 2312.12856 | link |
2023-12-20 | MetaSegNet: Metadata-collaborative Vision-Language Representation Learning for Semantic Segmentation of Remote Sensing Images | Libo Wang et.al. | 2312.12735 | null |
2024-02-27 | Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation | Sihan Liu et.al. | 2312.12470 | link |
2023-12-19 | EarthVQA: Towards Queryable Earth via Relational Reasoning-Based Remote Sensing Visual Question Answering | Junjue Wang et.al. | 2312.12222 | link |
2023-12-18 | Satellite Captioning: Large Language Models to Augment Labeling | Grant Rosario et.al. | 2312.10905 | null |
2023-12-18 | Country-Scale Cropland Mapping in Data-Scarce Settings Using Deep Learning: A Case Study of Nigeria | Joaquin Gajardo et.al. | 2312.10872 | link |
2023-12-17 | Satellite Data Shows Resilience of Tigrayan Farmers in Crop Cultivation During Civil War | Hannah Kerner et.al. | 2312.10819 | link |
2023-12-17 | A Framework of Full-Process Generation Design for Park Green Spaces Based on Remote Sensing Segmentation-GAN-Diffusion | Ran Chen et.al. | 2312.10674 | null |
2023-12-15 | SkySense: A Multi-Modal Remote Sensing Foundation Model Towards Universal Interpretation for Earth Observation Imagery | Xin Guo et.al. | 2312.10115 | null |
2023-12-15 | FoMo-Bench: a multi-modal, multi-scale and multi-task Forest Monitoring Benchmark for remote sensing foundation models | Nikolaos Ioannis Bountos et.al. | 2312.10114 | link |
2023-12-13 | Coexistence of Satellite-borne Passive Radiometry and Terrestrial NextG Wireless Networks in the 1400-1427 MHz Restricted L-Band | Mohammad Koosha et.al. | 2312.08551 | null |
2023-12-13 | SVInvNet: A Densely Connected Encoder-Decoder Architecture for Seismic Velocity Inversion | Mojtaba Najafi Khatounabad et.al. | 2312.08194 | null |
2023-12-13 | Encoder-minimal and Decoder-minimal Framework for Remote Sensing Image Dehazing | Yuanbo Wen et.al. | 2312.07849 | link |