Full metadata record
DC Field: Value [Language]
dc.contributor.author: Feng, Qi
dc.contributor.author: Shum, Hubert P. H.
dc.contributor.author: Shimamura, Ryo
dc.contributor.author: Morishima, Shigeo
dc.contributor.editor: Skala, Václav
dc.date.accessioned: 2020-07-24T07:00:36Z
dc.date.available: 2020-07-24T07:00:36Z
dc.date.issued: 2020
dc.identifier.citation: Journal of WSCG. 2020, vol. 28, no. 1-2, p. 79-88. [en]
dc.identifier.issn: 1213-6972 (print)
dc.identifier.issn: 1213-6980 (CD-ROM)
dc.identifier.issn: 1213-6964 (on-line)
dc.identifier.uri: http://wscg.zcu.cz/WSCG2020/2020-J_WSCG-1-2.pdf
dc.identifier.uri: http://hdl.handle.net/11025/38428
dc.format: 10 pages [cs]
dc.format.mimetype: application/pdf
dc.language.iso: en [en]
dc.publisher: Václav Skala - UNION Agency [cs]
dc.relation.ispartofseries: Journal of WSCG [en]
dc.rights: © Václav Skala - UNION Agency [cs]
dc.subject: depth estimation [cs]
dc.subject: scene understanding [cs]
dc.subject: data augmentation [cs]
dc.subject: 360 images [cs]
dc.title: Foreground-aware Dense Depth Estimation for 360 Images [en]
dc.type: article [cs]
dc.type: article [en]
dc.rights.access: openAccess [en]
dc.type.version: publishedVersion [en]
dc.description.abstract-translated: With 360 imaging devices becoming widely accessible, omnidirectional content has gained popularity in multiple fields. The ability to estimate depth from a single omnidirectional image can benefit applications such as robotics navigation and virtual reality. However, existing depth estimation approaches produce sub-optimal results on real-world omnidirectional images with dynamic foreground objects. On the one hand, capture-based methods cannot obtain the foreground due to the limitations of the scanning and stitching schemes. On the other hand, it is challenging for synthesis-based methods to generate highly realistic virtual foreground objects that are comparable to real-world ones. In this paper, we propose to augment datasets with realistic foreground objects using an image-based approach, which produces a foreground-aware photorealistic dataset for machine learning algorithms. By exploiting a novel scale-invariant RGB-D correspondence in the spherical domain, we repurpose abundant non-omnidirectional datasets to include realistic foreground objects with correct distortions. We further propose a novel auxiliary deep neural network to estimate both the depth of the omnidirectional images and the mask of the foreground objects, where the two tasks facilitate each other. A new local depth loss considers small regions of interest and ensures that their depth estimations are not smoothed out during the global gradient's optimization. We demonstrate the system using humans as the foreground due to their complexity and contextual importance, while the framework can be generalized to any other foreground objects. Experimental results demonstrate more consistent global estimations and more accurate local estimations compared with state-of-the-art methods. [en]
dc.subject.translated: depth estimation [en]
dc.subject.translated: scene understanding [en]
dc.subject.translated: data augmentation [en]
dc.subject.translated: 360 images [en]
dc.identifier.doi: https://doi.org/10.24132/JWSCG.2020.28.10
dc.type.status: Peer-reviewed [en]
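
The abstract above describes a local depth loss that keeps small regions of interest from being smoothed out by the global objective. As a minimal sketch of that idea only, assuming a PyTorch-style setup, the function below combines a full-image error term with an error term restricted to a binary foreground mask; the name combined_depth_loss, the fg_mask input, and the lambda_local weighting are illustrative assumptions, not the authors' published implementation.

import torch.nn.functional as F

def combined_depth_loss(pred, target, fg_mask, lambda_local=1.0):
    # pred, target: (B, 1, H, W) predicted and ground-truth depth maps.
    # fg_mask: (B, 1, H, W) binary foreground mask (hypothetical input).
    # Global term: mean absolute error over the whole equirectangular image.
    global_loss = F.l1_loss(pred, target)
    # Local term: mean absolute error over foreground pixels only, so that
    # small regions are not diluted by the much larger background area.
    fg = fg_mask.bool()
    if fg.any():
        local_loss = F.l1_loss(pred[fg], target[fg])
    else:
        local_loss = pred.new_tensor(0.0)
    return global_loss + lambda_local * local_loss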
Appears in Collections: Volume 28, Number 1-2 (2020)

Files in This Item:
File        Description   Size       Format
Feng.pdf    Full text     10.45 MB   Adobe PDF


Please use this identifier to cite or link to this item: http://hdl.handle.net/11025/38428

All items in DSpace are protected by copyright, with all rights reserved.