This paper presents a bottom-up approach that combines audio and video to simultaneously locate individual speakers in the video (2-D source localization) and segment their speech ...
We present a self-calibrating photogeometric method that uses only off-the-shelf hardware to quickly and robustly obtain multi-million point-sampled and colored models...