Our novel H2 augmentation and FPAT with chroma keying successfully achieves high-fidelity head swapping performance. Point and drag your mouse to see the results!
In the film industry, scenarios arise where footage of stunt performers necessitates subsequent replacement with the original actor's head. Deep learning-based head swapping may be a viable solution in such a scenario. However, we point out that existing head swapping frameworks still show artifacts, such as blurry results and imperfect foreground and background distinction. To mitigate this problem, we propose Chroma-HS, a new pipeline that generates high-fidelity results via splitting the head swapping task into the background and the foreground generation. Chroma-HS introduces chroma keying to the head swapping for the first time, which enables a flawless and diverse background generation. To this end, we introduce two novel methods to generate high-fidelity foregrounds. We propose Head shape and long Hair augmentation (H2 augmentation), which mimics diverse head attributes. Finally, Chroma-HS incorporates Foreground Predictive Attention Transformer (FPAT) which generates the foreground region by restricting the attention region with the predicted body mask. Experimental results show that our Chroma-HS significantly outperforms the state-of-the-art head swapping model on benchmark datasets both qualitatively and quantitatively.
We utilized the pre-trained face reenactment network to generate our head swapping results.
[1] Latent Image Animator: Learning to Animate Images via Latent Space Navigation.