How can the driven-audio feature a and the landmark representation l be used in the cross-attention module? #21

Haoqing-Wang · 2023-08-21T09:57:18Z

As far as I can tell, the driven-audio feature a and the landmark representation l are each a single vector, not a sequence of vectors, so how can they be used as the Key and Value in the cross-attention module?

WoofGH · 2024-04-16T11:06:56Z

Did you understand how this works? I'm totally confused right now😭.
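One possible reading of the question above, offered as a sketch and not as a description of what this repository actually does: cross-attention only needs the Key/Value input to have a sequence axis, and a single vector can be treated as a sequence of length 1. In that case the softmax runs over one key, so every query gets weight 1 and the output is simply the projected value copied to every query position. A second option is to stack a and l as two context tokens, so each query can weight them differently. Everything in the block below (the `cross_attention` helper, the sizes, the random weights, and the assumption that a and l share one dimension) is hypothetical and chosen only for illustration.

```python
import numpy as np

def cross_attention(queries, context, w_q, w_k, w_v):
    """Single-head scaled dot-product cross-attention (illustrative helper).

    queries: (n_q, d_model) tokens that attend, e.g. flattened image features.
    context: (n_ctx, d_ctx) conditioning tokens used for Key and Value.
    """
    q = queries @ w_q                          # (n_q, d_head)
    k = context @ w_k                          # (n_ctx, d_head)
    v = context @ w_v                          # (n_ctx, d_head)
    scores = q @ k.T / np.sqrt(q.shape[-1])    # (n_q, n_ctx)
    scores -= scores.max(axis=-1, keepdims=True)   # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)  # softmax over context tokens
    return weights @ v, weights

rng = np.random.default_rng(0)
d_model, d_ctx, d_head, n_q = 8, 6, 4, 5       # hypothetical sizes
w_q = rng.normal(size=(d_model, d_head))
w_k = rng.normal(size=(d_ctx, d_head))
w_v = rng.normal(size=(d_ctx, d_head))

x = rng.normal(size=(n_q, d_model))   # stand-in for the query features
a = rng.normal(size=(d_ctx,))         # stand-in for the audio feature a
l = rng.normal(size=(d_ctx,))         # stand-in for the landmark vector l (same size assumed)

# Case 1: a single vector reshaped to a length-1 sequence, shape (1, d_ctx).
out_single, w_single = cross_attention(x, a[None, :], w_q, w_k, w_v)

# Case 2: a and l stacked as two context tokens, shape (2, d_ctx).
out_pair, w_pair = cross_attention(x, np.stack([a, l]), w_q, w_k, w_v)

print(w_single.shape, w_pair.shape)
```

With a length-1 context the attention weights carry no information, so the layer acts as a learned projection of the condition that is broadcast to all positions. With two or more context tokens the weights depend on the queries, which is the case where attention does real selection.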
