• lakemalcom10@lemm.ee
    15 hours ago

    For point 1, they actually addressed that: "The system then translates the speech and maintains the expressive qualities and volume of each speaker's voice while running on a device, such as mobile devices with an Apple M2 chip, like laptops and Apple Vision Pro. (The team avoided using cloud computing because of the privacy concerns with voice cloning.) Finally, when speakers move their heads, the system continues to track the direction and qualities of their voices as they change."

    • Ilovethebomb@lemm.ee
      14 hours ago

      The fact that all this can run on a phone is incredible; it sounds very processor-intensive.

      I wonder what it would do to your battery life?

    • stoy@lemmy.zip
      15 hours ago

      If that is enough power, and you can run it without any internet access, then yes, it would probably address point 1.