2f Applied Artificial intelligence for Virtual Worlds

AI is a cornerstone technology for the development of Virtual Worlds (VW), in the stage of content creation and in VW operation. Research gaps in this area include the creation and management of digital content, automating processes, improving the quality of virtual experiences, enabling interoperability. Machine learning techniques must be refined to create more realistic and dynamic virtual entities, from non-player characters to virtual assistants. AI can enhance the development of advanced algorithms that can understand and respond to user behaviour in real-time. Additionally, by addressing these research challenges, we can harness the power of AI to create more immersive, interactive, and intelligent VW.

Research topics for AI for Virtual Worlds

AI for perception and sensing

Understanding the environment is crucial for enabling immersive and interactive VR/AR/MR applications. AI plays a significant role in interpreting sensory data, but several challenges remain to achieve truly effective perception and sensing in VW applications. The main envisioned research topics include those discussed in the following sections.

2f.1 Artificial intelligence -based computer vision methods

AI in computer vision empowers virtual environments to understand the physical world. This enables capabilities like object recognition, pose estimation, and spatial mapping, allowing virtual elements to inherit semantics and properties from their physical counterparts. This fosters a more realistic blending of physical and digital worlds, enhancing user interaction and context awareness.

Problem Definition and Research Gap: Current digital-physical interactions are largely limited to graphical overlays. A truly realistic blending demands AI to capture diverse aspects beyond appearance, including object semantics and contextualisation. These properties must be dynamically captured and updated, with changes in one world reciprocally impacting the other, allowing users to perceive and modify virtual elements based on real-world properties.

Research and Innovation Objectives:

Develop and exploit foundational multimodal models (e.g., vision-language models) for comprehensive scene analysis and information transfer to 3D models
Advance vision-based AI for accurate semantic and property association with virtual elements from real-time or acquired 3D data
Ensure dynamic property updates and perceptible real-world reflections in the virtual environment
Enable egocentric, energy efficient and mobile perception of the scene and user interactions

2f.2 AI for Audio analysis

AI-based audio analysis enhances real-time speech recognition, which, integrated with Natural Language Processing (NLP) and Large Language Models (LLMs), facilitates more natural interaction with virtual environments (e.g., smart digital assistants). Voice analysis also detects user emotions. Furthermore, audio analysis improves situation awareness by identifying environmental features crucial for seamlessly blending physical and digital worlds.

Problem Definition and Research Gap: Human hearing is paramount for communication and environmental awareness. To enable natural user interaction in VW, this fundamental capability must be effectively transferred and replicated digitally. This requires sophisticated AI models to accurately interpret diverse real-world audio cues and translate them into actionable information within the virtual environment, bridging a key sensory immersion gap.

Research and Innovation Objectives:

Develop advanced speech recognition and synthesis technologies, combined with Generative AI and LLMs, to create highly interactive virtual smart digital assistants
Enable digital assistants to understand user requests and respond verbally, personalising answers based on language, context, and culture
Utilise audio analysis to enhance situation awareness and identify environmental features for improved physical-digital blending

Haptic devices and wearable sensors provide users with multi-sensory perceptions in VW, including touch, temperature, smell, and force feedback. These technologies capture detailed user data, allowing machines to interpret human intention and emotional state. Simultaneously, AI-based sensor fusion of diverse inputs, such as cameras, eye trackers, and bio signals, enhances the accuracy, robustness, and resilience of virtual applications. By leveraging AI to orchestrate and combine sensor data, systems can maintain functionality even if certain inputs fail, making virtual interactions more immersive, adaptive, and reliable.

Problem Definition and Research Gap: Advancing the state of the art of sensor fusion and training such Ai-models poses several challenges: multimodal datasets must be precisely annotated, data sources must be time-synchronised, and strong coordination is required between individual modality analysis and fusion modules to ensure real-time performance and coherence of the virtual experience.

Research and Innovation Objectives:

Utilise deep machine learning to analyse signals from both haptic and sensing devices
Employ AI to modulate actuation signals for realistic, individual- and context-adaptive haptic feedback
Leverage AI to reliably interpret raw sensor signals into higher-level concepts, enhancing user intention and emotion understanding
Develop and implement machine learning methods, such as hybrid LSTM–Random Forest networks, for effective multi-modal sensor fusion
Enhance accuracy and robustness in virtual applications by strategically combining heterogeneous sensor data

Edge AI and distributed intelligence

Edge AI in VR/XR integrates AI into immersive devices while coordinating with edge nodes and the cloud for optimized performance. This enables real-time, context-aware, and privacy-preserving interactions by distributing tasks across a multi-tiered system. It tackles latency, bandwidth, and security challenges, ensuring smooth and adaptive experiences. Applications span training, simulation, design, healthcare, and entertainment, making Edge AI crucial for next-gen immersive systems.

2f.4 On-device AI for inference in headsets, haptic devices and sensors

On-device AI enables real-time, low-latency, and privacy-preserving interactions in VW without constant cloud connectivity. This is vital for time-sensitive or privacy-critical applications using immersive AR/VR environments. It ensures seamless UX by mitigating delays that could break immersion or cause discomfort, while also reducing energy consumption and data transmission.

Problem Definition and Research Gap: Current AI systems often demand too many resources for edge devices constrained by power, memory, and compute capacity. Many existing solutions rely on cloud connectivity, limiting their use in remote or privacy-sensitive scenarios. A key research gap lies in developing AI models, hardware architectures, and interaction pipelines that can operate fully or partially on-device, enabling distributed intelligence across the VW.

Research and Innovation Objectives:

Develop and benchmark machine learning models tailored for resource-constrained devices, focusing on lightweight neural network architectures and model optimisation techniques (quantisation, pruning, distillation)
Explore hardware acceleration, including custom ASICs and microcontroller-based AI execution, alongside methodologies like hardware/software co-design
Research edge-based federated learning and continual learning paradigms for local adaptation, ensuring privacy-preserving on-device intelligence

2f.5 Federated learning for collaborative training without compromising privacy

VW systems increasingly collect sensitive user data, such as gaze, motion, and physiological responses. Federated learning (FL) allows valuable insights from this data to improve immersive experiences—like adaptive interfaces and personalised haptics. This method safeguards user privacy while facilitating collaborative model training across distributed edge devices.

Problem Definition and Research Gap: Most current VW systems rely on centralised AI models, which limits privacy, increases latency, and hinders scalability. Edge devices in these environments are resource-constrained and manage complex, heterogeneous, and dynamic data streams. Existing FL approaches aren't fully equipped for the real-time, multimodal nature of XR workloads. This necessitates lightweight, context-aware, and privacy-preserving FL mechanisms specifically designed for distributed, real-time VW systems.

Research and Innovation Objectives:

Develop lightweight FL algorithms tailored for streaming, multimodal VW data
Integrate FL with edge-based privacy-preserving mechanisms, such as secure aggregation and differential privacy
Research cross-device and cross-user learning approaches that maintain temporal continuity and interactivity, including hierarchical or decentralised FL architectures

2f.6 Distributed intelligence across devices, cloud, and edge networks

VR/XR experiences require ultra-low latency, high throughput, and real-time responsiveness. Distributed intelligence offers a transformative approach by enabling adaptive partitioning of AI tasks across XR devices, edge nodes, and cloud backends. This optimises resource use, ensures high quality of experience (QoE), and provides continuous, context-aware AI support.

Problem Definition and Research Gap: Current XR systems lack intelligent orchestration strategies to dynamically allocate tasks across heterogeneous edge environments and offer limited support for collaborative intelligence between co-located or remote devices. Effective edge-centric frameworks must balance latency, energy consumption, privacy, and accuracy, adapting to rapidly changing conditions.

Research and Innovation Objectives:

Design distributed inference pipelines split across headsets, controllers, edge nodes, and the cloud, supported by dynamic orchestration and AI model migration based on context
Implement edge-native caching and prefetching for predictive rendering and interaction, leveraging 5G/6G for low-latency task offloading and synchronisation
Integrate AI accelerators across tiers for seamless collaboration, using methodologies like hierarchical task scheduling and cross-layer optimisation

AI for Content Creation

Generative AI extends beyond prediction to creation, generating novel samples that follow training data distributions while remaining distinct. These technologies also hold great potential for design, creation and editing of 3D VW. This includes the generation of assets such as geometry, textures, animations, and entire scenes—enabling faster iteration cycles and lowering barriers for creative professionals and developers alike.

2f.7 Generative AI for virtual content creation and multi-scale 3D asset generation

AI content creation tools accelerate production workflows and reduce the barrier to entry for non-expert creators. Furthermore, multi-scale spatial and temporal data integration empowers the accurate reconstruction and simulation of real-world phenomena in virtual environments. Simultaneously, advances in 3D object generation and novel view synthesis allow for realistic and interactive 3D content creation from limited input, enhancing immersion and streamlining the design process across industries.

Problem Definition and Research Gap: Despite their promise, current diffusion models lack the ability to generate actionable assets in standardized 3D formats required for integration into modern content creation pipelines, limiting their utility in professional environments. Additionally, existing AI models often struggle with integrating heterogeneous datasets across spatial and temporal dimensions, which hinders accurate prediction and simulation of dynamic real-world scenarios. In 3D generation, the need remains for methods that produce high-fidelity, physically plausible objects and views from sparse data while maintaining compatibility with industry-standard graphics workflows—challenges that must be addressed for scalable, real-world deployment.

Research and Innovation Objectives:

Integrate generative models into existing production pipelines to improve content creation efficiency
Design AI systems for multi-scale data fusion using temporal attention and generative reconstruction of missing data
Create models capable of spatio-temporal prediction from heterogeneous sources, improving realism and coherence in virtual environments
Build 3D object generation frameworks using diffusion and GAN architectures, integrating priors like Neural Radiance Fields and Gaussian splatting
Enable advanced features such as scene relighting, material editing, and novel view synthesis while ensuring compatibility with graphics engines

2f.8 Physics-enabled content generation

Physics-enabled content generation is crucial for creating realistic and interactive virtual environments. For designers, simulation developers, and XR content creators, this ensures physical plausibility and natural behaviour of generated elements, which are essential for achieving deep immersion and high usability within VW.

Problem Definition and Research Gap: A key challenge is that current generative models struggle to produce physically accurate results. There is a notable absence of AI models capable of incorporating fundamental physical laws, such as gravity, fluid dynamics, and object continuity, into the content they generate, limiting the realism and fidelity of virtual environments.

Research and Innovation Objectives:

Develop generative models, including diffusion models and physics-informed neural networks, that embed physical constraints
Implement simulation-guided learning techniques for creating content with inherent physical accuracy

2f.9 Storytelling and application workflows

Integrating AI into storytelling and application workflows enables the creation of dynamic, personalised narratives within VW. This enhances user engagement, streamlines content creation processes, and allows for the scalable production of interactive experiences, benefiting narrative designers, game developers, and educators.

Problem Definition and Research Gap: Traditional storytelling workflows are often linear and resource intensive, limiting adaptability and scalability. There is a need for AI-driven systems capable of generating and managing complex narratives in real-time, adapting to user interactions, and significantly reducing manual content creation efforts.

Research and Innovation Objectives:

Develop AI frameworks combining natural language processing, procedural content generation, and user interaction data for adaptive storytelling
Create tools that facilitate real-time narrative generation and seamless integration with existing content pipelines

2f.10 Multi-modality, sounds, audio, and alignment of modalities

This area enables the generation of synchronised talking avatars or human-like faces directly from audio, drastically reducing the need for video data and manual animation. For VW developers and communication platform engineers, it enhances realism, supports ultra-low-bandwidth applications like synthetic video conferencing, and increases accessibility in storytelling, education, and assistive technologies.

Problem Definition and Research Gap: Current systems struggle to produce realistic and expressive human videos purely from audio, particularly in maintaining consistency in lip sync, facial emotion, and head movement across diverse speakers and acoustic conditions. There is a need for models that not only align modalities accurately but also generate high-fidelity talking avatars or human faces from speech alone with minimal input data.

Research and Innovation Objectives:

Integrate speech-driven generative models, such as audio-conditioned diffusion models and deformable neural avatars, for realistic rendering from audio signals
Focus on real-time performance and deployment on lightweight devices for practical application

2f.11 AAI for inclusion and cultural awareness

Virtual environments must support diverse users across cultures and abilities to ensure immersive, inclusive, and globally relevant experiences. For XR developers and accessibility tool providers, AI enables real-time multilingual communication, cultural sensitivity, and adaptive interfaces, enhancing engagement and broadening participation in VW.

Problem Definition and Research Gap: Current platforms lack for cultural adaptation, and accessibility, leading to exclusion and reduced immersion. There is a need for AI systems that support culturally aware interactions and accommodate diverse user needs through real-time adaptation and barrier-free design principles embedded from the start.

Research and Innovation Objectives:

Develop AI models for real-time language translation, sentiment analysis, and cultural understanding
Enable seamless and respectful communication in virtual spaces through integrated AI solutions
Create adaptive interfaces and assistive tools for users with varying abilities
Build content creation tools that incorporate accessibility by design to ensure inclusive virtual experiences

2f.12 Semantic AI and knowledge graphs and knowledge modelling

World models help AI evolve from perception to deep understanding in virtual environments through structured knowledge representation. They tackle semantic interoperability, real-time reasoning, and accessibility challenges, benefiting developers, researchers, and policymakers. This approach also enhances language interaction, content generation, personalized learning, and digital inclusion, creating a smarter and more accessible virtual world.

Problem Definition and Research Gap: VW pose challenges in handling evolving objects and relationships over time. Semantic interoperability is limited by isolated ecosystems and incompatible data models, making data integration and consistency difficult. A key research gap is developing knowledge models that learn from continuous data streams, capture causal relationships, and enable real-time reasoning in dynamic environments.

Research and Innovation Objectives:

Develop adaptive knowledge graphs that evolve based on observed interactions, enabling robust ontologies and semantic reasoning
Advance data-driven knowledge modelling by combining symbolic and statistical methods for learning from VW data streams
Improve natural language understanding for intuitive interaction through speech and text, integrating hybrid symbolic-neural approaches and federated learning

Foundational models for Virtual Worlds

The use of foundation models can be of paramount importance when dealing with and developing solutions for VW. Here are some research topics that might need to be investigated for this

2f.13 Adaptation and specialisation of foundation models

XR experiences demand tailored interactions matching individual cognitive, physical, and contextual user profiles. Current foundation models, trained on generic data, lack the specificity and adaptability required for specialised immersive scenarios like skill training or rehabilitation. For application designers and UX researchers, this research addresses the need for personalised, context-aware interactions in VW.

Problem Definition and Research Gap: Integrating domain-specific knowledge into foundation models is difficult, as expert data is often locked in proprietary multimodal sources. Key challenges include developing efficient multi-modal fine-tuning and Retrieval-Augmented Generation (RAG) methods with limited data. Current models also struggle with personalisation that adapts to user history, behavior, and spatial context with proper semantic grounding.

Research and Innovation Objectives:

Develop methods for multi-modal fine-tuning of foundation models using limited domain-specific data
Create efficient Retrieval-Augmented Generation (RAG) pipelines to integrate structured and unstructured domain knowledge
Enable model personalisation that adapts to individual usage history, behaviour, and spatial context, ensuring robust semantic grounding

2f.14 Foundation models for reasoning, planning, and tool use

XR systems need autonomous agents for user assistance, task planning, and adaptation. While foundation models excel in text, they struggle with contextual planning, memory, and tool use. Integrating structured reasoning with external tools like scene graphs, physics engines, and 3D interfaces is key to enabling real-time intelligent assistance, benefiting developers, immersive learning designers, and DT integrators.

Problem Definition and Research Gap: XR tasks require contextual and sequential reasoning, including memory, long-horizon planning, and 3D grounding. Key gaps include weak task planning in foundation models, limited scene understanding and spatial memory integration, and inadequate frameworks for using external tools like 3D viewers and spatial databases.

Research and Innovation Objectives:

Combine language reasoning with spatial mapping and planning APIs, such as using Chain of Thought (CoT) prompting with 3D graph search
Develop multi-agent architectures for collaborative XR environments using foundation models
Integrate external tool APIs, like physics simulators and CAD libraries, into foundation model workflows
Design memory-augmented architectures for consistent, history-aware interactions and experiences in XR

AI for Avatars

Creating lifelike, expressive avatars is key to VW but presents major AI challenges. Current methods are slow, costly, and can reinforce biases, limiting inclusion and diverse expression. AI’s advanced capabilities also raise concerns about impersonation and realistic fakes, threatening trust and psychological safety in virtual interactions.

2f.15 AI for avatar design and creation

AI for avatar design and creation empowers users to express themselves authentically and creatively in digital spaces. It supports diverse representation and democratises avatar creation across various technical and creative skill levels. This addresses the critical need for inclusive, accessible, and safe digital identities within virtual environments, benefiting end users, designers, developers, and accessibility advocates.

Problem Definition and Research Gap: Current AI avatar tools often lack diversity in representation, may perpetuate algorithmic bias, and can be challenging for non-experts to use. There is a pressing need for AI-driven solutions that ensure inclusivity, prevent the creation of harmful or misleading avatars, and provide accessible, intuitive interfaces for a wide range of users, including those with diverse abilities.

Research and Innovation Objectives:

Develop generative AI models, such as diffusion and GANs, with explicit controls for diversity in ethnicity, gender, age, body type, and ability
Integrate bias detection and mitigation modules, along with safety filters, into avatar creation pipelines
Design user-friendly interfaces with features like real-time customisation and accessibility functionalities (e.g., voice-guided creation, templates for low-vision users)

2f.16 AI for avatar control and autonomy

AI enables avatars to shift between user-driven and autonomous control, enhancing immersion, expression, and believability in virtual environments. It supports adaptive, emotionally expressive interactions while improving engagement and accessibility. Intuitive interfaces help users manage avatar autonomy, ensuring comfort and psychological safety, benefiting developers, designers, and end users.

Problem Definition and Research Gap: Current avatar systems lack robust, explainable AI for nuanced behaviour and context adaptation. There is limited support for seamless transitions between user control and autonomy, and absence of intuitive user interfaces for managing it. Existing models frequently fail to provide deep contextual awareness or believable independent action, particularly in complex, multi-agent VW, creating a significant research gap.

Research and Innovation Objectives:

Develop explainable AI models for behaviour modelling, leveraging reinforcement learning, imitation learning, and cognitive architectures
Integrate deep contextual awareness of geometry, objects, and other avatars to enable adaptive behaviour
Design intuitive user interfaces for live control and customisation of avatar autonomy, ensuring clear user oversight mechanisms

2f.17 Conversational AI and autonomous Non-Player Characters

Conversational AI and Large Language Model (LLM)-driven autonomous Non-Player Characters (NPCs) enable the creation of engaging, believable, and interactive NPCs and avatars that can hold natural, contextually relevant conversations. This enhances immersion, player satisfaction, and the overall quality of virtual experiences. This benefits game developers, VW designers, and educators.

Problem Definition and Research Gap: Current NPCs often lack comprehensive contextual awareness, persistent memory, and the ability to conduct coherent, emotionally intelligent dialogues. There is a need for models that can ground conversations in the virtual world, maintain consistent personalities, remember past interactions, and understand complex user intents and emotions.

Research and Innovation Objectives:

Develop LLM-based conversational agents with multimodal input (text, voice, gesture, facial expression) and robust world grounding
Implement persistent memory, personality modelling, and emotion recognition capabilities in these agents
Integrate ethical alignment frameworks to ensure safe and inclusive interactions

AI for social interaction, crowd simulation, and emergent dynamics enhances the realism and vibrancy of virtual environments by enabling avatars to engage in believable social interactions and group behaviours. This significantly improves user immersion and psychological safety, benefiting game developers, VR designers, social scientists, and urban planners.

Problem Definition and Research Gap: Existing models often lack the capability to generate nuanced non verbal communication, such as gaze, posture, and gestures, and struggle to effectively align multimodal interaction cues. Additionally, simulating emergent crowd behaviours that adhere to social norms and contribute positively to virtual environments remains a complex challenge, limiting the authenticity of virtual social spaces.

Research and Innovation Objectives:

Develop AI models capable of generating nuanced non-verbal communication and aligning multimodal interaction cues
Research and implement methodologies for simulating emergent crowd behaviours that adhere to social norms within virtual environments
Explore agent-based modelling, deep reinforcement learning, and Large Language Models for realistic social dynamics

AI for user interaction and human-AI collaboration

Creating intuitive and adaptive AI interactions in VW is challenging. Real-time user modelling—tracking state, skills, preferences, and cognitive load—is essential but complex. Current systems struggle with predicting behaviour and intentions in dynamic environments where goals shift rapidly. While VW offer accessibility potential, developing AI that seamlessly adapts content in real-time remains a significant task.

2f.19 Human-AI collaboration frameworks and interaction paradigms

Integrating AI into virtual environments can augment human capabilities, address individual limitations, and enhance accessibility. This includes real-time content adaptations, such as colour adjustments for colour-blind users, dynamic text simplification, language translation, and sign language interpretation. Such advancements will benefit VR developers, accessibility advocates, and end-users with diverse needs.

Problem Definition and Research Gap: Current virtual environments often lack systems to adapt to diverse user needs, leading to suboptimal UX. There is a need for frameworks that enable AI to seamlessly collaborate with humans, providing personalised assistance and enhancing interaction intuitiveness, thereby improving the overall human-AI partnership.

Research and Innovation Objectives:

Develop AI-driven frameworks that augment human strengths and compensate for limitations through real-time content transformation and assistive features
Align expectations and build mutual understanding between humans and AI agents via transparent information visualisation
Enable natural and intuitive multimodal interactions with AI agents, assisting in complex tasks like data exploration, collaborative design, and guided learning
Fusion of modalities for interaction and communication driven by AI models: As LLMs are additionally capable of processing and generating multimodal information such as a combination of visual and verbal information new interaction paradigms are emerging and will create new opportunities for human-in-the-loop based applications, such as virtual instructors/tutors and cognitive companions in decision support.

2f.20 Intelligent multi-agent systems for dynamic and coherent interactions

Interactivity of virtual environments should be enhanced by enabling agents to collaborate effectively, exhibit emergent intelligent behaviours, and provide a coherent UX. This will help game developers, VR designers, in creating dynamic, responsive, and engaging VW with believable autonomous entities.

Problem Definition and Research Gap: Existing multi-agent systems often lack the ability to coordinate dynamically and coherently in virtual environments, leading to disjointed interactions and reduced immersion. There is a need for decentralised coordination mechanisms, shared understanding among agents, and strategies for managing collective behaviour in complex settings, which is crucial for creating truly immersive VW.

Research and Innovation Objectives:

Develop decentralised coordination frameworks for multi-agent systems in virtual environments
Integrate machine learning for adaptive behaviours and implement communication protocols for shared understanding among agents
Design algorithms for emergent collective behaviours to create more dynamic and realistic virtual interactions

2f.21 Explainable AI for Extended Reality interactions and decision support

Explainable AI (XAI) enhances transparency and trust in AI-driven XR applications by providing users with clear explanations of AI decisions. For XR developers, UX designers, and end-users in immersive environments, this improves user engagement and decision-making, fostering confidence and acceptance of AI assistance.

Problem Definition and Research Gap: Existing XR systems often lack mechanisms to explain AI-driven actions in a user-friendly manner, potentially leading to mistrust and reduced user engagement. There is a need for XAI techniques tailored for XR environments, including real-time, context-aware, and non-intrusive explanations that maintain immersion.

Research and Innovation Objectives:

Develop frameworks and methodologies that integrate XAI into XR platforms
Focus on real-time, context-aware explanations that are non-intrusive and enhance user understanding without disrupting immersion

2f.22 AI-Enhanced information visualisation and visual analytics

Complex datasets and AI model outputs need to be converted into intuitive, interactive, and spatially grounded visualisations. For data scientists, business analysts, and VW developers, this facilitates insight discovery, supports decision-making and addresses the challenge of making data accessible and actionable in VW, while ensuring inclusivity for users with diverse perceptual abilities.

Problem Definition and Research Gap: Current visualisation and analytics tools are primarily designed for 2D screens and do not fully leverage the unique affordances of immersive 3D environments. There is a lack of AI-driven techniques that can automatically adapt, personalise, and spatially anchor visualisations in VW, especially with built-in accessibility features for diverse user needs, which is a key barrier to widespread adoption.

Research and Innovation Objectives:

Develop AI models that automatically generate, adapt, and personalise visualisations for 3D environments
Integrate spatial analytics, interactive exploration, and multi-sensory data representation (e.g., haptics, audio)
Emphasise accessibility, including customisable visual encodings and alternative sensory outputs for diverse user requirements

Provide Feedback

Share your thoughts on 2f Applied Artificial intelligence for Virtual Worlds. Your feedback helps shape Europe's Virtual Worlds research priorities.

Research topics for AI for Virtual Worlds​

AI for perception and sensing​

2f.1 Artificial intelligence -based computer vision methods​

2f.2 AI for Audio analysis​

2f.3 AI for multi-modal Human-Machine Interaction and sensor fusion​

Edge AI and distributed intelligence​

2f.4 On-device AI for inference in headsets, haptic devices and sensors​

2f.5 Federated learning for collaborative training without compromising privacy​

2f.6 Distributed intelligence across devices, cloud, and edge networks​

AI for Content Creation​

2f.7 Generative AI for virtual content creation and multi-scale 3D asset generation​

2f.8 Physics-enabled content generation​

2f.9 Storytelling and application workflows​

2f.10 Multi-modality, sounds, audio, and alignment of modalities​

2f.11 AAI for inclusion and cultural awareness​

2f.12 Semantic AI and knowledge graphs and knowledge modelling​

Foundational models for Virtual Worlds​

2f.13 Adaptation and specialisation of foundation models​

2f.14 Foundation models for reasoning, planning, and tool use​

AI for Avatars​

2f.15 AI for avatar design and creation​

2f.16 AI for avatar control and autonomy​

2f.17 Conversational AI and autonomous Non-Player Characters​

2f.18 AI for social avatar behaviour​

AI for user interaction and human-AI collaboration​

2f.19 Human-AI collaboration frameworks and interaction paradigms​

2f.20 Intelligent multi-agent systems for dynamic and coherent interactions​

2f.21 Explainable AI for Extended Reality interactions and decision support​

2f.22 AI-Enhanced information visualisation and visual analytics​

Provide Feedback

Research topics for AI for Virtual Worlds

AI for perception and sensing

2f.1 Artificial intelligence -based computer vision methods

2f.2 AI for Audio analysis

2f.3 AI for multi-modal Human-Machine Interaction and sensor fusion

Edge AI and distributed intelligence

2f.4 On-device AI for inference in headsets, haptic devices and sensors

2f.5 Federated learning for collaborative training without compromising privacy

2f.6 Distributed intelligence across devices, cloud, and edge networks

AI for Content Creation

2f.7 Generative AI for virtual content creation and multi-scale 3D asset generation

2f.8 Physics-enabled content generation

2f.9 Storytelling and application workflows

2f.10 Multi-modality, sounds, audio, and alignment of modalities

2f.11 AAI for inclusion and cultural awareness

2f.12 Semantic AI and knowledge graphs and knowledge modelling

Foundational models for Virtual Worlds

2f.13 Adaptation and specialisation of foundation models

2f.14 Foundation models for reasoning, planning, and tool use

AI for Avatars

2f.15 AI for avatar design and creation

2f.16 AI for avatar control and autonomy

2f.17 Conversational AI and autonomous Non-Player Characters

2f.18 AI for social avatar behaviour

AI for user interaction and human-AI collaboration

2f.19 Human-AI collaboration frameworks and interaction paradigms

2f.20 Intelligent multi-agent systems for dynamic and coherent interactions

2f.21 Explainable AI for Extended Reality interactions and decision support

2f.22 AI-Enhanced information visualisation and visual analytics