MA Dissertation · 2025

Reimagining Human-AI Communication:
A Hybrid Model for Large Language Model Interactions

Bridging Technical Mechanisms and Humanistic Values
Author Neon Wang
Programme MA Media and Communications
Institution University of Exeter
Submitted 17 August 2025
Word Count 16,295

This dissertation proposes a novel hybrid framework for understanding human–LLM communication, redefining agency as distributed across users, developers, providers, and models, and integrating LLM Literacy to empower users in co-constructing meaning amid probabilistic outputs.

Reimagining Human-AI Communication: A Hybrid Model for Large Language Model Interactions

Neon Wang

Dissertation of MA Media and Communications

University of Exeter

Programme: MA Media and Communications

Word Count: 16295

Date of Submission: 17 August 2025

Declaration and Copyright Statement:

No portion of the work referred to in the Dissertation has been submitted in support of an application for another degree or qualification of this or any other university or other institute of learning.

I confirm that this Dissertation / Practical Research Project is entirely my own work.

Abstract

This dissertation addresses the critical theoretical gap in understanding human–Large Language Model communication by proposing a novel hybrid framework. This framework redefines agency as distributed across users, developers, providers, and models, and treats semantic noise, hallucinations, and context limits as structural features rather than errors; conceptualizes provider mediation (e.g., system prompts, RAG) as a new form of algorithmic agenda-setting; and integrates LLM Literacy—a human-centric competency model (Recognize, Resonate, Reflect, Responsible, Refine)—to empower users in co-constructing meaning amid probabilistic outputs. Through interdisciplinary analysis, the research exposes limitations in traditional communication theories (e.g., Shannon-Weaver, transactional models), demonstrates how the hybrid model bridges technical mechanisms (Transformer architecture, training pipelines) and humanistic values, and validates its applicability to emerging challenges—including anthropomorphism risks, regulatory compliance, and multi-agent systems—ultimately providing a diagnostic toolkit for ethical design, governance, and critical user engagement in the AI era.

1 Introduction

The emergence of Large Language Models (LLMs) has fundamentally altered the communication landscape, creating new forms of interaction that challenge our conventional understanding of communication processes. As these AI systems become increasingly sophisticated and integrated into daily life, they demand a reconsideration of established communication frameworks. This dissertation investigates the complex interplay between humans and LLMs, proposing a novel hybrid model that accounts for the unique dynamics of these interactions while addressing the theoretical and practical vacuum they create.

1.1 Definition of fundamental terms:

Beginning with clear definitions ensures a shared understanding between computer science and communication studies. This approach clarifies technical terms while also showing their application within communication theory.

Natural Language Processing (NLP):

A subfield of AI enabling machines to understand, interpret, and generate human language, underpinning LLM functionality (IBM, 2024, 2022). NLP systems are designed to handle the complexities of natural languages-including grammar, syntax, semantics, and even emotional tone-enabling more intuitive and effective human-computer interactions. As a foundational technology, NLP underpins the capabilities of large language models and many modern generative AI systems (IBM, 2024).

Large Language Model (LLM):

An advanced type of artificial intelligence system designed to understand, generate, and manipulate human language at scale. LLMs are built using deep learning techniques and are trained on massive datasets comprising diverse text sources, often amounting to billions or even trillions of words (IBM, 2023, NVIDIA, Kerner, 2024). Through this extensive training, LLMs learn to recognize linguistic patterns, semantic relationships, and contextual cues, enabling them to perform a wide range of natural language processing (NLP) tasks. From the perspective of the process of generating text, LLM belongs to a type of Generative AI. These tasks include generating coherent text, summarizing documents, answering questions, translating languages, and even creating code or other content (IBM, 2023, NVIDIA, Amazon, Kerner, 2024).

Parameters:

A key feature of LLMs is their size, typically measured by the number of parameters (internal variables learned during training), which can range from hundreds of millions to hundreds of billions or more. This scale allows LLMs to generalize across tasks and domains, making them highly flexible and capable of performing new tasks with little or no additional training (NVIDIA, Amazon, Google, 2024). More parameters often correlate with better performance but depend on architecture and other factors (Maslej et al., 2025).

LLM Developer:

The entity responsible for designing, training, and providing LLM. This process involves curating training datasets, determining model’s parametric scale, architecture designing, and conducting iterative refinement through techniques like reinforcement learning from human feedback (RLHF). Major developers include corporate labs (e.g., OpenAI for ChatGPT, DeepSeek for DeepSeek, Meta for LLaMA, Google for Gemini), academic consortia, and open-source communities (Maslej et al., 2025).

LLM Service Provider:

The operating of high-performance LLMs demands computing services with high GPU performance that cannot be handled by home computers or mobile devices. Therefore, LLM developers (e.g., OpenAI, Google, DeepSeek, Alibaba, etc.), internet infrastructure operators (e.g., Amazon, Microsoft, etc.) or technology companies (Apple or other smart device providers) would deploy LLM on their own high-performance computing clusters, providing LLMs access to the public or specific users through application programming interface (API, more convenient for developers to access or integrate LLM’s dialogue into existing applications), webpage interface, chatbots, or embedded systems, which is how LLM Service Provider is defined in this research. They are responsible for scheduling computational resources, managing API call limits, and acting as the final service provider, delivering services to end users in accordance with local laws and regulations. Service providers' business models (subscription-based, pay-per-token, or freemium) directly influence the accessibility of services for different user groups.

Open-Source LLM:

Open source is one of the ways in which developers release and license their work to public, but has been developed into different sub-categories according to difference of license, level of detail of released information, which facilitated the main controversy of LLM’s “Open-Source”, that is: whether the licenses commercially or non-commercially permits anyone to modify or redistribute the model, or providing services based on their model (integrate the model as a part of their product, or simply provide model access to consumers); and its technical specifications (training datasets, architecture, training methodologies, and model weights) was totally released without reservation (if released data could perform a replication).

To focus this study on LLM in the context of communication studies while focus on democratization/accessibility/equality values, the concept of open source is simplified, which refers to providing the public with entire software, program or code and necessary instructions, allowing anyone to deploy and run the LLM privately on their compatible devices, to focus on the equality, democratization and the ethical implications of LLM technology.

Token:

In LLMs, a token is the smallest text unit processed, such as a word or sub word. Text is first tokenized into sequences, enabling the model to analyse and generate language probabilistically (Christopher et al., 2008), while also constrains how LLMs parse nuance and influence output coherence due to this process of representing.

For example, the sentence “Chatbots are helpful.” might be tokenized into [“Chat”, “bots”, “are”, “helpful”, “.”], depending on the specific tokenizer used. Each token is assigned a unique identifier, which the model then uses during both training and inference to represent and generate text.

Chain-of-Thought (CoT):

Prompting strategy encouraging step-by-step reasoning to improve task accuracy (Wei et al., 2022).

1.2 Research Context

1.2.1 Development and application of LLM

The launch of ChatGPT (2022) marked a communication shift comparable to the internet’s rise, reaching 100M users in 2 months (Hu, 2023). Adoption in business and daily life is accelerating (Singla et al., 2025).

In 2024, reports revealed that the performance of LLMs are continuing to be challenging competitively, that the United States led with 40 notable AI models, followed by China with 15 and France with three, while the U.S. leads in producing top AI models—but China is closing the performance gap (Maslej et al., 2025). Generative AI saw particularly strong momentum, attracting $33.9 billion globally in private investment—an 18.7% increase from 2023. AI business usage is also accelerating: 78% of organizations reported using AI in 2024, up from 55% the year before (Maslej et al., 2025).

Today, AI is increasingly embedded in everyday life, and researchers are already observing how LLMs are being applied across industries. Like LLMs’ integration to real-world financial services (Maple et al., 2024), to support academic research (AlZaabi et al., 2023), in government affairs (Wang et al., 2024b).

1.2.2 Concerns around LLMs

Behind this technological revolution lies a profound philosophical tension. The “stochastic parrot” critique, articulated by Bender, Gebru, McMillan-Major, and Mitchell (2021), characterizes LLMs as sophisticated mimicry systems that “haphazardly stitch together sequences of linguistic forms... without any reference to meaning” (Bender et al., 2021). This metaphor suggests fundamental deficiencies in understanding despite convincing outputs. However, users increasingly perceive communicative agency in these systems—67% of 300 U.S. residents attributed some degree of phenomenal consciousness to ChatGPT, with frequent users more likely to perceive consciousness (Colombatto and Fleming, 2024). This anthropomorphic tendency intensifies with exposure, suggesting prolonged interaction makes users perceive LLMs as legitimate communicative agents rather than mere tools.

LLMs’ sophisticated linguistic combination, based on transformer models predicting next tokens from vast datasets (Achiam et al., 2023), amplifies existing communication problems. Disinformation created by LLMs proves more challenging to identify (Chen and Shu, 2023), potentially threatening fundamental rights to truth. These systems also exhibit limitations distinguishing them from human interlocutors: difficulties with long-term memory, consistency, emotional understanding, and reliable cognitive pattern simulation (Wang et al., 2025). This creates a novel communication environment where users engage with entities they simultaneously know to be algorithmic yet experience as agentic—neither fully tool nor fully agent, challenging fundamental assumptions about communication itself.

1.3 Problem Statement

A critical gap exists in our theoretical understanding of human-LLM interactions. While Lee (Lee, 2023) distinguished human-AI interaction from human-computer interaction, this discussion provided only limited definition of the distinction and simplified LLMs’ generative role to a broad “AI” concept. Traditional human-machine communication (HMC) models were designed for rudimentary interactions and lack components necessary to explain LLM engagement dynamics. Existing communication frameworks prove inadequate for LLM interactions, while these frameworks assume clear entity boundaries and struggle to locate agency in systems exhibiting human-like behaviour without consciousness. Even modern HCI frameworks fall short. They maintain clear tool–user distinctions, but LLM interactions often blur these boundaries. Users demonstrate increasingly human-like communication patterns, including politeness indicators and language complexity, challenging HCI assumptions about consistent user perceptions.

This gap is compounded by the dual nature of agency in human-LLM communication: functional agency (how developers, providers, and models technically shape interactions) and attributed agency (how users perceive LLMs as intentional actors). Traditional frameworks fail to disentangle these layers, obscuring accountability and user empowerment.

This theoretical inadequacy creates urgent regulatory challenges. Current frameworks cannot guide compliance with transparency mandates (e.g., EU AI Act 2024, UK DSIT 2023) or address how users meaningfully evaluate LLM responses (Lin et al., 2024). Similarly, bias prevention legislation targets algorithmic fairness (Ma et al., 2025), yet existing models cannot adequately distribute responsibility across users, developers, and providers. Human oversight requirements demand understanding of AI capacities within interactions where users increasingly perceive LLMs as human-like partners.

The rapid evolution of technical capabilities exacerbates these challenges. Each model release shifts user engagement patterns and potentially intensifies anthropomorphic tendencies. Without robust theoretical frameworks tailored to human-LLM dynamics, we risk misunderstanding these interactions and their societal implications, hindering effective governance and responsible deployment.

This dissertation contributes a hybrid communication framework for human–LLM interaction that: (1) models’ agency as distributed across developers, providers, models, and users; (2) integrates semantic noise, memory limits, and hallucination as structural features; (3) formalizes provider mediation (system prompts, RAG, compliance) as new form of agenda setting; and (4) supplies diagnostic tools and design levers for practice.

1.4 Research Questions

This dissertation addresses three interconnected research questions that explore the theoretical, empirical, and ethical dimensions of human-LLM communication:

a. To what extent can traditional communication theory explain the human-LLM interactions?

This question critically examines the applicability of established communication theories to LLM contexts. It investigates which components of models like Shannon-Weaver, Berlo’s SMCR, and transactional frameworks remain relevant and which encounter limited applicability. By systematically analysing where these models face limited reasonability-particularly regarding concepts of agency, intentionality, and meaning construction-this research identifies specific theoretical gaps that must be addressed. For example, how do we understand “noise” in Shannon-Weaver when applied to token prediction errors versus human misinterpretation? How does Berlo's emphasis on the “source’s” knowledge, attitudes, and culture translate when the source includes both human developers and massive training datasets?

b. What hybrid components best model human-LLM communication dynamics?

Building on identified gaps, this question explores novel theoretical constructs that more accurately capture human-LLM interactions. It examines how elements from computer science (e.g., attention mechanisms, statistical inference), linguistics (e.g., pragmatics, discourse analysis), and media studies (e.g., parasocial interaction theory) might be synthesized into a more comprehensive framework. This investigation considers various interaction contexts-from educational to customer supporting to creative-to ensure the model's robustness across diverse applications. Key considerations include how to conceptualize distributed agency, how to map feedback loops across multiple timescales, and how to account for the evolving nature of these systems through ongoing learning and updates.

c. What benefits or risks emerge from treating LLMs as communicative agents?

This question addresses the normative implications of communication framing, examining how different conceptual models influence user behaviour and system design. It investigates potential consequences of anthropomorphism, including over-trust, emotional attachment, and distorted expectations. Does framing LLMs as “agents” increase user trust, dependency, or compliance in high-stakes contexts (e.g., healthcare, finance)? By analysing cases where communication framing has led to problematic outcomes-such as users developing inappropriate dependencies or making high-stakes decisions based on LLM outputs-this research identifies specific ethical concerns requiring mitigation. It further explores how communication theories might inform design practices and regulatory approaches that promote responsible engagement with these technologies.

These questions progress from descriptive analysis to theoretical construction to ethical application, collectively addressing both the new challenges of understanding a novel communication paradigm and the practical challenge of guiding its development in socially beneficial directions.

1.5 Structure Preview

The remainder of this dissertation is organized into six chapters that systematically address the theoretical, empirical, and practical dimensions of human-LLM communication dynamics.

Chapter 2: Literature Review conducts a comprehensive analysis of relevant theoretical traditions across three interconnected domains. It begins by examining traditional communication theories, including linear models (Shannon-Weaver), interactive frameworks (Schramm), and transactional approaches (Barnlund), critically assessing their applicability and limitations when applied to LLM contexts. The review then explores emerging scholarship in Human-Machine Communication (HMC), with particular attention to anthropomorphism research across utilitarian, social, empathic, and existential dimensions of user perception. Finally, it examines technical literature on LLM architecture and functionality, connecting Transformer mechanisms, attention systems, and training methodologies to their communicative implications, while emphasizing the interdisciplinary necessity for arts-based follow-up research to inform ethical, societal, and cultural implications of these technologies, thereby establishing the interdisciplinary foundation necessary for theoretical innovation.

Chapter 3: Technical Mechanisms and Algorithmic Processes deconstructs the procedural elements involved in human-LLM interactions, mapping the complete pipeline from model training to user engagement. This chapter systematically analyses how users interact with LLMs through various touchpoints—including dataset influence, parameter configurations, system prompts, internet accessibility, contextual memory, and dialogue management—to demonstrate (1) LLM responses constitute algorithmically-dominated, probabilistic linguistic combinations rather than intentional communication acts, (2) the two main touchpoints of intervening LLMs' response - training and serving. By examining these technical processes, this chapter establishes the foundational understanding necessary for reconceptualizing these interactions within communication theory frameworks.

Chapter 4: Future-Oriented “LLM Literacy” synthesizes findings from Chapter 2 and 3 to argue that the informational challenges posed by LLMs require users to develop sophisticated critical thinking capabilities that extend beyond traditional media literacy frameworks. LLM Literacy represents a proposed framework for critical thinking capabilities specifically adapted to AI interaction contexts, solid a foundation framework from the human-centric perspectives in respond to informational challenge of LLMs, which constructs required components of the hybrid model accordingly. Meanwhile acknowledging recent advances in multimodal capabilities, while focusing the analysis focuses on text-based interactions as the primary mode of human-LLM communication but illustrating the similar principles and applicability of hybrid model on multimodal LLM.

Chapter 5: A Novel Framework for Human-LLM Communication presents the dissertation's central theoretical contribution: a hybrid communication model specifically designed to account for the unique dynamics of human-LLM interactions. This framework incorporates the arguments from Chapters 3 and 4 and operates across two distinct but interconnected layers—the first addressing the interaction workflow, and the second addressing meaning construction, where the words (both inputs and outputs) in forms of tokens be given meaning and blend technical mechanisms with humanistic values.

Chapter 6: Discussion will explore the operability of interventions on LLMs outputs from the perspective of hybrid models, including current issues and future oriented hypothetical challenges, illustrates the model’s practical utility for understanding both the benefits and risks inherent in current human-LLM communication practices.

Chapter 7: Conclusion synthesizes the dissertation's key theoretical contributions and practical applications, articulating how the proposed framework enhances public understanding of LLM capabilities and limitations to promote more effective and responsible usage. The conclusion emphasizes the interdisciplinary nature of this work, highlighting how it bridges computer science and communication studies to create new analytical possibilities. Finally, it identifies directions for future research, with particular emphasis on longitudinal studies. These will be crucial as LLM technologies continue to evolve and become deeply integrated into diverse communicative contexts.

2 Literature Review

2.1 Applicability of Traditional Communication Theories

2.1.1 Linear Transmission Models

The Shannon–Weaver “mathematical theory of communication” conceptualises messages as signals that travel unidirectionally from source to destination, degraded only by channel noise (Shannon, 1948). In LLM interaction, some elements map cleanly onto this framework: user prompts function as “information sources”, model output travels through an API or graphic user interface (GUI) “channel”, and token-level corruption (e.g., truncation) is a quantifiable form of noise. However, two fundamental mismatches arise.

First, Shannon–Weaver treats meaning as irrelevant to transmission efficiency, whereas LLM users mostly evaluate responses semantically: hallucinations that are syntactically perfect but factually wrong illustrate this gap (Bender et al., 2021). Second, the model presupposes fixed, identifiable senders and receivers. An LLM reply emerges from the joint influence of user prompts, system prompts, training data, and provider mediating, distributing agency rather than locating it in a single source (Neumann et al., 2025). Consequently, while the linear model aids measurement of latency, throughput, and token error, it cannot account for iterative prompt refinement or co-constructed intent.

2.1.2 Interactive Communication Models

Schramm’s circular model adds feedback loops, casting communication as iterative encoding-decoding between partners (Schram, 1954). Such reciprocity is visible when users chain prompts to refine tone or request citations. Interactive framing also highlights how LLMs alter their outputs after receiving critique, a behaviour sometimes described as “self-correction” (Kumar et al., 2024). Nevertheless, Schramm still presumes symmetrical cognitive capabilities, whereas LLMs fundamentally differ from humans in their approach to understanding and cognition (Cuskley et al., 2024). The theory also underestimates stochastic variation: identical prompts can yield divergent completions despite identical “channel” conditions (Atil et al., 2024).

2.1.3 Transactional Communication Models

Barnlund’s transactional model treats communication as a simultaneous, co-constitutive process embedded in context, emphasising that interlocutors are always both senders and receivers (Barnlund, 2017). This stance aligns well with prompt engineering, where user and model mutually shape discourse; each new token from the model reshapes the “field of experience” that guides the next user turn. Moreover, Barnlund foregrounds environmental and psychological “noise”, paralleling socio-technical factors such as content filters or political bias in training data.

Limitations persist. Barnlund assumes human communicators with shared perceptual channels, yet an LLM lacks sensory grounding and instead operates on textual probabilities. Consequently, while transactionalism captures reciprocity and context-dependence, it cannot by itself explain phenomena such as out-of-distribution hallucination (LLMs produce incorrect, fabricated, or spurious outputs when inputs beyond training data) or long-term memory loss (LLM’s deterioration or failure to retain previously learned or stored information over extended periods or interactions).

2.2 Applicability of Existing Human-Machine Communication Theory

2.2.1 Evolution of HMC Frameworks

HMC scholarship reframes machines as message sources rather than mere channels (Guzman, 2018, Lewis et al., 2019). Guzman & Lewis argue that communicative AI requires new ontological categories because intentionality is now attributed to artefacts (Guzman and Lewis, 2020). Empirical studies confirm that users adapt human-like politeness strategies after only one turn of ChatGPT interaction, indicating mental-model shifts (Schneider et al., 2024). These findings echo HMC's proposition that anthropomorphism is relational and situational rather than device-intrinsic.

2.2.2 Anthropomorphism in Human–AI Interaction

Attribution of consciousness to LLMs rises with usage frequency; 67% of U.S. participants ascribed subjective experience to ChatGPT in 2024 (Colombatto and Fleming, 2024). Trust similarly correlates with perceived humanness. Studies show a positive correlation between consciousness attributions and advice-taking behaviour (Colombatto et al., 2025). Yet it warns of a "machine penalty": users cooperate less when they recognise non-human agency in high-stakes tasks (Wang et al., 2024c). These contradictory patterns illustrate the dual-edge of anthropomorphic design—facilitating relational engagement while risking over-trust and moral confusion.

2.2.3 Intersubjective Models of AI-Mediated Communication

Recent proposals extend HMC by theorising mutual theory-of-mind between user and agent (Wang and Goel, 2022). Intersubjective frameworks treat AI not just as conversational partner but as mediator capable of reshaping human-human interaction via translation, style transfer, or bias filtering (Aoyama et al., 2025). Such models accommodate distributed agency and align well with LLM use cases in customer service or collaborative writing, yet they rely on assumptions of stable long-term memory that current LLM architectures do not meet.

2.3 Technical Literature on LLM Architecture and Communication

2.3.1 Transformer Mechanisms and Attention Systems

The self-attention mechanism enables LLMs to model token interdependencies (Vaswani et al., 2017), approximating pragmatic inference when prompted with few-shot examples (Brown et al., 2020). However, attention operates within a finite context window; overflow truncates early turns, breaking coherence (Bergmann, 2024). Memory-augmented proposals—SCM (Wang et al., 2023a), LongMem (Wang et al., 2023b), MemoryBank (Zhong et al., 2024)—seek to externalise long-term context, that is summarise the former dialogue for replacing existing context, so token could be simplified. But evaluation datasets such as LoCoMo reveal persistent degradation beyond ~9 k tokens (Maharana et al., 2024).

2.3.2 Training Processes and Communicative Implications

Reinforcement learning from human feedback (RLHF) optimises models for perceived helpfulness, thereby embedding dominant cultural patterns (Yin et al., 2024) and potentially amplifying anthropomorphic misperceptions. Hallucination remains a persistent issue. While RLHF has reduced hallucination tendencies, it has not eliminated them entirely (Dahlgren Lindström et al., 2025), beyond the matter of how much computational resources equipped but algorithms (Xu et al., 2024), a constraint that classical communication models do not foresee. Multi-agent debate and voting frameworks reduce errors (Chan et al., 2023), yet introduce meta-conversation layers that further complicate sender–receiver identification.

2.3.3 Specialised Applications and Communicative Contexts

Domain-specific deployments—medical triage (Sorich et al., 2024), public-health voice agents (Wen et al., 2025), government policy drafting (2025)—illustrate both promise and peril. Personalised assistants benefit from vector-database memory but raise privacy risks when recalled out of context (Liu et al., 2024b). Technical architectures thus directly shape communicative affordances, underscoring the need for interdisciplinary models that integrate system design with communication-centric theory (Yan et al., 2025).

2.4 The Imperative for Interdisciplinary Engagement: LLM Scientists' Call for Arts and Communication Research

Recognizing the profound theoretical and regulatory gap outlined above (Section 1.3), a significant shift is occurring within the LLM research community. There is an unprecedented and growing call from computer science and AI researchers for deeper engagement with the arts, humanities, and social sciences—particularly communication studies. This interdisciplinary imperative arises from a clear acknowledgment that the societal impact of LLMs, and the challenges they pose (including regulatory compliance), far exceed the explanatory power of purely technical approaches.

2.4.1 Recognition of Social Impact Beyond Technical Metrics

Leading AI researchers increasingly acknowledge that LLM impacts extend far beyond computational performance metrics. Critiques often focus on performance or safety evaluations, yet the deeper societal consequences—effects on social groups, institutions, norms, and practices—demand broader investigation (Wang et al., 2024a). This has motivated calls for more comprehensive evaluation tools capturing complex social dynamics. Initiatives like the Epitome platform, focused on "deep integration of artificial intelligence and social science," explicitly seek theoretical foundations from fields like communication studies to investigate "interactive impacts of AI on individuals, organizations, and society" during real-world deployment (Qu et al., 2025). This reflects a direct response to the need for frameworks that address the socio-communicative realities driving regulatory concerns.

2.4.2 The Limitations of Technical-Only Approaches

LLM developers now recognize the inadequacy of purely computational approaches for understanding communicative and social functions. Research highlights challenges like choosing models without social science guidance and the significant impact of model alignment on outputs in ways requiring humanistic interpretation (Lyman et al., 2025). Attempts to use LLMs to simulate human participants reveal fundamental epistemological limitations, such as the "correct answer effect," demonstrating that technical systems cannot capture human diversity of thought (Colombatto and Fleming, 2024, Wang et al., 2025, Park et al., 2024). These limitations underscore why purely technical metrics cannot resolve the communication-theoretic and regulatory dilemmas identified earlier.

2.4.3 Calls for Human-Centered AI Research

The rise of Human-Centered AI (HCAI) represents a direct acknowledgment within computer science that technical excellence is insufficient. However, research mapping HCAI reveals "considerable ambiguity" about its framing, design, and evaluation, highlighting the urgent need for expertise from fields like communication studies (Capel and Brereton, 2023). As researchers note, human-centric measures (fairness, trust, explainability) are subjective, context-dependent, and often uncorrelated with conventional performance metrics (Ghai, 2023). Therefore, addressing the core challenges of human-LLM interaction, including those with regulatory dimensions, necessitates interdisciplinary collaboration that incorporates communication studies' frameworks for understanding subjective, contextual, and relational phenomena.

2.5 Emerging Theoretical Frameworks

In response to these challenges, emerging theoretical frameworks offer promising, though nascent, integrative directions. Semantic Information Theory (SIT) extends Shannon by embedding meaning (Niu and Zhang, 2024), while AI research applies semantic communication for efficiency (Salehi et al., 2025). The Mathematical Theory of Inferential Communication (MaTIC) centres pragmatic inference (Llobera and Vallverdú, 2021). For HMC and addressing the regulatory gap, these frameworks imply:

Evaluating semantic fidelity is crucial for regulating hallucination.

Effective interaction depends on aligning model priors with user context -echoing design guidelines like FAICO (Rezwana and Ford, 2025).

However, these frameworks currently focus heavily on the technical interaction process from the user's side, rather than providing a comprehensive socio-technical framework for interpreting the LLM itself within the communication chain.

2.6 Conclusion

Traditional communication theories provide valuable but partial lenses: linear models quantify token-level noise; interactive models introduce feedback; transactional models recognise co-construction. HMC scholarship advances the field by decentring human exclusivity and analysing anthropomorphic perception, yet technical limitations—finite context windows, inevitable hallucinations, fragile long-term memory—constrain real-world applicability. Crucially, the explicit calls from LLM scientists for interdisciplinary engagement underscore both the magnitude of the societal impact (including regulatory pressures) and the inadequacy of purely technical approaches to resolving the core theoretical and practical gaps identified.

Emerging semantic and inferential frameworks offer promising integrative directions, suggesting that effective human-LLM communication depends on shared contextual understanding rather than mere information transmission. However, to effectively address the regulatory imperatives around transparency, bias, and human oversight, these frameworks require empirical validation and expansion upon the distributed, probabilistic nature of LLM outputs and the complex dynamics of anthropomorphic attribution documented in HMC research.

Collectively, the literature indicates that any robust communication-methodological model of human-LLM communication must address the four key requirements previously stated: (1) treat agency as distributed across users, LLMs and related factors; (2) incorporate semantic-level noise, memory constraints, and the inevitable nature of hallucination; (3) address the ethical ramifications of anthropomorphic expression and its communicative perspectives; and (4) bridge the methodological gap between technical performance metrics and humanistic understanding of communicative meaning-making. The convergence of calls from both LLM scientists and communication scholars for interdisciplinary collaboration suggests that such a hybrid model is not merely academically desirable but practically urgent for the responsible development and deployment of these transformative technologies.

3 Technical Mechanisms and Algorithmic Processes

3.1 The LLM Training Pipeline: From Data to Model

This section details the core components/pipelines that enable LLMs to generate coherent, context-aware text, argues for integrating humanities perspectives (e.g., bias detection, cultural inclusion, ethical evaluation) directly into each training stage—pre-training data, fine-tuning datasets, and RLHF reward modelling—to embed human-centered values algorithmically.

3.1.1 Transformer Architecture and Its Communicative Implications

3.1.1.1 From Sequential to Parallel Processing

The Transformer marked a paradigm shift by abandoning sequential text processing in favour of processing entire sequences in parallel (Vaswani et al., 2017). Stacks of multi-head attention and feed-forward layers create a unified framework for tasks from generation to translation and summarization. The modular design of Transformer networks—scaling up layers, width, or heads—enables LLMs to smoothly expand with growing data, and attend to every token in an input simultaneously. This property allows them to generate highly coherent, context-aware responses and makes LLMs more robust and effective as model and corpus size increase (Kaplan et al., 2020).

3.1.1.2 Self-Attention: Dynamic Focus for Meaning

Self-attention lets the model highlight relevant parts of a message—whether a proximate phrase or a distant clause—mirroring human selective listening (Vaswani et al., 2017). Multiple attention heads (e.g., grammar, semantics, discourse) track syntax, topic shifts, and discourse cues simultaneously (Clark et al., 2019, Zhang et al., 2025). Communication researchers can view self-attention as an algorithmic analogue to contextual inference, grounding analyses of how LLMs “understand” and respond to varied communicative intents. This algorithmic contextual inference mechanism provides the first theoretical bridge to overcome Schramm’s limitation of presuming symmetrical cognition (Section 2.1.2).

3.1.1.3 Context Window: Managing Conversational Scope

Each Transformer operates within a fixed context window, allowing every token to attend to all others via self-attention and thereby maintain coherence over long passages (Bergmann, 2024). Early models handled a few thousand tokens (Wu et al., 2024); leading LLMs—such as DeepSeek V3/R1 (Liu et al., 2024a), Llama 3 (Dubey et al., 2024), and Qwen 3 (Yang et al., 2025)—now manage around 128,000 tokens, while top-tier models like Gemini 1.5 Pro extend this to 1,000,000 tokens (Team et al., 2024). Within this scope, LLMs can recall earlier points and cross-reference distant facts; and when inputs exceed this limit, content must be truncated or summarized, a process that mirrors human memory decay over extended discourse. For communication studies, these limits are crucial for modelling how LLMs plan discourse and manage conversational continuity.

These memory constraints necessitate new transactional communication models that account for non-human forgetting—resolving Barnlund’s oversight identified in Section 2.1.3.

3.1.2 Pre-training: Learning Language Patterns – the base model

The foundation of any LLM begins with pre-training, a self-supervised learning phase—distinguished from traditional unsupervised methods by its use of automatically generated prediction targets (LeCun and Misra, 2021)—during which models learn general language patterns from massive text datasets (Bergmann, 2023, Vaswani et al., 2017). During this stage, the model is exposed to diverse sources including web text, books, articles, and code repositories, learning to predict missing or next words in sequences without requiring labelled data (Bergmann, 2023). This process employs the fundamental objective of causal language modelling, where the model learns the statistical relationships between tokens in natural language - The model learns to approximate probability distributions over sequences of tokens, developing an understanding of grammar, facts, reasoning patterns, and semantic relationships through exposure to billions of text examples (Radford et al., 2019). This phase creates what researchers term the “base model”—essentially a sophisticated document generator capable of predicting the next token in any given sequence (Radford et al., 2018).

3.1.3 Fine-tuning: Task Specialization

Following pre-training, models undergo fine-tuning to adapt them for specific tasks or behaviours. This process involves training on smaller, curated datasets that demonstrate desired input-output patterns (Ouyang et al., 2022). Instruction tuning, a common form of fine-tuning, trains models to follow instructions by providing examples of human-written prompts paired with appropriate responses.

The fine-tuning process typically involves three distinct approaches: Supervised Fine-tuning (SFT) for a more directed response pattern suitable for interactive applications (Ouyang et al., 2022), Parameter-Efficient Fine-tuning (PEFT) for specific tasks (Hu et al., 2022) and Safety Fine-tuning addresses potential harmful outputs by training models on safety-focused datasets (Bengio et al., 2025) which remains an active area of research with ongoing challenges. Academic research demonstrates that even small instruction-tuned models (1.3B parameters) can outperform much larger base models (175B parameters) on human preference evaluations (Ouyang et al., 2022).

A study on multi-task fine-tuning shows that mathematical reasoning and code generation capabilities improve significantly with proper instruction datasets (Dong et al., 2023). Further research also found that, globally propagated noise – the noise would corrupt not only one but all downstream steps during CoT - during both pre-training and fine-tuning, confirming the need for aggressive filtering of “global” errors in large datasets (Havrilla and Iyer, 2024).

Accordingly, instruction tuning effectively filters out that noise (in base model) by focusing on high-quality, human-curated examples, allowing even smaller models to surpass much larger but noisier base models on human preference evaluations.

3.1.4 Reinforcement Learning from Human Feedback (RLHF)

The final training phase employs RLHF to align model outputs with human preferences. This process involves human evaluators rating model responses on criteria such as helpfulness, harmlessness, and honesty (Ouyang et al., 2022). These ratings train a reward model that approximates human preferences, which then guides further model optimization through reinforcement learning algorithms (Gao et al., 2024).

In communication-centric terms, fine-tuning and RLHF serve distinct but complementary roles in shaping an LLM’s outputs. Fine-tuning resembles a scripted classroom session (Pareja et al., 2024), ingesting explicit prompt–response pairs, whereas RLHF functions like a workshop critique, using ranked human judgments to steer the model toward audience-preferred outputs (Kaufmann et al., 2024), while RLHF’s reward model is still a statistical proxy, not a genuine evaluator.

While the Project PRISM (Kirk et al., 2024) is a landmark for its participatory scale and the granularity of its demographic linkage, it should be noted that the research—perhaps for the first time so explicitly—foregrounds social science, ethics, and science and technology studies frameworks not just in rhetoric but in its actual methodology and analysis. This move vividly illustrates that LLM development is not merely a technical challenge. The humanities offer critical perspectives on context, subjectivity, value diversity, and the social consequences of AI systems—perspectives that cannot simply be retrofitted after the fact by technical teams alone. It is important, thereby, to situate this contribution.

In Project PRISM, actual humanities participation remains indirect: the “social science” and “community” involvement consists primarily of carefully constructed and demographically-balanced rater populations, rather than direct scholarly or stakeholder representation in framing what counts as alignment, which prompts are posed, or which outcomes are prioritized (Kirk et al., 2024). In this sense, PRISM builds on a substantial tradition in interdisciplinary AI research, but still treats social context as a metric to optimize, not as a true partner in decision-making.

As Project PRISM states, “Alignment cannot be neatly bifurcated into technical and normative components. PRISM assists in navigating these complexities with more human voices adjudicating alignment norms” (Kirk et al., 2024). Accordingly, if alignment is to be just, inclusive, and safe, humanities scholars and community stakeholders must be included from the very beginning—informing dataset design, value articulation, feedback protocols, and impact evaluation—neither simply at the stage of raters or ex post critique, nor after trained fix up, the “red-team” exercises (intentionally probing systems with challenging, adversarial, or unexpected prompts to uncover biases, vulnerabilities, and failure modes) or participatory model cards (simply declare known factors).

3.2 The User Interacting Pipeline: From Model to Meaning Construction

This section dissects the complete process from a trained LLM to the user’s final completion of an interaction, revealing how this process extends far beyond simple “input-output” mechanisms. Instead, it encompasses complex structural influences, often invisible to users, exerted by service providers alongside intricate processes of meaning construction driven by user intentions. This pipeline represents a critical juncture where technical capabilities meet human agency, creating a space where power dynamics, cognitive biases, and literacy demands converge to shape the ultimate communicative experience.

3.2.1 Provider Mediation: Hidden Levers that Shape Outputs

LLM service providers construct a sophisticated “mediating layer” between users and models through their deployment strategies, business models, and technical interventions. This intermediary infrastructure, while often invisible to end users, profoundly influences information accessibility, introduces content bias, and constitutes a new form of agenda setting power that warrants serious examination within communication theory frameworks.

3.2.1.1 Barriers to Entry: Re-examining Accessibility, Cost, and “Democratization”

The most common cost structures for accessing LLMs vary significantly between GUI (for customers) and API (for developers) access methods, as well as between closed-source and open-source models.

Closed-source models impose direct fees on end users and developers. GUI access often includes a free tier plus paid subscriptions—e.g., ChatGPT Plus (£20/month) and ChatGPT Pro (£200/month) (OpenAI, 2025a) or Google AI Pro (£19–20) and AI Ultra (£235) (Google, 2025b) , —with higher-tier plans offering faster response times and larger usage quotas. API access shifts to pay-per-token pricing, which can vary widely: for instance, batch versus priority requests on ChatGPT 5 and lower-capacity variants like ChatGPT 5 nano or competitors like Gemini 2.5 Pro (OpenAI, 2025b, Google, 2025a) differ substantially in cost (Chart 1). Anthropic estimates an average conversation uses about 3,700 tokens (Anthropic, 2025).

Chart 1: Model Token Cost Compare

Open-source models remove licensing fees but transfer costs to infrastructure. Deploying a 70B-parameter model such as Llama 3 for 5,000 users over four years can cost nearly $1 million on self-owned servers, $2.1 million on AWS, or $2.9 million via GPT-4o API (Kaufmann, 2025). As a result, pay-as-you-go APIs often remain cheaper for low-volume or short-term use, while self-hosting becomes cost-effective only at large scale, or when privacy or customization needs justify higher upfront investment.

DeepSeek's architecture demonstrates remarkable efficiency, achieving competitive performance while requiring only 37 billion active parameters from its 671 billion total parameter base, resulting in significantly lower computational costs (Liu et al., 2024a). Meta's Llama 3.1 405B represents what Meta AI claims as “the world’s largest and most capable openly available foundation model”, providing capabilities that rival closed-source alternatives while maintaining open accessibility (2024a).

Besides the cost itself, these open-source alternatives also directly address Knowledge Gap Theory concerns by providing information processing media accessible to users regardless of economic circumstances, that is the proprietary nature of the closed-source LLMs. Tichenor, Donohue, and Olien’s original formulation suggested that information acquisition capabilities differ based on socioeconomic status (Tichenor et al., 1970), which should be extend to the control of its proprietary information processing media – the LLM today, beyond the cost concerns. Open-source LLMs potentially disrupt this pattern by enabling equal access to advanced language processing capabilities, though infrastructure requirements and technical literacy demands remain as secondary barriers. However, the dependency of LLM development - commercial sustainability should be respect, any attempts of providing free LLM accesses should be welcomed and encouraged.

The competitive cloud services market facilitates deployment of open-source models, enabling individuals and small organizations to achieve private, cost-effective deployments. This accessibility has global implications, particularly for regions where commercial LLM services may be economically prohibitive or subject to regulatory restrictions. Research also proved that, cost-effectiveness comparisons favour open-source alternatives by margins of 95% in specific cases, fundamentally altering the economics of LLM access (Zhang et al., 2024a).

Provider normative principles introduce another layer of consideration regarding accessibility and equity. Commercial providers typically implement comprehensive data privacy protections, content neutrality policies within legal boundaries, and reliability standards that ensure consistent service availability.

3.2.1.2 Content Shaping: System Prompts, RAG, and Agenda Setting

When consumers’ direct access through GUI, LLM Service Providers actively shape LLM outputs through technical interventions, most for compliance or performance purposes (Gorwa et al., 2020). However, those interventions should be considered as modern extend of gatekeeping and agenda setting, distinguished from the used pattern of media agency’s responsibility, but spread in each dialogue, operating largely outside user awareness yet significantly influencing the information and perspectives users encounter. These interventions represent a sophisticated form of content curation that extends beyond traditional media gatekeeping into real-time, personalized information filtering and framing.

System prompts operate as foundational “super-instructions” that establish LLM behaviour parameters before any user interaction occurs. These hidden instructions define task guidelines, model persona, response tone, prohibited topics, ethical boundaries, and value orientations (Zhang et al., 2024b, Saggu and Das, 2025). Unlike traditional media content where editorial decisions are visible through bylines, publication mastheads, or explicit opinion labelling, system prompts operate invisibly, making their influence from an agenda setting perspective. Recent incidents of system prompt leakage provide revealing insights into this hidden influence layer. Research on various LLM systems reveals prompts containing specific cultural assumptions, and value hierarchies that shape responses across all interactions (Kharchenko et al., 2024). Leaked prompts have revealed instructions directing models to prioritize specific types of sources, avoid certain topics, or frame controversial subjects in particular ways (0xAb1d, 2024). These revelations demonstrate that system prompts extend far beyond technical functionality into explicit content and perspective shaping.

The agenda setting implications become clear when viewed through McCombs and Shaw's framework (McCombs and Shaw, 1972): while traditional agenda setting theory focused on media’s role in determining “what to think about”, the first layer - issue salience (Cohen, 2015). For LLM interactions, the user’s input prompt effectively establishes issue salience in each conversation. That is, by asking a specific question, the user selects the focal topic — mirroring the media’s role in elevating certain issues in the public sphere. Simultaneously, attribute salience — the second level of agenda setting — emerges through the influence of system prompts and the model’s design. System prompts determine which aspects of the issue are emphasised, which perspectives are prioritised, and how complex topics are contextualised. While much of an LLM’s attribute salience is shaped during the training process (Section 3.1), the operational touchpoint for shaping it in a specific conversation lies in the system prompt, due to its invisible yet causal role in steering the model’s responses. Thus, in LLM–user exchanges, user prompts can be conceptualised as setting first-level agenda (issue salience), while system prompts function as a dynamic, situational mechanism for second-level agenda setting (attribute salience).

Temperature: a parameter modifies the probability distribution over possible next tokens during text generation, commonly referred as “creativity parameter”. Low temperature (<1) prefer the highest probability token and high temperature (>1) for lower-probability options (Peeperkorn et al., 2024).

Retrieval-Augmented Generation (RAG) introduces external knowledge sources that significantly influence LLM responses, yet these sources are typically selected and curated by providers rather than users. RAG implementations determine which databases, news sources, academic publications, or websites the model can access when formulating responses to factual questions (Lewis et al., 2020). This selection process inherently introduces editorial bias, as the choice of sources fundamentally shapes available information, which is one-time-off information fetched in each conversation, thus differed from issues with dataset in Section 3.1. Research reveals concerning patterns in RAG implementations where source selection can create information cocoons similar to social media filter bubbles. Studies demonstrate that RAG systems can be vulnerable to poisoning attacks where malicious actors insert biased or false information into retrieval databases, highlighting the critical importance of source curation decisions (Zhao et al., 2024). The technical complexity of RAG systems obscures these editorial choices from users, who may not realize their queries are being answered using specific, pre-selected information sources rather than comprehensive knowledge bases. Perplexity AI’s recent legal challenges illustrate the real-world implications of RAG source selection. The BBC’s cease and desist action against Perplexity highlights concerns about “illegally scrape publishers’ content” and potentially “injuring the BBC’s reputation” through inaccurate representation of source material. Research indicates that 17% of Perplexity’s responses “output fell short of BBC Editorial Guidelines around the provision of impartial and accurate news”, including factual inaccuracies and missing context (McMahon, 2025). These findings underscore how RAG implementations can distort source material in ways that may mislead users about the original content's meaning or accuracy.

A new form of agenda setting implications of the co-constructed issue salience and attribute salience extend beyond each individual interactions to broader information ecosystem effects. When millions of users receive LLM responses shaped by specific system prompts and RAG sources, these systems effectively function as agenda setting mechanisms but in macro level indirectly, a new form of influence in each private conversation may turn to public influence. Unlike traditional media where audiences can choose between different editorial perspectives, most users interact with single LLM providers and remain unaware of the editorial choices embedded in system design. Educational applications like Khanmigo demonstrate particularly significant agenda setting potential, as these systems shape not just information access but learning processes and knowledge formation for students (Lake, 2023, Academy, 2025). Research revealing bias in educational AI systems shows how these platforms can reinforce harmful stereotypes, with some models portraying Latino male students as over 1,300 times more likely to be shown as “struggling” rather than as “star” students (Shieh et al., 2024). Such biases in educational AI represent a form of agenda setting that directly influences how groups of users (students) are perceived and how they may perceive themselves and their capabilities.

Agentic Mediating Layer: An advanced form of provider mediation that extends beyond basic content filtering and RAG to include autonomous task decomposition, multi-step planning, and orchestrated tool use (Rawat et al., 2025). This layer enables LLM systems to function as agentic systems capable of: (1) decomposing complex user requests into sequential sub-tasks; (2) dynamically selecting and coordinating appropriate tools or sub-agents; (3) executing multi-step workflows with iterative refinement; and (4) providing real-time status monitoring throughout task execution (Sukharevsky et al., 2025). Unlike traditional mediation that primarily shapes content, agentic mediation fundamentally transforms the interaction paradigm from conversational to task-oriented automation. For example, with permissions of direct interaction abilities to web browser, a request of “book a flight ticket from London to Beijing at [date, time]” could be assigned in a sequence of tasks on webpage – (1) check available flights from responsible provider; (2) select suitable flight; (3) confirm the details and turn to users for final authorization of purchase. Basic text response kept outputting during the tasks carrying on for status monitor. Technical definition of Agentic Mediating Layer emphasises its properties beyond RAG, to interact with certain service providers or access permissions (Li et al., 2024). However, this definition focuses on the distinctions in technical mechanisms against traditional LLMs which may risk confusion regarding attribution and accountability. With authorized access to personal data and platform permissions to obtain information or issue commands (e.g. takeout, banking, shopping, reservations), the layer can perform consequential actions in the real world. However, informational attribution remains with supplemented information (which could interact with tools), while the layer coordinates execution under policy and user consent. In communication studies perspectives, agentic mediating layer enabled task-oriented action orchestration with observable step-by-step status, but not beyond the role of external information that supplemented by the meditating layer.

Through this analysis, we see how LLM interactions create a novel form of triple-layer agenda setting: user prompts establish what topics are discussed (first-level), while hidden system prompts and RAG selections determine how these topics are framed and contextualized (second-level), and the third layer - the way system prompts and RAG (or Agentic Mediating) shapes conversations among different users in each specific cases in macro level. This represents a significant departure from traditional media agenda setting, as the “editorial decisions” are now embedded in algorithmic processes and largely invisible to users.

3.2.1.3 Attribution of Responsibility: Legal Regulation and Provider Compliance Shaping

Current regulatory approaches demonstrate varied strategies for addressing AI responsibility attribution across jurisdictions. The European Union’s AI Act establishes a comprehensive risk-based framework that categorizes AI systems according to their potential impact, with high-risk systems requiring extensive compliance measures including conformity assessments and CE marking (2024b).

The United Kingdom (Charlesworth et al., 2023) and United States (Lancieri et al., 2025) have pursued more fragmented approaches, with various agencies and departments issuing guidance and requirements for specific AI applications rather than comprehensive legislation. This patchwork approach creates uncertainty for providers operating across jurisdictions while potentially creating compliance gaps that may not adequately address emerging risks.

China’s approach through the Interim Measures for the Management of Generative AI Services focuses specifically on public-facing AI services, requiring providers to ensure content compliance with existing regulations while implementing security assessments and algorithm filing requirements (2023). The China’s framework demonstrates explicit recognition that generative AI regulation requires balancing innovation promotion with content security concerns, leading to regulatory scope limitations that exclude research and enterprise applications from direct oversight.

Compliance filtering mechanisms implemented by providers in response to these regulatory pressures represent direct content interventions that shape user interactions in real time. These systems typically operate through multiple layers including keyword filtering, topic avoidance, content classification, and response modification (Sartor et al., 2020, Marsoof et al., 2023).

Responsibility attribution frameworks emerging from regulatory approaches create complex accountability systems where multiple parties may bear responsibility for AI system outcomes. The EU AI Act’s approach to high-risk systems requires providers to maintain detailed documentation, conduct risk assessments, and ensure human oversight, effectively creating audit trails for accountability purposes (2024b). Legal scholars note that current regulatory approaches often fail to address the distributed nature of AI system responsibility, where training data providers, model developers, deployment platforms, and end users all contribute to final system behaviour (Custers et al., 2025). This distributed responsibility creates challenges for users seeking redress when AI systems cause harm, while also creating uncertainty for providers attempting to ensure compliance across complex technical systems.

The emergence of “regulatory sandboxes” in various jurisdictions represents an attempt to balance innovation with responsibility attribution by allowing controlled testing of AI systems under relaxed regulatory requirements (OECD, 2023). These frameworks acknowledge that traditional regulatory approaches may not adequately address rapidly evolving AI capabilities while providing mechanisms for gathering evidence about real-world AI impacts.

3.2.2 User Intent and Meaning Construction: A Four-Category Input Model

Users approach LLM interactions with diverse intentions, varying levels of AI literacy, and deeply embedded cognitive biases that influence how they construct meaning from AI responses. A single prompt might include social greetings (phatic), background context (informational), emotional state (expressive), and task requests (command), such as: “Hi there! I’ve been unemployed for six months and this feels like my last chance, could you help me practice answering common interview questions?”

3.2.2.1 A Taxonomy of User Inputs: A Non-Exclusive Four-Part Framework

User inputs to LLMs represent complex, multi-intentional communications that resist simple categorization yet require systematic analysis for understanding human-computer interaction dynamics. Drawing from Speech Act Theory’s focus on illocutionary force (Austin, 1975), we can establish a framework that identifies distinct communicative intentions while recognizing that most user inputs combine multiple categories simultaneously.

Commands/Instructions constitute perhaps the most recognizable category of user input, where users explicitly request LLM execution of specific tasks. Examples include “Write a poem about autumn” or “Generate Python code to sort a list”. These speech acts carry clear performative intentions and typically elicit task-oriented responses where LLMs attempt to fulfil explicit user requests. Professional training in prompt engineering can significantly improve user expertise in crafting effective commands, with studies showing perceived expertise increases of approximately 10% following structured training programs (Bashardoust et al., 2024).

Informational Content represents a second category where users provide factual context, background data, or source material for LLM processing. These inputs function as contextual grounding rather than explicit task requests, like “I’m using [Brand] [Model] phone and how should I ……” Such inputs require LLMs to integrate provided information with subsequent queries of RAG or tasks. Users who provide excessively lengthy contextual information (e.g., detailed financial transaction records, inventory sales data) may inadvertently reduce model performance as attention mechanisms become diluted across extensive input text. Conversely, users who provide insufficient context may receive responses that lack necessary specificity or accuracy for their particular use cases.

Emotional Expressions constitute a third category encompassing personal feelings, emotional support seeking, and subjective experience sharing. Examples include “I feel overwhelmed by this assignment” or “This situation is really frustrating me”. These inputs challenge LLMs to provide appropriate empathetic responses while maintaining boundaries around their capabilities and avoiding therapeutic overreach. Studies indicate that LLMs consistently exhibit “emotional bonds” including empathy and validation when responding to emotional expressions, with these behaviours becoming more pronounced in multi-turn conversations (Zhou et al., 2020). While such responses may provide users with immediate emotional support, they also contribute to anthropomorphic perceptions that may lead to unpredictable dependency on AI systems for emotional needs.

Phatic/Social Information encompasses conversational maintenance elements including greetings, courtesy expressions, and social acknowledgments like “Hello”, “Thank you” or “How are you?”. Foundational Social Response Theory states “Individuals’ interactions with computers are fundamentally social” (Reeves and Nass, 1996). Research indicates that Users carry cultural politeness norms and expectations into AI interactions; misalignment leads to pragmatic failures and dissatisfaction, highlighting human-to-AI transfer of social conventions (Luo, 2025). This tendency reveals the persistence of human social schemas in novel technological interactions and suggests that users may be applying inappropriate mental models to AI interaction, because of LLMs’ distributed agency.

Hybridity Analysis reveals that most real-world user inputs combine multiple categories in visible or invisible ways of expression simultaneously, creating complex communicative acts that require sophisticated interpretation. This hybridized nature of user inputs connects directly to RLHF training methodologies discussed in Section 3.1.4, as human evaluators assess model responses that address multiple communicative intentions simultaneously.

3.2.2.2 The Negotiation of Meaning: Anthropomorphism, Trust, and Cognitive Biases

When interacting with LLMs, users unconsciously deploy cognitive patterns developed for human social interaction, leading to anthropomorphic perceptions and trust relationships that facilitate communication while simultaneously introducing significant risks.

Anthropomorphism trends in LLM interaction demonstrate remarkable consistency across user populations and interaction contexts. Research indicates that 57% of participants attributed some possibility for consciousness with ChatGPT, while “consciousness attributions were positively related to the frequency of using social AI apps (e.g., Replika, Character AI), not to general chatbots or digital assistants” (Colombatto et al., 2025). Users consistently describe interactions with LLMs using human relational language, referring to AI “experience” and “intelligence” (Colombatto et al., 2025) despite technical understanding that these systems lack consciousness or intentionality. When users perceive LLMs as possessing human-like qualities, they may apply inappropriate expectations regarding consistency, intentionality, and responsibility that can lead to disappointment or harmful over-dependence on AI advice or emotional support. In addition, while anthropomorphism enables intuitive interaction, it obscures infrastructural power — users blame “unhelpful AI” not provider-designed constraints, while it could also act as a necessary user heuristic.

Trust building processes in human-LLM interaction demonstrate patterns similar to human social relationships while exhibiting unique characteristics related to AI system properties. Research using trust frameworks based on Castelfranchi and Falcone’s delegation-based model (Castelfranchi and Falcone, 1998) reveals that users assess LLM trustworthiness across dimensions including competence (ability to perform tasks effectively), willingness (perceived motivation to help users), and safety (assurance that the system poses no harm). User trust in LLMs appears to develop through perceived reliability in task completion, consistency in response quality, and appropriate boundary maintenance around system capabilities (Afroogh et al., 2024). However, this trust building occurs within a context where users cannot directly observe LLM internal states or decision-making processes – the explainability concern, requiring trust assessment based solely on output quality and consistency, but also indicated the demand of an operational new approach of addressing explainability, that include the methodology of intervening training dataset in Section 3.1. Experimental research examining trust dynamics between humans and different AI system types reveals that users exhibit higher trust restoration tolerance for systems identified as human rather than LLM agents, suggesting that anthropomorphic framing directly influences trust dynamics (Afroogh et al., 2024). These findings indicate that trust relationships with AI systems may be fundamentally different from human trust relationships, requiring new frameworks for understanding appropriate trust calibration in human-AI interaction.

Cognitive bias Cognitive bias manifestations in LLM interaction create systematic patterns of over-reliance, under-scrutiny, and inappropriate confidence in AI outputs. Automation bias, originally identified in aviation and nuclear power contexts, manifests in LLM interaction through both commission bias (accepting incorrect AI recommendations despite contradictory evidence) and omission bias (failing to act when AI systems don't alert users to problems). The sophisticated language capabilities of modern LLMs may exacerbate the commission bias by producing responses that appear well-reasoned and confident even when containing significant errors. Omission bias is particularly concerning in high-stakes applications where users may not pursue necessary verification or consultation with human experts because AI responses appear sufficiently complete (Parasuraman and Manzey, 2010). The availability heuristic influences user assessment of LLM accuracy based on memorable instances of correct responses, potentially leading to systematic overestimation of AI capabilities. Research indicates that those cognitive shortcuts developed for managing information overload and decision complexity in human environments may be poorly suited to AI interaction contexts where traditional social cues for reliability assessment are absent (Malberg et al., 2024).

The psychological processes underlying these shortcuts stem from cognitive schemas built through human social experience, which assume that confident, articulate communication indicates competence and reliability (Tiedens and Fragale, 2003). LLMs exploit these schemas inadvertently by producing linguistically sophisticated responses that trigger human trust mechanisms despite the absence of actual understanding or intentionality behind the responses.

Stress and cognitive load appear to increase reliance on these shortcuts, suggesting that users may be most vulnerable to cognitive biases precisely when they most need careful evaluation of AI outputs (Yu, 2016). This dynamic creates particular challenges for high-pressure applications where users may be simultaneously most dependent on AI assistance and least capable of critically evaluating AI recommendations.

4 Future-Oriented LLM Literacy

LLMs have shifted the locus of informational power from human‐edited media to probabilistic, data-driven and text-based information processor. Traditional media literacy—originally designed for linear broadcast systems and search engines, and later extended to social media—do not equip citizens to interrogate probability, preconceived positions, mediating layer intervention, and reflect the cognitive shortcuts of oneself. This chapter therefore synthesises the argues in Chapters 2 and 3 and proposes LLM Literacy: a multidimensional, critical-thinking framework that prepares individuals to engage responsibly with text-based and emerging multimodal AI while a specialized subset of AI literacy focused specifically on critical engagement with LLM, extending beyond general AI literacy frameworks (Long and Magerko, 2020).

4.1 From Media Literacy to LLM Literacy

Media literacy historically emphasised the ability to access, analyse, evaluate, and create messages across print and broadcast channels (Livingstone, 2003). Recent AI literacy frameworks—including UNESCO’s AI Competency Framework (Cukurova and Miao, 2024, Miao and Shiohira, 2024) and the OECD-aligned AILit initiative (OECD, 2025)—extend these goals to algorithmic contexts, but they remain tool-agnostic. Extra challenges exists:

Probabilistic Generation: Both inputs and outputs are processed and represented in forms of tokens (Section 1.1), not curated facts, so response is a moving statistical target (Section 3.1.2).

Invisible Mediation: System prompts, RAG, compliance filters intervenes responses from LLMs together, while agentic mediating layer could perform consequential result in the real world (Section 3.2.1.2, 3.2.1.3)

Mechanism Constraints: Hallucinations and retrieval errors resemble semantic not channel, noise, detecting them requires meta-level reasoning beyond fact-checking; Context window limits LLMs' capability in continuity (Section 3.1.1).

Cognitive Offloading: Over trust on LLMs correlates with reductions in critical-thinking effort and analytic reasoning, while promote emotional dependency risks (Section 3.2.2.2).

Example: Users interpreting ChatGPT’s confident tone as “intentional expertise” (See 3.2.2.2), unaware of its statistical parroting (Section 1.2.2) or intervention from mediating layer (3.2.1.2).

These dynamics demand a literacy model that foregrounds probabilistic epistemology, distributed agency, technical capabilities, and human-centric values.

4.2 Core Competency Areas

Building on comparative gaps (Chapter 2) and the technical pipeline (Chapter 3), we define five interlocking competency areas.

4.2.1 Technical Understanding

Learners must be aware of how training, tokenization, context windows, and mediating layer shape what an LLM can and cannot express. Knowing that the causal relationship of training, meditation layer, semantic/token noise, and how to arrange appropriate context for achieving better performance.

4.2.2 Critical Evaluation

Users need structured heuristics—source triangulation, chain-of-thought solicitation, self-consistency checks—to audit outputs. Another crucial dimension is that users must differentiate Facts from LLM-performed inferences, to seek corroboration for factual claims via primary sources or reputable databases and treat analytic summaries or explanations generated by the model as hypotheses requiring verification, not as definitive truths. Besides basic critical thinking approaches, new reflect methodologies for LLM interaction like seeking responses from multiple LLMs for cross verification, or starting a new conversation with former response for reverse verification should also be considered and developed.

4.2.3 Ethical Awareness

Bias, representation, and privacy risks follow users across queries. Frameworks such as Digital Promise’s “Core Values” of Digital Equity Framework (Jackson et al., 2024) emphasizes justice and accountability, yet LLM-specific bias manifests at both training and operating stages. Recognize that model inferences may embed social biases or overgeneralizations absent in the underlying data; question the assumptions underlying these analogies, and apply social‐impact assessment to inferred insights. Further, users should trace the content back to the original source and obtain permission or provide citation in line with fair-use and licensing norms, while acknowledge LLMs’ contribution—e.g., by stating “Drafted with assistance from [Model Name]”—while making clear which portions reflect human-authored interpretation and which are AI-generated.

4.2.4 Interaction Design

Prompt engineering is now a civic skill (Walter, 2024). Refine the prompt - the question - by including or reducing context, clarifying intent, and iterating through feedback loops could achieve better response. Multimodal extensions—e.g., image plus text—follow the same principles of contextual grounding and iterative refinement.

4.2.5 Systemic Awareness

Finally, users must recognize economic and regulatory forces shaping AI outputs, understand access pricing and compliance filters explains why certain answers are poor, truncated or refused, be cautious about permissions authorized to LLM-based agent and know how to break agent’s task if unexpected act occurs. This systemic literacy prevents misplaced trust in LLM and clears the responsibility across developers, providers, and users.

4.3 Pedagogical Progression

Drawing on AI-literacy taxonomies that ascend from “Know” to “Empower”, we propose a staged curriculum as Table 1:

Table 1: AI-literacy Taxonomy

These competencies directly operationalize the Meaning Construction Layer of human-LLM communication, empowering users to navigate probabilistic outputs and co-create semantic value.

4.4 Extending to Multimodal Interfaces

LLM Literacy primarily addresses text, yet principles transfer to image, audio, and video generation. Multimodal systems share probabilistic sampling, training-data opacity, and provider mediation. Users must evaluate image attribution, detect deepfakes, and understand diffusion priors just as they assess token probabilities. Framework modules can adapt text-based critical-evaluation rubrics to visual forensics or help identifying AI-generated content (e.g., cross-check, metadata checks, reverse-image search).

4.5 Policy and Equity Implications

LLM literacy would be a prerequisite that concerns defending human being’s right of equal environment of development and fairly perceiving information. Without broad perspective of LLM literacy, powerful tools for obtaining/analysing information, and expertise concentrates among providers and technical elites. Curricula should integrate LLM literacy across subjects or ages, mirroring calls by UNESCO (Miao and Shiohira, 2024, Cukurova and Miao, 2024) and the World Economic Forum to treat AI literacy as a core competency.

4.6 Conclusion

LLM literacy reframes how people obtain and perceive information from LLM, in an era when conversation itself will be probabilistically synthesized. By combining technical insight, critical evaluation, ethical vigilance, interaction craftsmanship, and systemic awareness, the framework equips citizens to harness AI’s benefits while resisting its pitfalls. These competencies support the argument of the human-centric methodology of users’ interacting with LLM which constructs the target of interacting side of hybrid model which will be proposed in Chapter 5, reclaim that users—not algorithms—remain the ultimate constructor of meaning.

5 A Hybrid Model for Human-LLM Communication

As explored in the last three chapters, this chapter proposes a hybrid communication model specifically designed for human-LLM interaction. This model integrates the limitations of traditional communication theory identified in Chapter 2, the technical mechanism complexities revealed in Chapter 3, and the LLM literacy requirements proposed in Chapter 4, aiming to provide a systematic theoretical toolkit for understanding and designing human-LLM communication.

The hybrid model proposed in this chapter has four core features: (1) it acknowledges the distributed nature of agency among users, models, developers, and service providers; (2) it treats semantic noise, memory constraints, and hallucination phenomena as intrinsic structural features of the system rather than accidental failures; (3) it conceptualizes the service providers’ mediating layer as a new agenda-setting mechanism; and (4) it integrates the LLM literacy framework from Chapter 4, emphasizing the central role of user agency in meaning construction. It posits that human-LLM communication is not merely a linear transmission of information, nor a perfectly symmetrical interactive process, but rather a multi-layered, co-constitutive phenomenon where meaning is dynamically constructed through the interplay of human intent, algorithmic processes, and mediating factors. In addition, agency is functionally distributed across users, developers, providers, and models, yet often attributed singularly to the LLM by users—a tension requiring literacy-driven resolution

5.1 Humanities Interventions in the Training Pipeline

Conclusively, for training stage, the causal relationships stated above imply three potential methodologies of humanities studies like arts, linguistics, or communication to address human-centric concerns (bias, misinformation/disinformation, values, intentions, etc.):

a. Align massive pre-training corpora to human-centered values;

b. Co-design high-quality, culturally inclusive fine-tuning datasets;

c. Embed interpretive, ethics-driven evaluation rubrics into RLHF reward modelling.

Embedding humanities perspectives at the data level is even more critical for multimodal models (e.g., image/video generators), where cultural representation, visual bias, and narrative framing add complexity that demands humanistic scrutiny from the outset.

Although explainability methods in computer science remain challenging (Chang et al., 2024, Molnar, 2020), embedding humanities-based interventions at each training stage—pre-training, fine-tuning, RLHF—will increase the likelihood that LLM outputs reflect human-centered values, while emergent behaviours and downstream complexities will still require ongoing vigilance, which may bring humanities studies’ participation from observing to contributing from the very beginning stage of LLM, while promoting new sub-areas, for example:

The communicative dimensions of Transformer Architecture outlined in 3.1.1—parallel processing as discourse coherence, self-attention as contextual inference, context windows as memory constraints—demand humanities methodologies not merely for dataset curation, but for reimagining the algorithmic encoding of human communication, which suggests communication researchers today should establish new approaches beyond the dataset, even the Transformer Architecture and its technical/mathematical mechanisms, echoed to how Shannon and Weaver (1947) bridged mathematical formalism with human communication models, contemporary scholars may develop 'interpretive mathematics' for Transformers—translating attention weights, layer interactions, and context window management into human-understandable narratives of symbols, value, and social construction. By developing interpretive frameworks for Transformer Architecture, scholars could make human intentions and cultural values interpretable at the algorithmic level—and thereby help design supportive dataset that intertwine technical precision with ethical commitments.

Humanities perspectives are already considered to help shaping RLHF reward signals; rather, they need to co-design the very foundations of LLM training, moving from the consultant, observing and critique stage at this moment. From the outset, data-centric engineering must adopt humanities-informed methods—critical theory to diagnose bias, cultural studies for inclusive representation, and narrative analysis to preserve meaning. During fine-tuning, ethics scholars, linguists, and artists should collaborate to craft pluralistic instruction sets. Beyond feedback, they must co-develop evaluation rubrics that encode tone, agency, and ethical stakes into model objectives. Such deep participation—from dataset conception through reward modelling—requires new interdisciplinary infrastructures and methods, transforming artistic and humanistic inquiry into scalable innovations capable of steering LLMs toward genuinely human-centered intelligence.

These interventions face epistemological challenges unique to computational humanities: How to quantify cultural inclusion at mathematical/computational level? How to translate narrative analysis into scalable dataset filters? Addressing these requires not just engineering solutions and scaling humanities-informed evaluation methods, but new hybrid methodologies—where computational scientists collaborate with humanities scientists for co-designing a larger cross-disciplinary framework, to address curating vast pre-training corpora, defining and operationalizing cultural values. Acknowledging these hurdles highlights the need for early, sustained investment in the necessary methodologies and infrastructures.

How hybrid methodologies—blending technical mechanisms with humanistic values—can systematically embed these values into trained models will be elaborated in the Chapter 6.

5.2 The Interaction Workflow Layer: Mapping Distributed Agency

The first layer - the Interaction Workflow Layer, provides a comprehensive map of the human-LLM communication pipeline. Human-LLM interactions are characterized by a distributed agency, where multiple entities contribute to the generation, mediation, and reception of messages. This layer identifies these key independent entities and elucidates their relationships and roles within the communication process, highlighting how their interplay shapes the final communicative outcome.

5.2.1 Key Entities in the Interaction Workflow

We identify five primary entities that collectively constitute the human-LLM communication workflow:

1. Human User: The initiator of the communication, formulating prompts, interpreting responses, and often engaging in iterative refinement. The user brings their unique cognitive biases, prior knowledge, communicative intent, and emotional state to the interaction. As discussed in Chapter 4, the user’s LLM Literacy significantly influences their ability to effectively interact with and critically evaluate LLM outputs.

2. LLM: The core algorithmic entity responsible for processing input and generating responses. The LLM’s capabilities are shaped by its architecture (e.g., Transformer), its training data, and its internal parameters. It operates based on probabilistic predictions, generating sequences of tokens that form coherent linguistic outputs (Chapter 3). The LLM itself is not an intentional agent in the human sense, but its outputs can evoke perceptions of agency in users.

3. LLM Developer: The entity responsible for designing, training, and refining the LLM. This includes curating massive datasets, determining model architecture, and implementing alignment techniques like Reinforcement Learning from Human Feedback (RLHF). Developers embed certain values, biases, and capabilities into the model through their design choices, as highlighted in Section 3.1.5. Their decisions profoundly influence the LLM's inherent communicative tendencies.

4. LLM Service Provider: The entity that deploys and provides access to the LLM, often through App, web interfaces, or integrated applications. Service providers implement various technical interventions that mediate the interaction between the user and the raw LLM output. These interventions, as discussed in Section 3.2.1.2, include system prompts, Retrieval-Augmented Generation (RAG), and compliance filters, which collectively exert a significant influence on the final output and its framing. This constitutes a form of triple-layer agenda setting, where the provider shapes not only what information is presented but also how it is framed. Generally, API access faces less intervene because it requires a second develop and is meant to be integrated into an existing system who is the final service provider to users, who is responsible for content security. Under this circumstances, the LLM API provider should not be considered as the only LLM service provider (Williams et al., 2025).

*External Knowledge Bases/Tools: Usually provided by the Service Provider or an Agentic Mediating Layer, while the service provider can only access but cannot intervene, which should be mentioned independently. These include databases, APIs (to access external systems like web search engine) and other tools that the LLM can access to retrieve information or perform actions. As discussed in Section 3.2.1.2, RAG obtains these external sources, introducing another layer of mediation and potential bias through source selection and curation. The Agentic Mediating Layer, as described in Chapter 3, further extends this by enabling LLMs to decompose complex tasks and orchestrate tool use, leading to real-world consequences.

This conceptualization of distributed agency aligns with Rammert’s (2012) notion of “hybrid constellations of inter-agency” in human-machine interaction, where agency emerges from the interaction between multiple actors rather than residing within individual entities (Rammert, 2012).

5.2.2 The Communication Flow and Interdependencies

The interaction workflow is a dynamic, iterative process, moving beyond the simplistic linear or even circular models of traditional communication theory (Chapter 2). It can be conceptualized as follows:

1. User Prompt (Input): The human user initiates communication by a prompt. This prompt is not merely a request for information but an expression of communicative intent, shaped by the user's cognitive frame, prior knowledge, and expectations. The effectiveness of this prompt is directly tied to the user’s LLM Literacy, particularly their prompt engineering skills (Section 4.2.4).

2. Service Provider Mediation (Input Processing): User’s prompt first passes through the Service Provider’s infrastructure. Here, system prompts, compliance filters, and potentially RAG mechanisms (if activated, while the decision of whether initiate a RAG request could also be made by LLM) preprocess the input. This stage represents a critical point of mediation, shaping the input before it reaches the core LLM. The Service Provider also manages access to external knowledge bases and tools, which can be invoked based on the prompt's content or system configurations.

3. LLM Processing and Response Generation: The pre-processed input is then submitted to the LLM. The LLM, leveraging its Transformer architecture, self-attention mechanisms, and vast training data, generates a response. This process is inherently probabilistic, predicting the most likely sequence of tokens based on the input and its learned patterns (Section 3.1.2). The LLM’s context window (Section 3.1.1.3) dictates its ability to maintain coherence over extended interactions.

4. Service Provider Mediation (Output Processing): The LLM-generated response then passes back through the Service Provider. At this stage, further intervention occurs, including additional compliance filtering, reformatting, generating charts according to LLM’s instructions, or the integration of information retrieved via RAG.

5. User Interpretation and Feedback: The user actively interprets the response based on their LLM Literacy, critical evaluation skills (Section 4.2.2), and ethical awareness (Section 4.2.3). The user may then provide feedback for further discussion or refine their prompt then initiate a new turn in the conversation, creating a continuous feedback loop. This iterative process highlights the co-constructive nature of human-LLM communication, where both parties (human and LLM, mediated by developers and providers) mutually shape the discourse.

This Interaction Workflow Layer emphasizes the distributed nature of agency in human-LLM communication. While the user initiates and interprets, the LLM Developer embeds foundational capabilities and biases, and the LLM Service Provider actively mediates the flow of information, shaping both input and output. This complex interplay moves beyond simplistic sender-receiver models, acknowledging the multi-faceted influences on communicative outcomes.

5.3 The Meaning Construction Layer: Bridging Algorithms and Humanistic Values

The second, and arguably most critical, layer of hybrid model is the Meaning Construction Layer. This layer addresses the fundamental question of how meaning is generated, interpreted, and negotiated within human-LLM interactions. Unlike traditional communication theories that often assume a shared understanding of meaning between human interlocutors, the probabilistic nature of LLM outputs and the inherent differences in human and algorithmic cognition necessitate a re-evaluation of how meaning is constructed. This layer explicitly integrates the technical mechanisms deconstructed in Chapter 3 with the human-centric values and competencies advocated in Chapter 4, arguing that a holistic understanding of human-LLM communication requires bridging the divide between computational processes and human interpretation.

5.3.1 Meaning as Probabilistic Linguistic Combination

Key technical mechanisms influencing this probabilistic meaning construction include:

1. Tokenization and Transformer Architecture: The fundamental unit of meaning for an LLM is the token (Section 1.1). The Transformer architecture, with its self-attention mechanisms (Section 3.1.1.2), allows the LLM to identify statistical relationships between tokens across a given context window (Section 3.1.1.3). This enables the LLM to generate linguistically coherent and contextually relevant responses, even if it lacks human-like semantic understanding. The “meaning” embedded at this stage is purely statistical – the probability of certain tokens co-occurring or following others.

2. Training Data and Fine-tuning: The massive datasets used in pre-training (Section 3.1.2) and the curated datasets for fine-tuning (Section 3.1.3) inherently shape the LLM’s probabilistic understanding of language. Biases, cultural norms, and factual inaccuracies present in the training data can be amplified in the LLM’s outputs, influencing the “meaning” it presents. RLHF (Section 3.1.4) further refines this by aligning outputs with human preferences, but as noted, the reward model is a statistical proxy, not a genuine evaluator of meaning or truth.

3. System Prompts and RAG: As discussed in Section 3.2.1.2, system prompts and RAG mechanisms introduce external influences that shape the LLM’s probabilistic output. System prompts can subtly steer the model’s tone, persona, and even the framing of information, while RAG integrates external knowledge. These interventions directly affect the linguistic combinations presented to the user, thereby influencing the raw material for meaning construction.

5.3.2 Meaning as Human Interpretation and Co-Construction

Key humanistic values and competencies from LLM Literacy (Chapter 4) that are crucial for this meaning construction include:

1. Technical Understanding (Section 4.2.1): A user’s awareness of how LLMs generate responses (e.g., tokenization, context windows, probabilistic nature) is fundamental. Understanding these technical constraints allows users to critically assess the LLM’s output, recognizing that it is a statistical product. This technical understanding helps mitigate the anthropomorphic bias (Section 1.2.2) that can lead to over-attribution of consciousness or intentionality to the LLM.

2. Critical Evaluation (Section 4.2.2): This is paramount for meaning construction. Users must employ structured heuristics—such as source triangulation, chain-of-thought solicitation, and self-consistency checks—to audit LLM outputs. Differentiating facts from LLM-performed inferences and seeking corroboration for factual claims are essential steps in validating the “meaning” presented by the LLM. This critical lens allows users to move beyond surface-level linguistic coherence to deeper semantic accuracy.

3. Ethical Awareness (Section 4.2.3): Meaning construction is also an ethical act. Users must be aware of potential biases embedded in LLM outputs due to training data or algorithmic design. Recognizing that model inferences may embed social biases or overgeneralizations is crucial for responsible interpretation. Ethical awareness guides users to question the assumptions underlying LLM responses and to consider the social impact of the information they receive and disseminate.

4. Interaction Design (Section 4.2.4): The user’s ability to refine prompts and engage in iterative feedback loops directly influences the LLM’s subsequent linguistic combinations. By providing clear, context-rich prompts, users can guide the LLM towards generating more precise and relevant outputs, thereby actively shaping the raw material for meaning construction. This highlights the transactional nature of human-LLM communication, where both parties contribute to the evolving discourse.

5. Systemic Awareness (Section 4.2.5): Understanding the economic, regulatory, and provider-driven forces that shape LLM outputs is vital for accurate meaning construction. Awareness of compliance filters, access pricing, and the “dual-layer agenda setting” (Section 3.2.1.2) helps users contextualize LLM responses, preventing misplaced trust and clarifying the distributed responsibility across developers, providers, and users. This systemic understanding allows users to interpret LLM outputs not in isolation, but within the broader socio-technical ecosystem.

5.3.3 The Interplay: Bridging the Algorithmic and the Human

Meaning does not reside solely within the LLM’s output, nor is it solely a product of human cognition. Instead, it emerges from the dynamic interaction between the two. The LLM provides the potential for meaning through its statistically generated text, while the human user actualizes that meaning through interpretation, critical evaluation, and contextualization.

This bridging process is facilitated by:

1. Algorithmic Transparency (Ideal): While full transparency into LLM internal workings remains a challenge, understanding the principles of Transformer architecture, tokenization, and training methodologies (Chapter 3) allows users to develop a more informed mental model of the LLM. This reduces the “black box” effect and enables more realistic expectations of LLM capabilities and limitations.

2. Human-Centric Design: Developers and service providers have a crucial role in designing LLMs and their interfaces to facilitate responsible meaning construction. This includes implementing clear indicators of AI-generated content, providing tools for source verification, and designing systems that encourage critical engagement rather than passive consumption. The call for humanities interventions in the training pipeline (Section 3.1.5) directly supports embedding human-centered values at the algorithmic level, influencing the probabilistic outputs to be more aligned with ethical and societal norms.

3. Iterative Refinement and Feedback Loops: The continuous feedback loop between user and LLM, mediated by the service provider, allows for iterative refinement of meaning. Misinterpretations or unsatisfactory responses can be corrected through subsequent prompts, gradually aligning the co-constructed meaning. This process mirrors the transactional models of communication (Section 2.1.3) but with an explicit acknowledgment of the algorithmic partner.

In essence, the Meaning Construction Layer argues that effective human-LLM communication is a collaborative endeavor where the human user, equipped with LLM Literacy, actively engages with the LLM’s probabilistic outputs to construct meaningful interpretations. This layer moves beyond simply describing what LLMs do (generate text) to explaining how humans make sense of what LLMs do, thereby providing a robust framework for understanding the semantic and pragmatic dimensions of this novel communicative paradigm.

5.4 Addressing Theoretical Gaps and Answering Research Questions

5.4.1 Re-evaluating Traditional Communication Theories (RQ1)

To what extent can traditional communication theory explain the human-LLM interactions?

This hybrid model demonstrates that while these theories offer partial lenses, they are fundamentally insufficient to capture the full spectrum of this novel communicative paradigm. The Interaction Workflow Layer and Meaning Construction Layer collectively highlight where traditional models fall short and how our hybrid approach provides the necessary extensions.

Limitations of Linear Transmission Models (Shannon-Weaver): As discussed in Section 2.1.1, Shannon-Weaver models quantify token-level noise but fail to account for semantic meaning or distributed agency. Our Meaning Construction Layer directly addresses the semantic meaning by emphasizing that while LLM outputs are probabilistic linguistic combinations, the human user actively constructs meaning through interpretation and critical evaluation. The Interaction Workflow Layer explicitly recognizes the distributed agency across users, LLMs, developers, and service providers, moving beyond the fixed sender-receiver dichotomy. The hybrid model thus provides a framework where semantic fidelity and distributed influence are central, not peripheral, concerns.

Limitations of Interactive Communication Models (Schramm): Schramm’s models introduce feedback loops but presume symmetrical cognitive capabilities (Section 2.1.2). Our hybrid model acknowledges the iterative feedback loops within the Interaction Workflow Layer but explicitly accounts for the cognitive asymmetry between humans and LLMs. The Meaning Construction Layer highlights how the human user’s LLM Literacy is essential for bridging this gap, enabling them to critically evaluate the LLM’s probabilistic outputs rather than assuming shared understanding.

Limitations of Transactional Communication Models (Barnlund): While transactional models recognize co-construction and context-dependence, they assume human communicators with shared perceptual channels and do not adequately address technical constraints like memory loss (Section 2.1.3). Our hybrid model embraces the co-constructive nature of communication within the Meaning Construction Layer but grounds it in the technical realities of the Interaction Workflow Layer. The model explicitly incorporates LLM-specific limitations such as finite context windows (which lead to a form of non-human forgetting) and the potential for out-of-distribution hallucinations, providing a more comprehensive account of the transactional dynamics.

In essence, our hybrid model does not discard traditional theories but rather integrates their valuable insights into a broader, more technologically informed framework. It demonstrates that while traditional theories can explain certain aspects of human-LLM interactions, a comprehensive understanding requires a hybrid approach that accounts for distributed agency, cognitive asymmetry, technical constraints, and the unique process of meaning construction in this new communicative context.

5.4.2 Constructing a Hybrid Model (RQ2)

What hybrid components best model human-LLM communication dynamics?

Our hybrid model directly answers this question by proposing a two-layered structure that synthesizes elements from computer science, communication studies, and media studies.

The Interaction Workflow Layer: This layer is a crucial component for modeling the distributed agency and multi-faceted mediation inherent in human-LLM communication. By identifying the four key entities (Human User, LLM, Developer and Service Provider) and mapping their interdependencies, this layer provides a comprehensive blueprint of the communication pipeline. This moves beyond simplistic models to capture the complex socio-technical system in which human-LLM interactions are embedded. The concept of triple-layer agenda setting (Section 3.2.1.2), where user prompts set the first-level agenda, system prompts/RAG set the second-level agenda and the indirect macro-level, the third agenda (accumulated from the second-level shaped among conversations of all users) is a prime example of a hybrid component that accurately models the subtle yet powerful influence of algorithmic mediation.

The Meaning Construction Layer: This directly addresses the epistemological challenge of meaning in human-LLM communication. By positing that meaning is a co-constructed phenomenon—emerging from the interplay between the LLM’s probabilistic linguistic combinations and the human user’s interpretive framework—it bridges the gap between technical functionality and humanistic understanding. The integration of LLM Literacy competencies (Technical Understanding, Critical Evaluation, Ethical Awareness, Interaction Design, Systemic Awareness) within this layer provides the necessary human-centric tools for navigating the complexities of algorithmic meaning-making. This layer acknowledges that while LLMs generate text, humans ultimately construct meaning, thereby reclaiming the user’s central role.

Integration of Technical and Humanistic Elements: The hybrid model’s strength lies in its seamless integration of insights from Chapter 3 (technical mechanisms) and Chapter 4 (LLM Literacy). For instance, understanding the LLM’s context window (Section 3.1.1.3) informs the Critical Evaluation competency (Section 4.2.2) by highlighting the limitations of long-term memory. Similarly, the call for humanities interventions in the training pipeline (Section 3.1.5) directly feeds into the ethical considerations within the Meaning Construction Layer, advocating for the embedding of human-centered values at the algorithmic level. This interdisciplinary synthesis is a key hybrid component that allows for a comprehensive and actionable understanding of human-LLM communication.

5.4.3 Benefits and Risks of Treating LLMs as Communicative Agents (RQ3)

What benefits or risks emerge from treating LLMs as communicative agents?

This hybrid model suggests that while users may naturally anthropomorphize LLMs (Section 1.2.2), a critical understanding of the underlying mechanisms is essential for mitigating risks and harnessing benefits.

Benefits: The perception of LLMs as communicative agents can enhance user engagement, facilitate more natural and intuitive interactions, and promote co-creative partnerships. The Meaning Construction Layer acknowledges that users actively co-construct meaning with LLMs, and a degree of perceived agency can make this process more fluid and productive. For example, in educational or therapeutic contexts, a perceived sense of partnership can foster trust and encourage deeper engagement. The model suggests that the benefits of treating LLMs as agents are realized when users are equipped with high levels of LLM Literacy, allowing them to engage critically and ethically.

Risks: The primary risk of treating LLMs as communicative agents without a critical understanding is the potential for over-trust, emotional dependency, and susceptibility to misinformation and manipulation. This hybrid model highlights these risks through several components: The Interaction Workflow Layer reveals the hidden mediation by service providers, demonstrating that the “agent” the user interacts with is not a singular entity but a product of multiple influences. This awareness helps mitigate the risk of misplaced trust in a seemingly autonomous agent. The Meaning Construction Layer emphasizes the probabilistic nature of LLM outputs, reminding users that the LLM is not an intentional, truth-seeking agent. This understanding is crucial for combating the risk of accepting hallucinations or biased information at face value. The LLM Literacy competencies, particularly Critical Evaluation and Ethical Awareness, provide the necessary tools to deconstruct the illusion of agency and to critically assess the LLM’s outputs. By understanding that the LLM is a sophisticated linguistic tool rather than a conscious entity, users can better manage their expectations and avoid the pitfalls of uncritical anthropomorphism.

In conclusion, our hybrid model suggests that the question is not whether to treat LLMs as agents, but rather how to engage with them given our natural tendency to do so. The model advocates for a form of “critical anthropomorphism”, where users can leverage the benefits of a more natural interaction style while remaining critically aware of the underlying technical mechanisms and socio-technical influences. This nuanced approach, grounded in the dual layers of this model, provides a robust framework for promoting responsible and beneficial human-LLM communication.

This hybrid model serves as a foundational theoretical framework for future research into human-LLM communication. It provides a lens through which scholars can analyse the evolving dynamics of human-AI interaction, offering insights for the responsible design, development, and deployment of LLM technologies. By emphasizing the interdisciplinary nature of this field, the model underscores the imperative for continued collaboration between computer science, communication studies, and other humanities disciplines to ensure that the advancement of AI serves human-centered values and promotes effective, ethical, and equitable communication in the age of artificial intelligence.

6 Discussion

This chapter extends the hybrid communication model proposed in Chapter 5 to analyse six interconnected issues that are both theoretically and practically significant for human–LLM communication. These issues—ranging from specialized interventions to governance in private deployments—are considered through the model’s dual-layer lens: the Interaction Workflow Layer, which maps distributed agency, and the Meaning Construction Layer, which foregrounds user agency and literacy in co-creating meaning.

6.1 Specialised LLMs for Large-Scale Conversation Interventions

As discussed in Sections 3.1.4 and 3.2.1, large-scale LLM deployments generate an immense volume of user–model dialogues that cannot be exhaustively reviewed through human oversight, while the absence of scalable evaluation mechanisms risks the persistence of undetected hallucinations, value misalignments, and harmful content. One promising approach is the development of specialised auditor LLMs, trained specifically to evaluate and annotate conversation logs for policy compliance, semantic fidelity, and risk indicators.

From the hybrid model’s perspective, such auditor systems would act as supervisor-agents within the Interaction Workflow Layer—operating between raw model outputs and human review. They could incorporate multi-agent debate and self-consistency checking (Chan et al., 2023) to detect anomalies across millions of sessions. However, recursive reliability concerns arise: the auditing model’s biases, alignment choices, and mediation mechanisms must themselves be transparent and subject to independent verification, potentially through cross-model consensus or selective human-in-the-loop escalation in high-stakes domains.

6.2 LLM Literacy Promotion and the “Starting Point” Principle

Chapter 4 introduced LLM Literacy as a user-side framework essential for sustaining meaningful agency in hybrid interactions. The literacy model suggests that LLM outputs should be treated as starting points for inquiry rather than definitive conclusions. Embedding this principle into educational policy, public workshops, and LLM application competitions could operationalise the model’s normative commitment to user empowerment.

Competitions, in particular, align with the Meaning Construction Layer by requiring participants to: (1) iteratively refine prompts, (2) cross-verify outputs from multiple models, and (3) explicitly justify decision-making based on LLM-generated suggestions. Such structured engagement reinforces probabilistic epistemology, counteracts automation bias (Parasuraman and Manzey, 2010), and cultivates the evaluative habits needed to resist over-trust in algorithmic authority, while exploring novel LLM application scenarios.

6.3 Distributed Agency in Multi-Agent Collaboration

Multi-agent orchestration—whether for collaborative writing, multi-step reasoning, or task automation—reconfigures the Interaction Workflow Layer by replacing the singular “LLM” node with a network of interacting agents. Each agent may have distinct training provenance, parameter settings, and mediation filters, leading to complex and potentially opaque agency chains.

This distribution of agency complicates attribution in both technical and regulatory contexts (Custers et al., 2025). Errors or biases may emerge from the interaction between agents rather than any single component. Adapting the hybrid model for multi-agent settings thus requires an additional provenance tracking sub-layer capable of recording, visualising, and auditing which agent contributed to each stage of the communicative act—an approach that could integrate with the auditability requirements of high-risk AI systems under the EU AI Act (2024b).

6.4 Hybrid Model Applicability to New Architectures (MoE, LLaDA)

Emerging architectures such as Mixture-of-Experts (MoE) and Large Language Diffusion Models (LLaDA) introduce novel internal decision-making processes. In MoE architectures, only a subset of expert sub-networks is activated per query (Shazeer et al., 2017), effectively creating a selective mediation mechanism within the LLM entity itself. LLaDA, by contrast, generate text through iterative denoising (Nie et al., 2025), (from “words by words” to “global to detail” generating from traditional LLM) potentially enhancing global coherence but increasing latency.

The hybrid model remains applicable to these architectures, but the “LLM” node in the Interaction Workflow Layer must be internally decomposed to account for intra-model selection and refinement stages. This decomposition enables analysis of how expert routing or iterative generation interacts with semantic noise, hallucination risk, and value encoding—ensuring that the model continues to capture agency distribution both within and outside the algorithmic core.

6.5 Regional Values, Internet Governance, and LLM Regulation

The hybrid model’s treatment of provider mediation (Section 3.2.1.2) offers a useful lens for understanding regional adaptation of communicative values. Both intervening mediating layers and training LLMs with curated dataset can reflect output (Section 3.1, 3.2.1.2) with jurisdiction-specific laws and cultural norms, embedding local priorities.

While such localisation can enhance cultural relevance and regulatory compliance, it also risks creating information fragmentation: users in different regions may receive systematically different framings or even contradictory factual claims for identical prompts.

6.6 Governance Challenges in Private Deployment and Responsible Open-Sourcing

Private deployments of open-source LLMs bypass the centralised mediation and compliance mechanisms maintained by commercial providers, shifting governance burdens to the deployer. This decentralisation enhances user autonomy and privacy but also removes safety interventions that can mitigate harmful outputs or disinformation.

The hybrid model can guide the design of embedded governance mechanisms for open-source releases—such as optional alignment modules, provenance metadata for outputs, and audit APIs that allow third parties to inspect operational logs without compromising user privacy. Responsible open-sourcing therefore requires not only permissive licensing but also a scalable governance toolkit that aligns with distributed agency principles while preserving the benefits of decentralisation and mitigating the risks of technological abuse.

6.7 Conclusion

Across these six thematic areas, this hybrid model demonstrates its analytical flexibility in accommodating new technical architectures, shifting governance regimes, and evolving interaction patterns. Its dual-layer structure enables simultaneous attention to infrastructural mediation (Interaction Workflow Layer) and user meaning-making practices (Meaning Construction Layer). By foregrounding distributed agency, probabilistic epistemology, and literacy-driven user empowerment, this model provides a foundation for both critical diagnosis and constructive design in the rapidly evolving landscape of human–LLM communication.

7 Conclusion

This dissertation has addressed a critical gap in our theoretical understanding of human-LLM interactions by developing a novel hybrid communication model specifically designed for these increasingly ubiquitous interactions. As LLMs become embedded in everyday communication practices, traditional communication theories have proven inadequate for explaining the unique dynamics that emerge when humans interact with probabilistic, AI-powered language systems.

7.1 Summary of Key Findings

This research has demonstrated four critical findings. First, existing communication theories are fundamentally inadequate for human-LLM interactions. Linear models cannot account for probabilistic generation, interactive models assume cognitive symmetry that doesn't exist between humans and LLMs, and transactional models remain anchored to human-centric assumptions about shared experience and sensory grounding.

Second, the hybrid communication model represents this dissertation's central theoretical contribution—a two-layer framework that addresses the entire interaction pipeline while specifically focusing on meaning construction where tokens are assigned semantic significance through the intersection of technical mechanisms and human interpretive processes. This model accommodates both the deterministic aspects of LLM architecture and the interpretive flexibility humans bring to these interactions.

Third, effective human-LLM communication requires specialized literacy competencies beyond traditional digital literacies. The proposed five-stage framework (Recognize, Resonate, Reflect, Responsible, Refine) provides structured approaches for maintaining human agency while leveraging LLM capabilities appropriately.

Fourth, the research confirms that interdisciplinary collaboration is essential for understanding and developing human-LLM communication systems. Technical approaches alone cannot address the communicative, cultural, and social dimensions of these interactions.

7.2 Addressing the Research Questions

The primary research question asked whether existing communication theories could adequately explain human-LLM interactions. The analysis provides a definitive negative answer—each examined framework exhibits fundamental limitations that represent categorical mismatches between theoretical assumptions and empirical realities.

Regarding the unique characteristics of human-LLM communication, this research identified several distinctive features: dynamic mental model shifts where users move from tool-oriented to anthropomorphic interaction patterns; mediated authenticity challenges created by RLHF and alignment processes; probabilistic interpretation difficulties where confident presentation styles mask statistical generation; and context window constraints that create novel memory management challenges.

7.3 Contributions to Knowledge

This dissertation makes several distinct contributions. Theoretically, it provides the first comprehensive communication model specifically designed for human-LLM interactions, addressing gaps in existing theory while providing practical analytical tools. Methodologically, it demonstrates successful integration of technical computer science concepts with humanistic communication theory. Practically, it offers actionable frameworks for literacy development and educational implementation. Critically, it reveals how cultural biases become embedded in RLHF processes, contributing to scholarship on AI fairness from a communicative perspective. Regulatory frameworks must distinguish between functional agency (requiring technical audits of developers/providers) and attributed agency (requiring user safeguards against anthropomorphic over-reliance).

7.4 Limitations and Future Directions

This research employed primarily theoretical analysis rather than original experimental investigation. Future research should empirically validate the hybrid model's predictive capabilities across diverse user populations and use contexts. The focus on text-based, English-language LLMs may limit cross-cultural applicability, suggesting need for cultural adaptation studies.

Several promising research directions emerge: longitudinal studies tracking how user interaction patterns evolve with extended LLM use; LLM literacy model needs further empirical examples (e.g., studies showing improved outcomes after literacy training) to strengthen claims; empirical research on mediating layers demands a sophisticated observing framework; observations on anthropomorphism would be better if supported with contradictory studies; language-specified investigations of interaction pattern variations; multimodal extensions encompassing visual and audio AI systems; organizational implementation studies examining institutional deployment contexts; and regulatory framework development translating theoretical insights into policy mechanisms.

Additional limitations include the lack of empirical validation of the proposed hybrid model and the rapidly evolving nature of LLM technology which may render some theoretical insights obsolete.

7.5 Final Reflections

This research emerges at a pivotal moment when LLMs are becoming increasingly sophisticated and ubiquitous. The findings suggest that the future of human-AI communication depends not primarily on technical advances but on developing appropriate theoretical frameworks, educational approaches, and institutional practices that preserve human agency while harnessing AI capabilities.

The hybrid model offers one approach to these challenges, but represents the beginning rather than end of necessary theoretical development. Most importantly, this research demonstrates that human-AI communication questions are not merely technical challenges but fundamentally humanistic inquiries requiring sustained attention to meaning, culture, value, and agency.

The interdisciplinary collaboration called for throughout this dissertation represents a practical necessity for developing AI systems that enhance rather than diminish human communicative capacities. As we integrate increasingly sophisticated AI into human communication, the frameworks developed here provide tools for ensuring this integration serves human purposes while preserving human values. The ultimate success of these frameworks depends on their adoption and continued development by diverse communities of practice, requiring adaptive, critical, and humane approaches to navigating our evolving technological landscape while preserving what is most valuable in human communicative experience.

References

2022. Natural Language Processing [Online]. National Library of Medicine. Available: https://www.nnlm.gov/guides/data-glossary/natural-language-processing [Accessed].

2023. 生成式人工智能服务管理暂行办法 [Interim Measures for the Management of Generative Artificial Intelligence Services, translated by Boliang Wang]. In: NDRC, MOE, MOST, MIIT, MPS & NRTA (eds.). Beijing: Cyberspace Administration of China.

2024a. Introducing Llama 3.1: Our most capable models to date [Online]. Meta AI. Available: https://ai.meta.com/blog/meta-llama-3-1/ [Accessed August 8 2025].

2024b. Regulation 2024/1689 of the Eur. Parl. & Council of June 13, 2024 (EU Artificial Intelligence Act). International Legal Materials, 1-148.

2025. Artificial Intelligence Playbook for the UK Government, Government Digital Service.

0XAB1D. 2024. ChatGPT GPT Plugins Leaked System Instructions Prompts [Online]. GitHub. Available: https://github.com/0xAb1d/GPTsSystemPrompts [Accessed August 8 2025].

ACADEMY, K. April 3, 2025 2025. Khan Academy’s Framework for Responsible AI in Education [Online]. Available from: https://blog.khanacademy.org/khan-academys-framework-for-responsible-ai-in-education/ [Accessed August 10 2025].

ACHIAM, J., ADLER, S., AGARWAL, S., AHMAD, L., AKKAYA, I., ALEMAN, F. L., ALMEIDA, D., ALTENSCHMIDT, J., ALTMAN, S. & ANADKAT, S. 2023. Gpt-4 technical report. arXiv preprint arXiv:2303.08774.

AFROOGH, S., AKBARI, A., MALONE, E., KARGAR, M. & ALAMBEIGI, H. 2024. Trust in AI: progress, challenges, and future directions. Humanities and Social Sciences Communications, 11, 1568.

ALZAABI, A., ALAMRI, A., ALBALUSHI, H., ALJABRI, R. & AALABDULSALAM, A. 2023. ChatGPT applications in Academic Research: A Review of Benefits, Concerns, and Recommendations. bioRxiv, 2023.08.17.553688.

AMAZON. What is LLM (Large Language Model)? [Online]. Amazon. Available: https://aws.amazon.com/what-is/large-language-model/ [Accessed May 2 2025].

AOYAMA, S., CHUJO, R., HAUTASAARI, A. & NAEMURA, T. 2025. Intersubjective Model of AI-mediated Communication: Augmenting Human-Human Text Chat through LLM-based Adaptive Agent Pair. arXiv preprint arXiv:2502.18201.

ATIL, B., AYKENT, S., CHITTAMS, A., FU, L., PASSONNEAU, R. J., RADCLIFFE, E., RAJAGOPAL, G. R., SLOAN, A., TUDREJ, T. & TURE, F. 2024. Non-determinism of" deterministic" llm settings. arXiv preprint arXiv:2408.04667.

AUSTIN, J. L. 1975. How to do things with words, Harvard university press.

BARNLUND, D. C. 2017. A transactional model of communication. Communication theory. Routledge.

BASHARDOUST, A., FENG, Y., GEISSLER, D., FEUERRIEGEL, S. & SHRESTHA, Y. R. 2024. The Effect of Education in Prompt Engineering: Evidence from Journalists. arXiv preprint arXiv:2409.12320.

BENDER, E. M., GEBRU, T., MCMILLAN-MAJOR, A. & SHMITCHELL, S. 2021. On the Dangers of Stochastic Parrots. FAccT '21: 2021 ACM Conference on Fairness, Accountability, and Transparency. Virtual Event Canada.

BENGIO, Y., MINDERMANN, S. & PRIVITERA, D. 2025. International AI safety report 2025.

BERGMANN, D. 2023. What is self-supervised learning? [Online]. IBM. Available: https://www.ibm.com/think/topics/self-supervised-learning [Accessed 4 August 2025].

BERGMANN, D. 2024. What is a context window? [Online]. IBM. Available: https://www.ibm.com/think/topics/context-window [Accessed 30 July, 2025 2025].

BROWN, T., MANN, B., RYDER, N., SUBBIAH, M., KAPLAN, J. D., DHARIWAL, P., NEELAKANTAN, A., SHYAM, P., SASTRY, G. & ASKELL, A. 2020. Language models are few-shot learners. Advances in neural information processing systems, 33, 1877-1901.

CAPEL, T. & BRERETON, M. 2023. What is Human-Centered about Human-Centered AI? A Map of the Research Landscape. Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems. Hamburg, Germany: Association for Computing Machinery.

CASTELFRANCHI, C. & FALCONE, R. 1998. Towards a theory of delegation for agent-based systems. Robotics and Autonomous systems, 24, 141-157.

CHAN, C.-M., CHEN, W., SU, Y., YU, J., XUE, W., ZHANG, S., FU, J. & LIU, Z. 2023. Chateval: Towards better llm-based evaluators through multi-agent debate. arXiv preprint arXiv:2308.07201.

CHANG, T. A., RAJAGOPAL, D., BOLUKBASI, T., DIXON, L. & TENNEY, I. 2024. Scalable influence and fact tracing for large language model pretraining. arXiv preprint arXiv:2410.17413.

CHARLESWORTH, A., FOTHERINGHAM, K., GAVAGHAN, C., SANCHEZ-GRAELLS, A. & TORRIBLE, C. 2023. Response to the UK’s March 2023 White Paper ‘A Pro-innovation Approach to AI Regulation’. June.

CHEN, C. & SHU, K. 2023. Can llm-generated misinformation be detected? arXiv preprint arXiv:2309.13788.

CHRISTOPHER, D. M., PRABHAKAR, R. & HINRICH, S. 2008. Introduction to information retrieval. Cambridge University Press.

CLARK, K., KHANDELWAL, U., LEVY, O. & MANNING, C. D. 2019. What does bert look at? an analysis of bert's attention. arXiv preprint arXiv:1906.04341.

COHEN, B. C. 2015. Press and foreign policy.

COLOMBATTO, C., BIRCH, J. & FLEMING, S. M. 2025. The influence of mental state attributions on trust in large language models. Communications Psychology, 3, 84.

COLOMBATTO, C. & FLEMING, S. M. 2024. Folk psychological attributions of consciousness to large language models. Neuroscience of Consciousness, 2024.

CUKUROVA, M. & MIAO, F. 2024. AI competency framework for teachers, UNESCO Publishing.

CUSKLEY, C., WOODS, R. & FLAHERTY, M. 2024. The limitations of large language models for understanding human language and cognition. Open Mind, 8, 1058-1083.

CUSTERS, B., LAHMANN, H. & SCOTT, B. I. 2025. From liability gaps to liability overlaps: shared responsibilities and fiduciary duties in AI and other complex technologies. AI Soc, 40, 4035-4050.

DAHLGREN LINDSTRöM, A., METHNANI, L., KRAUSE, L., ERICSON, P., DE RITUERTO DE TROYA Í, M., COELHO MOLLO, D. & DOBBE, R. 2025. Helpful, harmless, honest? Sociotechnical limits of AI alignment and safety through Reinforcement Learning from Human Feedback. Ethics Inf Technol, 27, 28.

DONG, G., YUAN, H., LU, K., LI, C., XUE, M., LIU, D., WANG, W., YUAN, Z., ZHOU, C. & ZHOU, J. 2023. How abilities in large language models are affected by supervised fine-tuning data composition. arXiv preprint arXiv:2310.05492.

DUBEY, A., JAUHRI, A., PANDEY, A., KADIAN, A., AL-DAHLE, A., LETMAN, A., MATHUR, A., SCHELTEN, A., YANG, A. & FAN, A. 2024. The llama 3 herd of models. arXiv e-prints, arXiv: 2407.21783.

GAO, Z., CHANG, J., ZHAN, W., OERTELL, O., SWAMY, G., BRANTLEY, K., JOACHIMS, T., BAGNELL, D., LEE, J. D. & SUN, W. 2024. Rebel: Reinforcement learning via regressing relative rewards. Advances in Neural Information Processing Systems, 37, 52354-52400.

GHAI, B. 2023. Towards fair and explainable ai using a human-centered ai approach. State University of New York at Stony Brook.

GOOGLE. 2024. Introduction to Large Language Models [Online]. Google. Available: https://developers.google.com/machine-learning/resources/intro-llms [Accessed May 2 2025].

GOOGLE. 2025a. Gemini Developer API Pricing [Online]. Available: https://ai.google.dev/gemini-api/docs/pricing [Accessed August 8 2025].

GOOGLE. 2025b. Google AI Plans and Features [Online]. Available: https://one.google.com/about/google-ai-plans/?hl=en-GB#plans-table [Accessed August 7 2025].

GORWA, R., BINNS, R. & KATZENBACH, C. 2020. Algorithmic content moderation: Technical and political challenges in the automation of platform governance. Big Data & Society, 7, 2053951719897945.

GUZMAN, A. L. 2018. What is human-machine communication, anyway. Human-machine communication: Rethinking communication, technology, and ourselves, 117, 1-28.

GUZMAN, A. L. & LEWIS, S. C. 2020. Artificial intelligence and communication: A human–machine communication research agenda. New media & society, 22, 70-86.

HAVRILLA, A. & IYER, M. 2024. Understanding the effect of noise in llm training data with algorithmic chains of thought. arXiv preprint arXiv:2402.04004.

HU, E. J., SHEN, Y., WALLIS, P., ALLEN-ZHU, Z., LI, Y., WANG, S., WANG, L. & CHEN, W. 2022. Lora: Low-rank adaptation of large language models. ICLR, 1, 3.

HU, K. 2023. ChatGPT sets record for fastest-growing user base - analyst note. Reuters.

IBM. 2023. What are large language models (LLMs)? [Online]. IBM. Available: https://www.ibm.com/think/topics/large-language-models [Accessed May 1 2025].

IBM. 2024. What is NLP (natural language processing)? [Online]. IBM. Available: https://www.ibm.com/think/topics/natural-language-processing [Accessed May 2 2025].

JACKSON, J. K., STARR, J. & WEAVER, D. A. 2024. A Framework for Digital Equity. Digital Promise.

KAPLAN, J., MCCANDLISH, S., HENIGHAN, T., BROWN, T. B., CHESS, B., CHILD, R., GRAY, S., RADFORD, A., WU, J. & AMODEI, D. 2020. Scaling laws for neural language models. arXiv preprint arXiv:2001.08361.

KAUFMANN, A. 2025. Understanding the Total Cost of Inferencing Large Language Models. Enterprise Strategy Group.

KAUFMANN, T., WENG, P., BENGS, V. & HüLLERMEIER, E. 2024. A survey of reinforcement learning from human feedback.

KERNER, S. M. 2024. What are large language models (LLMs)? [Online]. TechTarget. Available: https://www.techtarget.com/whatis/definition/large-language-model-LLM [Accessed May 1 2025].

KHARCHENKO, J., ROOSTA, T., CHADHA, A. & SHAH, C. 2024. How well do llms represent values across cultures? empirical analysis of llm responses based on hofstede cultural dimensions. arXiv preprint arXiv:2406.14805.

KIRK, H. R., WHITEFIELD, A., ROTTGER, P., BEAN, A. M., MARGATINA, K., MOSQUERA-GOMEZ, R., CIRO, J., BARTOLO, M., WILLIAMS, A. & HE, H. 2024. The PRISM alignment dataset: What participatory, representative and individualised human feedback reveals about the subjective and multicultural alignment of large language models. Advances in Neural Information Processing Systems, 37, 105236-105344.

KUMAR, A., ZHUANG, V., AGARWAL, R., SU, Y., CO-REYES, J. D., SINGH, A., BAUMLI, K., IQBAL, S., BISHOP, C. & ROELOFS, R. 2024. Training language models to self-correct via reinforcement learning. arXiv preprint arXiv:2409.12917.

LAKE, R. 2023. ‘This Changes Everything’: AI Is About to Upend Teaching and Learning [Online]. The 74 Media, Inc. Available: https://www.the74million.org/article/this-changes-everything-ai-is-about-to-upend-teaching-and-learning/ [Accessed August 10 2025].

LANCIERI, F., EDELSON, L. & BECHTOLD, S. 2025. AI regulation: Competition, arbitrage and regulatory capture. Theoretical Inquiries in Law, 26, 239-262.

LECUN, Y. & MISRA, I. 2021. Self-supervised learning: The dark matter of intelligence [Online]. Meta AI. Available: https://ai.meta.com/blog/self-supervised-learning-the-dark-matter-of-intelligence/ [Accessed August 4 2025].

LEE, E.-J. 2023. Minding the source: toward an integrative theory of human–machine communication. Human Communication Research, 50, 184-193.

LEWIS, P., PEREZ, E., PIKTUS, A., PETRONI, F., KARPUKHIN, V., GOYAL, N., KüTTLER, H., LEWIS, M., YIH, W.-T. & ROCKTäSCHEL, T. 2020. Retrieval-augmented generation for knowledge-intensive nlp tasks. Advances in neural information processing systems, 33, 9459-9474.

LEWIS, S. C., GUZMAN, A. L. & SCHMIDT, T. R. 2019. Automation, journalism, and human–machine communication: Rethinking roles and relationships of humans and machines in news. Digital journalism, 7, 409-427.

LI, Z., XU, S., MEI, K., HUA, W., RAMA, B., RAHEJA, O., WANG, H., ZHU, H. & ZHANG, Y. 2024. Autoflow: Automated workflow generation for large language model agents. arXiv preprint arXiv:2407.12821.

LIN, Y.-C., NEVILLE, J., STOKES, J. W., YANG, L., SAFAVI, T., WAN, M., COUNTS, S., SURI, S., ANDERSEN, R. & XU, X. 2024. Interpretable user satisfaction estimation for conversational systems with large language models. arXiv preprint arXiv:2403.12388.

LIU, A., FENG, B., XUE, B., WANG, B., WU, B., LU, C., ZHAO, C., DENG, C., ZHANG, C. & RUAN, C. 2024a. Deepseek-v3 technical report. arXiv preprint arXiv:2412.19437.

LIU, T., YAO, H., WU, T., QIN, Z., LIN, F., REN, K. & CHEN, C. 2024b. Mitigating privacy risks in LLM embeddings from embedding inversion. arXiv preprint arXiv:2411.05034.

LIVINGSTONE, S. 2003. The changing nature and uses of media literacy.

LLOBERA, J. & VALLVERDú, J. 2021. MaTIC: a Mathematical Theory of Inferential Communication. arXiv preprint arXiv:2111.06288.

LONG, D. & MAGERKO, B. What is AI literacy? Competencies and design considerations. Proceedings of the 2020 CHI conference on human factors in computing systems, 2020. 1-16.

LUO, X. 2025. Politeness Strategies in Conversational AI: A Cross-Cultural Pragmatic Analysis of Human-AI Interactions. Pinnacle Academic Press Proceedings Series, 3, 1-14.

LYMAN, A., HEPNER, B., ARGYLE, L. P., BUSBY, E. C., GUBLER, J. R. & WINGATE, D. 2025. Balancing Large Language Model Alignment and Algorithmic Fidelity in Social Science Research. Sociological Methods & Research, 54, 1110-1155.

MA, S., SALINAS, A., NYARKO, J. & HENDERSON, P. Breaking Down Bias: On The Limits of Generalizable Pruning Strategies. Proceedings of the 2025 ACM Conference on Fairness, Accountability, and Transparency, 2025. 2437-2450.

MAHARANA, A., LEE, D.-H., TULYAKOV, S., BANSAL, M., BARBIERI, F. & FANG, Y. 2024. Evaluating very long-term conversational memory of llm agents. arXiv preprint arXiv:2402.17753.

MALBERG, S., POLETUKHIN, R., SCHUSTER, C. M. & GROH, G. 2024. A comprehensive evaluation of cognitive biases in LLMs. arXiv preprint arXiv:2410.15413.

MAPLE, C., SABUNCUOGLU, A., SZPRUCH, L., ELLIOT, A., REINERT, G. & ZEMAITIS, T. 2024. The impact of large language models in finance: Towards trustworthy adoption. The Alan Turing Institute.

MARSOOF, A., LUCO, A., TAN, H. & JOTY, S. 2023. Content-filtering AI systems–limitations, challenges and regulatory approaches. Information & Communications Technology Law, 32, 64-101.

MASLEJ, N., FATTORINI, L., PERRAULT, R., GIL, Y., PARLI, V., KARIUKI, N., CAPSTICK, E., REUEL, A., BRYNJOLFSSON, E., ETCHEMENDY, J., LIGETT, K., LYONS, T., MANYIKA, J., NIEBLES, J. C., SHOHAM, Y., WALD, R., WALSH, T., HAMRAH, A., SANTARLASCI, L., LOTUFO, J. B., ROME, A., SHI, A. & OAK, S. 2025. The AI Index 2025 Annual Report. Stanford, CA: AI Index Steering Committee, Institute for Human-Centered AI.

MCCOMBS, M. E. & SHAW, D. L. 1972. The agenda-setting function of mass media. Public opinion quarterly, 36, 176-187.

MCMAHON, L. 2025. BBC threatens AI firm with legal action over unauthorised content use [Online]. BBC. Available: https://www.bbc.co.uk/news/articles/cy7ndgylzzmo [Accessed August 8 2025].

MIAO, F. & SHIOHIRA, K. 2024. AI competency framework for students, UNESCO Publishing.

MOLNAR, C. 2020. Interpretable machine learning, Lulu. com.

NEUMANN, A., KIRSTEN, E., ZAFAR, M. B. & SINGH, J. Position is Power: System Prompts as a Mechanism of Bias in Large Language Models (LLMs). Proceedings of the 2025 ACM Conference on Fairness, Accountability, and Transparency, 2025. 573-598.

NIE, S., ZHU, F., YOU, Z., ZHANG, X., OU, J., HU, J., ZHOU, J., LIN, Y., WEN, J.-R. & LI, C. 2025. Large language diffusion models. arXiv preprint arXiv:2502.09992.

NIU, K. & ZHANG, P. 2024. A Mathematical Theory of Semantic Communication: Overview. Available: https://ui.adsabs.harvard.edu/abs/2024arXiv240114160N [Accessed January 01, 2024].

NVIDIA. What are Large Language Models? [Online]. Available: https://www.nvidia.com/en-gb/glossary/large-language-models/ [Accessed May 2 2025].

OECD 2023. Regulatory sandboxes in artificial intelligence. OECD Digital Economy Papers. Paris.

OECD 2025. Empowering learners for the age of AI: An AI literacy framework for primary and secondary education (Review draft). Paris: OECD.

OPENAI. 2025a. ChatGPT Pricing [Online]. OpenAI. Available: https://openai.com/chatgpt/pricing/ [Accessed August 7 2025].

OPENAI. 2025b. Pricing - OpenAI API [Online]. OpenAI. Available: https://platform.openai.com/docs/pricing [Accessed August 8 2025].

OUYANG, L., WU, J., JIANG, X., ALMEIDA, D., WAINWRIGHT, C., MISHKIN, P., ZHANG, C., AGARWAL, S., SLAMA, K. & RAY, A. 2022. Training language models to follow instructions with human feedback. Advances in neural information processing systems, 35, 27730-27744.

PARASURAMAN, R. & MANZEY, D. H. 2010. Complacency and bias in human use of automation: an attentional integration. Hum Factors, 52, 381-410.

PAREJA, A., NAYAK, N. S., WANG, H., KILLAMSETTY, K., SUDALAIRAJ, S., ZHAO, W., HAN, S., BHANDWALDAR, A., XU, G. & XU, K. 2024. Unveiling the secret recipe: A guide for supervised fine-tuning small llms. arXiv preprint arXiv:2412.13337.

PARK, P. S., SCHOENEGGER, P. & ZHU, C. 2024. Diminished diversity-of-thought in a standard large language model. Behavior Research Methods, 56, 5754-5770.

PEEPERKORN, M., KOUWENHOVEN, T., BROWN, D. & JORDANOUS, A. 2024. Is temperature the creativity parameter of large language models? arXiv preprint arXiv:2405.00492.

QU, J., HU, K., ZHU, J., LI, W., WANG, T., CHEN, Z., YE, Y., LU, C., ZHOU, A. & WANG, X. 2025. Epitome: Pioneering an Experimental Platform for AI-Social Science Integration. arXiv preprint arXiv:2507.01061.

RADFORD, A., NARASIMHAN, K., SALIMANS, T. & SUTSKEVER, I. 2018. Improving language understanding by generative pre-training.

RADFORD, A., WU, J., CHILD, R., LUAN, D., AMODEI, D. & SUTSKEVER, I. 2019. Language models are unsupervised multitask learners. OpenAI blog, 1, 9.

RAMMERT, W. 2012. Distributed Agency and Advanced Technology. Or: How to Analyze Constellations of Collective Inter-Agency 1. 1 ed.: Routledge.

RAWAT, M., GUPTA, A., GOOMER, R., DI BARI, A., GUPTA, N. & PIERACCINI, R. 2025. Pre-Act: Multi-Step Planning and Reasoning Improves Acting in LLM Agents. arXiv preprint arXiv:2505.09970.

REEVES, B. & NASS, C. 1996. The media equation: How people treat computers, television, and new media like real people. Cambridge, UK, 10, 19-36.

REZWANA, J. & FORD, C. Improving User Experience with FAICO: Towards a Framework for AI Communication in Human-AI Co-Creativity. Proceedings of the Extended Abstracts of the CHI Conference on Human Factors in Computing Systems, 2025. 1-9.

SAGGU, K. S. & DAS, A. 2025. Exploring PLeak: An Algorithmic Method for System Prompt Leakage [Online]. Trend Micro. Available: https://www.trendmicro.com/en_us/research/25/e/exploring-pleak.html [Accessed].

SALEHI, S., EROL-KANTARCI, M. & NIYATO, D. 2025. Llm-enabled data transmission in end-to-end semantic communication. arXiv preprint arXiv:2504.07431.

SARTOR, G., SARTOR, G. & LOREGGIA, A. 2020. The impact of algorithms for online content filtering or moderation. Upload filters.

SCHNEIDER, J., FLORES, A. C. & KRANZ, A.-C. 2024. Exploring Human-LLM conversations: Mental models and the originator of toxicity. arXiv preprint arXiv:2407.05977.

SCHRAM, W. E. 1954. The process and effects of mass communication.

SHANNON, C. E. 1948. A mathematical theory of communication. The Bell system technical journal, 27, 379-423.

SHAZEER, N., MIRHOSEINI, A., MAZIARZ, K., DAVIS, A., LE, Q., HINTON, G. & DEAN, J. 2017. Outrageously large neural networks: The sparsely-gated mixture-of-experts layer. arXiv preprint arXiv:1701.06538.

SHIEH, E., VASSEL, F.-M., SUGIMOTO, C. & MONROE-WHITE, T. 2024. Laissez-faire harms: Algorithmic biases in generative language models. arXiv preprint arXiv:2404.07475.

SINGLA, A., SUKHAREVSKY, A., YEE, L., CHUI, M. & HALL, B. 2025. The state of AI: How organizations are rewiring to capture value. McKinsey & Company, 12.

SORICH, M. J., MANGONI, A. A., BACCHI, S., MENZ, B. D. & HOPKINS, A. M. 2024. The Triage and Diagnostic Accuracy of Frontier Large Language Models: Updated Comparison to Physician Performance. J Med Internet Res, 26, e67409.

SUKHAREVSKY, A., KERR, D., HJARTAR, K., HäMäLäINEN, L., BOUT, S., LEO, V. D. & DAGORRET, G. 2025. Seizing the agentic AI advantage. London: McKinsey & Company.

TEAM, G., GEORGIEV, P., LEI, V. I., BURNELL, R., BAI, L., GULATI, A., TANZER, G., VINCENT, D., PAN, Z. & WANG, S. 2024. Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context. arXiv preprint arXiv:2403.05530.

TICHENOR, P. J., DONOHUE, G. A. & OLIEN, C. N. 1970. Mass media flow and differential growth in knowledge. Public opinion quarterly, 34, 159-170.

TIEDENS, L. Z. & FRAGALE, A. R. 2003. Power moves: complementarity in dominant and submissive nonverbal behavior. Journal of personality and social psychology, 84, 558.

VASWANI, A., SHAZEER, N., PARMAR, N., USZKOREIT, J., JONES, L., GOMEZ, A. N., KAISER, Ł. & POLOSUKHIN, I. 2017. Attention is all you need. Advances in neural information processing systems, 30.

WALTER, Y. 2024. Embracing the future of Artificial Intelligence in the classroom: the relevance of AI literacy, prompt engineering, and critical thinking in modern education. International Journal of Educational Technology in Higher Education, 21, 15.

WANG, B., LIANG, X., YANG, J., HUANG, H., WU, S., WU, P., LU, L., MA, Z. & LI, Z. 2023a. Scm: Enhancing large language model with self-controlled memory framework. arXiv e-prints, arXiv: 2304.13343.

WANG, Q. & GOEL, A. K. 2022. Mutual theory of mind for human-AI communication. arXiv preprint arXiv:2210.03842.

WANG, Q., WU, J., TANG, Z., LUO, B., CHEN, N., CHEN, W. & HE, B. 2025. What Limits LLM-based Human Simulation: LLMs or Our Design? arXiv preprint arXiv:2501.08579.

WANG, S., COOPER, N. & EBY, M. 2024a. From human-centered to social-centered artificial intelligence: Assessing ChatGPT's impact through disruptive events. Big Data & Society, 11, 20539517241290220.

WANG, W., DONG, L., CHENG, H., LIU, X., YAN, X., GAO, J. & WEI, F. 2023b. Augmenting language models with long-term memory. Advances in Neural Information Processing Systems, 36, 74530-74543.

WANG, Y., HU, M., TA, N., SUN, H., GUO, Y., ZHOU, W., GUO, Y., ZHANG, W. & FENG, J. 2024b. Large language models and their application in government affairs. Journal of Tsinghua University (Science and Technology), 64, 649-658.

WANG, Z., SONG, R., SHEN, C., YIN, S., SONG, Z., BATTU, B., SHI, L., JIA, D., RAHWAN, T. & HU, S. 2024c. Overcoming the Machine Penalty with Imperfectly Fair AI Agents. arXiv preprint arXiv:2410.03724.

WEI, J., WANG, X., SCHUURMANS, D., BOSMA, M., XIA, F., CHI, E., LE, Q. V. & ZHOU, D. 2022. Chain-of-thought prompting elicits reasoning in large language models. Advances in neural information processing systems, 35, 24824-24837.

WEN, B., WANG, C., HAN, Q., NOREL, R., LIU, J., STAPPENBECK, T. & ROGERS, J. L. 2025. Voice-based AI Agents: Filling the Economic Gaps in Digital Health Delivery. arXiv preprint arXiv:2507.16229.

WILLIAMS, S., SCHUETT, J. & ANDERLJUNG, M. 2025. On Regulating Downstream AI Developers. European Journal of Risk Regulation, 1-29.

WU, T., ZHAO, Y. & ZHENG, Z. 2024. Never Miss A Beat: An Efficient Recipe for Context Window Extension of Large Language Models with Consistent" Middle" Enhancement. arXiv e-prints, arXiv: 2406.07138.

XU, Z., JAIN, S. & KANKANHALLI, M. 2024. Hallucination is inevitable: An innate limitation of large language models. arXiv preprint arXiv:2401.11817.

YAN, B., ZHOU, Z., ZHANG, L., ZHANG, L., ZHOU, Z., MIAO, D., LI, Z., LI, C. & ZHANG, X. 2025. Beyond self-talk: A communication-centric survey of llm-based multi-agent systems. arXiv preprint arXiv:2502.14321.

YANG, A., LI, A., YANG, B., ZHANG, B., HUI, B., ZHENG, B., YU, B., GAO, C., HUANG, C. & LV, C. 2025. Qwen3 technical report. arXiv preprint arXiv:2505.09388.

YIN, Z., WANG, H., HORIO, K., KAWAHARA, D. & SEKINE, S. Should we respect LLMs? A cross-lingual study on the influence of prompt politeness on LLM performance. Proceedings of the Second Workshop on Social Influence in Conversations (SICon 2024), 2024. 9-35.

YU, R. 2016. Stress potentiates decision biases: A stress induced deliberation-to-intuition (SIDI) model. Neurobiol Stress, 3, 83-95.

ZHANG, G., JIN, Q., ZHOU, Y., WANG, S., IDNAY, B., LUO, Y., PARK, E., NESTOR, J. G., SPOTNITZ, M. E. & SOROUSH, A. 2024a. Closing the gap between open source and commercial large language models for medical evidence summarization. NPJ digital medicine, 7, 239.

ZHANG, L., ERGEN, T., LOGESWARAN, L., LEE, M. & JURGENS, D. 2024b. Sprig: Improving large language model performance by system prompt optimization. arXiv preprint arXiv:2410.14826.

ZHANG, Z., LI, B., YAN, C., FURUICHI, K. & TODO, Y. 2025. Double Attention: An Optimization Method for the Self-Attention Mechanism Based on Human Attention. Biomimetics (Basel), 10.

ZHAO, P., ZHANG, H., YU, Q., WANG, Z., GENG, Y., FU, F., YANG, L., ZHANG, W., JIANG, J. & CUI, B. 2024. Retrieval-augmented generation for ai-generated content: A survey. arXiv preprint arXiv:2402.19473.

ZHONG, W., GUO, L., GAO, Q., YE, H. & WANG, Y. Memorybank: Enhancing large language models with long-term memory. Proceedings of the AAAI Conference on Artificial Intelligence, 2024. 19724-19731.

ZHOU, L., GAO, J., LI, D. & SHUM, H.-Y. 2020. The design and implementation of xiaoice, an empathetic social chatbot. Computational Linguistics, 46, 53-93.