We introduce a modular recurrent neural architecture, which learns distributed, generative temporal models of biological motion. It separately encodes visual and proprioceptive (angular) biological motions by means of autoencoders, structuring the respective postures, motion directions, and motion magnitudes. The submodal encoders are interdependent in that they temporally predict each other's next autoencoder states. As a result, distributed attractor states can develop from self-generated motions. We show that the architecture is able to synchronize its activities across modalities towards overall consistent action-encoding attractors. Moreover, the developing spatial and temporal structures can complete partially observable actions, e.g., when only visual information is provided. Furthermore, we show that the network is capable of simulating whole-body actions without any sensory stimulation, thus imagining unfolding actions. Finally, we show that the network is able to infer the visual perspective on a biological motion. Thus, the neural architecture enables embodied perspective taking and action inference.
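
To make the described coupling concrete, the following is a minimal sketch, not the authors' implementation, of the core idea: two modality-specific autoencoders (visual and proprioceptive) whose latent states cross-predict each other's next latent states. All layer sizes, module names, and the choice of GRU cells and mean-squared losses are illustrative assumptions.

```python
# Hypothetical sketch of cross-modal predictive autoencoders;
# sizes, module names, and losses are assumptions, not the paper's spec.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ModalAutoencoder(nn.Module):
    """Encodes one modality (e.g., visual posture/motion) into a latent code."""
    def __init__(self, input_dim, latent_dim, hidden=64):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(input_dim, hidden), nn.Tanh(),
                                     nn.Linear(hidden, latent_dim))
        self.decoder = nn.Sequential(nn.Linear(latent_dim, hidden), nn.Tanh(),
                                     nn.Linear(hidden, input_dim))

    def forward(self, x):
        z = self.encoder(x)
        return z, self.decoder(z)

class CrossModalPredictor(nn.Module):
    """Each modality's latent state predicts the other modality's next latent state."""
    def __init__(self, vis_dim, prop_dim, latent_dim):
        super().__init__()
        self.vis_ae = ModalAutoencoder(vis_dim, latent_dim)
        self.prop_ae = ModalAutoencoder(prop_dim, latent_dim)
        self.vis_to_prop = nn.GRUCell(latent_dim, latent_dim)  # visual -> next proprioceptive code
        self.prop_to_vis = nn.GRUCell(latent_dim, latent_dim)  # proprioceptive -> next visual code

    def step_loss(self, vis_t, prop_t, vis_next, prop_next):
        z_vis, vis_rec = self.vis_ae(vis_t)
        z_prop, prop_rec = self.prop_ae(prop_t)
        z_vis_next, _ = self.vis_ae(vis_next)
        z_prop_next, _ = self.prop_ae(prop_next)
        pred_prop = self.vis_to_prop(z_vis, z_prop)   # predict next proprioceptive latent
        pred_vis = self.prop_to_vis(z_prop, z_vis)    # predict next visual latent
        return (F.mse_loss(vis_rec, vis_t) + F.mse_loss(prop_rec, prop_t)
                + F.mse_loss(pred_prop, z_prop_next.detach())
                + F.mse_loss(pred_vis, z_vis_next.detach()))

# Example usage with dummy data (dimensions are placeholders)
model = CrossModalPredictor(vis_dim=30, prop_dim=12, latent_dim=8)
vis_t, vis_next = torch.randn(16, 30), torch.randn(16, 30)
prop_t, prop_next = torch.randn(16, 12), torch.randn(16, 12)
loss = model.step_loss(vis_t, prop_t, vis_next, prop_next)
loss.backward()
```

In such a setup, imagining or completing a partially observed action would amount to iterating the cross-modal predictions in place of the missing sensory stream; this closed-loop use is only gestured at here, not implemented.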