Skip to main content

Showing 1–8 of 8 results for author: Mattern, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2305.18462  [pdf, other

    cs.CL cs.CR cs.LG

    Membership Inference Attacks against Language Models via Neighbourhood Comparison

    Authors: Justus Mattern, Fatemehsadat Mireshghallah, Zhijing Jin, Bernhard Sch�lkopf, Mrinmaya Sachan, Taylor Berg-Kirkpatrick

    Abstract: Membership Inference attacks (MIAs) aim to predict whether a data sample was present in the training data of a machine learning model or not, and are widely used for assessing the privacy risks of language models. Most existing attacks rely on the observation that models tend to assign higher probabilities to their training samples than non-training points. However, simple thresholding of the mode… ▽ More

    Submitted 7 August, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

  2. arXiv:2305.09859  [pdf, other

    cs.CL cs.LG

    Smaller Language Models are Better Black-box Machine-Generated Text Detectors

    Authors: Niloofar Mireshghallah, Justus Mattern, Sicun Gao, Reza Shokri, Taylor Berg-Kirkpatrick

    Abstract: With the advent of fluent generative language models that can produce convincing utterances very similar to those written by humans, distinguishing whether a piece of text is machine-generated or human-written becomes more challenging and more important, as such models could be used to spread misinformation, fake news, fake reviews and to mimic certain authors and figures. To this end, there have… ▽ More

    Submitted 24 February, 2024; v1 submitted 16 May, 2023; originally announced May 2023.

  3. arXiv:2305.01764  [pdf, other

    cs.CL cs.AI cs.LG stat.ME

    Psychologically-Inspired Causal Prompts

    Authors: Zhiheng Lyu, Zhijing Jin, Justus Mattern, Rada Mihalcea, Mrinmaya Sachan, Bernhard Schoelkopf

    Abstract: NLP datasets are richer than just input-output pairs; rather, they carry causal relations between the input and output variables. In this work, we take sentiment classification as an example and look into the causal relations between the review (X) and sentiment (Y). As psychology studies show that language can affect emotion, different psychological processes are evoked when a person first makes… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

  4. arXiv:2302.08927  [pdf, other

    cs.CR cs.LG

    Unique Identification of 50,000+ Virtual Reality Users from Head & Hand Motion Data

    Authors: Vivek Nair, Wenbo Guo, Justus Mattern, Rui Wang, James F. O'Brien, Louis Rosenberg, Dawn Song

    Abstract: With the recent explosive growth of interest and investment in virtual reality (VR) and the so-called "metaverse," public attention has rightly shifted toward the unique security and privacy threats that these platforms may pose. While it has long been known that people reveal information about themselves via their motion, the extent to which this makes an individual globally identifiable within v… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

    Journal ref: 32nd USENIX Security Symposium (2023) 895-910

  5. arXiv:2212.10678  [pdf, other

    cs.CL cs.LG

    Causally Testing Gender Bias in LLMs: A Case Study on Occupational Bias

    Authors: Yuen Chen, Vethavikashini Chithrra Raghuram, Justus Mattern, Rada Mihalcea, Zhijing Jin

    Abstract: Generated texts from large language models (LLMs) have been shown to exhibit a variety of harmful, human-like biases against various demographics. These findings motivate research efforts aiming to understand and measure such effects. This paper introduces a causal formulation for bias measurement in generative language models. Based on this theoretical foundation, we outline a list of desiderata… ▽ More

    Submitted 20 October, 2024; v1 submitted 20 December, 2022; originally announced December 2022.

  6. arXiv:2210.13918  [pdf, other

    cs.LG cs.CL cs.CR

    Differentially Private Language Models for Secure Data Sharing

    Authors: Justus Mattern, Zhijing Jin, Benjamin Weggenmann, Bernhard Schoelkopf, Mrinmaya Sachan

    Abstract: To protect the privacy of individuals whose data is being shared, it is of high importance to develop methods allowing researchers and companies to release textual data while providing formal privacy guarantees to its originators. In the field of NLP, substantial efforts have been directed at building mechanisms following the framework of local differential privacy, thereby anonymizing individual… ▽ More

    Submitted 26 October, 2022; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: Accepted at EMNLP 2022

  7. arXiv:2205.02130  [pdf, other

    cs.CR cs.CL cs.LG

    The Limits of Word Level Differential Privacy

    Authors: Justus Mattern, Benjamin Weggenmann, Florian Kerschbaum

    Abstract: As the issues of privacy and trust are receiving increasing attention within the research community, various attempts have been made to anonymize textual data. A significant subset of these approaches incorporate differentially private mechanisms to perturb word embeddings, thus replacing individual words in a sentence. While these methods represent very important contributions, have various advan… ▽ More

    Submitted 2 May, 2022; originally announced May 2022.

  8. arXiv:2203.08085  [pdf, other

    cs.CL

    Measuring the Impact of (Psycho-)Linguistic and Readability Features and Their Spill Over Effects on the Prediction of Eye Movement Patterns

    Authors: Daniel Wiechmann, Yu Qiao, Elma Kerz, Justus Mattern

    Abstract: There is a growing interest in the combined use of NLP and machine learning methods to predict gaze patterns during naturalistic reading. While promising results have been obtained through the use of transformer-based language models, little work has been undertaken to relate the performance of such models to general text characteristics. In this paper we report on experiments with two eye-trackin… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

    Comments: accepted at ACL 2022