(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–8 of 8 results for author: McLean, E

.
  1. arXiv:2406.12843  [pdf, other

    cs.LG cs.AI stat.ML

    Can Go AIs be adversarially robust?

    Authors: Tom Tseng, Euan McLean, Kellin Pelrine, Tony T. Wang, Adam Gleave

    Abstract: Prior work found that superhuman Go AIs like KataGo can be defeated by simple adversarial strategies. In this paper, we study if simple defenses can improve KataGo's worst-case performance. We test three natural defenses: adversarial training on hand-constructed positions, iterated adversarial training, and changing the network architecture. We find that some of these defenses are able to protect… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 67 pages

  2. arXiv:2312.14302  [pdf, other

    cs.CR cs.AI cs.CL cs.LG

    Exploiting Novel GPT-4 APIs

    Authors: Kellin Pelrine, Mohammad Taufeeque, Michał Zając, Euan McLean, Adam Gleave

    Abstract: Language model attacks typically assume one of two extreme threat models: full white-box access to model weights, or black-box access limited to a text generation API. However, real-world APIs are often more flexible than just text generation: these APIs expose "gray-box" access leading to new threat vectors. To explore this, we red-team three new functionalities exposed in the GPT-4 APIs: fine-tu… ▽ More

    Submitted 4 August, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: 10 pages, 1 figure, 4 tables

    ACM Class: I.2.7

  3. arXiv:2306.09479  [pdf, other

    cs.CL cs.AI cs.CY

    Inverse Scaling: When Bigger Isn't Better

    Authors: Ian R. McKenzie, Alexander Lyzhov, Michael Pieler, Alicia Parrish, Aaron Mueller, Ameya Prabhu, Euan McLean, Aaron Kirtland, Alexis Ross, Alisa Liu, Andrew Gritsevskiy, Daniel Wurgaft, Derik Kauffman, Gabriel Recchia, Jiacheng Liu, Joe Cavanagh, Max Weiss, Sicong Huang, The Floating Droid, Tom Tseng, Tomasz Korbak, Xudong Shen, Yuhui Zhang, Zhengping Zhou, Najoung Kim , et al. (2 additional authors not shown)

    Abstract: Work on scaling laws has found that large language models (LMs) show predictable improvements to overall loss with increased scale (model size, training data, and compute). Here, we present evidence for the claim that LMs may show inverse scaling, or worse task performance with increased scale, e.g., due to flaws in the training objective and data. We present empirical evidence of inverse scaling… ▽ More

    Submitted 12 May, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: Published in TMLR (2023), 39 pages

    Journal ref: Transactions on Machine Learning Research (TMLR), 10/2023, https://openreview.net/forum?id=DwgRm72GQF

  4. arXiv:2212.11281  [pdf, other

    cs.CL cs.AI cs.LG

    Language models are better than humans at next-token prediction

    Authors: Buck Shlegeris, Fabien Roger, Lawrence Chan, Euan McLean

    Abstract: Current language models are considered to have sub-human capabilities at natural language tasks like question-answering or writing code. However, language models are not trained to perform well at these tasks, they are trained to accurately predict the next token given previous tokes in tokenized text. It is not clear whether language models are better or worse than humans at next token prediction… ▽ More

    Submitted 15 July, 2024; v1 submitted 21 December, 2022; originally announced December 2022.

    Comments: Edit: TMLR 2024, more analysis of the results were added

  5. $B_s\to D_s \ellνにゅー$ Form Factors for the full $q^2$ range from Lattice QCD with non-perturbatively normalized currents

    Authors: E. McLean, C. T. H. Davies, J. Koponen, A. T. Lytle

    Abstract: We present a lattice QCD determination of the $B_s \to D_s \ellνにゅー$ scalar and vector form factors over the full physical range of momentum transfer. The result is derived from correlation functions computed using the Highly Improved Staggered Quark (HISQ) formalism, on the second generation MILC gluon ensembles accounting for up, down, strange and charm contributions from the sea. We calculate corr… ▽ More

    Submitted 14 June, 2019; v1 submitted 3 June, 2019; originally announced June 2019.

    Comments: 22 pages, 17 figures

    Journal ref: Phys. Rev. D 101, 074513 (2020)

  6. Lattice QCD form factor for $B_s\to D_s^* lνにゅー$ at zero recoil with non-perturbative current renormalisation

    Authors: E. McLean, C. T. H. Davies, A. T. Lytle, J. Koponen

    Abstract: We present details of a lattice QCD calculation of the $B_s\to D_s^*$ axial form factor at zero recoil using the Highly Improved Staggered Quark (HISQ) formalism on the second generation MILC gluon ensembles that include up, down, strange and charm quarks in the sea. Using the HISQ action for all valence quarks means that the lattice axial vector current that couples to the $W$ can be renormalized… ▽ More

    Submitted 31 May, 2019; v1 submitted 3 April, 2019; originally announced April 2019.

    Comments: 18 pages, 9 figures

    Journal ref: Phys. Rev. D 99, 114512 (2019)

  7. arXiv:1901.04979  [pdf, other

    hep-lat hep-ph

    $B_s\to D_s^{(*)}lνにゅー$ Form Factors with Heavy HISQ Quarks

    Authors: E. McLean, C. T. H. Davies, A. T. Lytle, J. Koponen

    Abstract: We present progress on an ongoing calculation of the $B_s\to D_s^{(*)} l νにゅー$ form factors calculated on the $n_f=2+1+1$ MILC ensembles and using the Highly Improved Staggered Quark action for all valence quarks. We perform the calculation at a range of $b$ quark masses (and lattice spacings) so that we can extrapolate to the physical $b$-quark mass.

    Submitted 15 January, 2019; originally announced January 2019.

    Comments: 6 pages, 3 figures, proceedings of the 36th Annual International Symposium on Lattice Field Theory - LATTICE2018

    Journal ref: PoS(LATTICE2018)282

  8. The $B_{(s)} \to D_{(s)}lνにゅー$ Decay with Highly Improved Staggered Quarks and NRQCD

    Authors: Euan McLean, Christine T. H. Davies, Brian Colquhoun, Andrew Lytle

    Abstract: We report on progress of a lattice QCD calculation of the $B\to Dlνにゅー$ and $B_s\to D_s lνにゅー$ semileptonic form factors. We use a relativistic staggered action (HISQ) for light and charm quarks, and an improved non-relativistic (NRQCD) action for bottom, on the second generation MILC ensembles.

    Submitted 9 November, 2017; originally announced November 2017.

    Comments: Presented at Lattice 2017, the 35th International Symposium on Lattice Field Theory at Granada, Spain (18-24 June 2017)