NUK - logo

Rezultati iskanja

Osnovno iskanje    Ukazno iskanje   

Trenutno NISTE avtorizirani za dostop do e-virov NUK. Za polni dostop se PRIJAVITE.

1 2
zadetkov: 18
11.
  • HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
    Mantas Mazeika; Long, Phan; Yin, Xuwang ... arXiv.org, 02/2024
    Paper, Journal Article
    Odprti dostop

    Automated red teaming holds substantial promise for uncovering and mitigating the risks associated with the malicious use of large language models (LLMs), yet the field lacks a standardized ...
Celotno besedilo
12.
  • Measuring Massive Multitask Language Understanding
    Hendrycks, Dan; Burns, Collin; Basart, Steven ... arXiv (Cornell University), 01/2021
    Paper, Journal Article
    Odprti dostop

    We propose a new test to measure a text model's multitask accuracy. The test covers 57 tasks including elementary mathematics, US history, computer science, law, and more. To attain high accuracy on ...
Celotno besedilo
13.
  • How Would The Viewer Feel? Estimating Wellbeing From Video Scenarios
    Mantas Mazeika; Tang, Eric; Zou, Andy ... arXiv (Cornell University), 10/2022
    Paper, Journal Article
    Odprti dostop

    In recent years, deep neural networks have demonstrated increasingly strong abilities to recognize objects and activities in videos. However, as video understanding becomes widely used in real-world ...
Celotno besedilo
14.
  • Representation Engineering: A Top-Down Approach to AI Transparency
    Zou, Andy; Long, Phan; Chen, Sarah ... arXiv.org, 10/2023
    Paper, Journal Article
    Odprti dostop

    In this paper, we identify and characterize the emerging area of representation engineering (RepE), an approach to enhancing the transparency of AI systems that draws on insights from cognitive ...
Celotno besedilo
15.
  • Measuring Coding Challenge Competence With APPS
    Hendrycks, Dan; Basart, Steven; Kadavath, Saurav ... arXiv (Cornell University), 11/2021
    Paper, Journal Article
    Odprti dostop

    While programming is one of the most broadly applicable skills in modern society, modern machine learning models still cannot code solutions to basic problems. Despite its importance, there has been ...
Celotno besedilo
16.
  • The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization
    Hendrycks, Dan; Basart, Steven; Mu, Norman ... arXiv (Cornell University), 07/2021
    Paper, Journal Article
    Odprti dostop

    We introduce four new real-world distribution shift datasets consisting of changes in image style, image blurriness, geographic location, camera operation, and more. With our new datasets, we take ...
Celotno besedilo
17.
  • The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning
    Li, Nathaniel; Pan, Alexander; Gopal, Anjali ... arXiv.org, 05/2024
    Paper, Journal Article
    Odprti dostop

    The White House Executive Order on Artificial Intelligence highlights the risks of large language models (LLMs) empowering malicious actors in developing biological, cyber, and chemical weapons. To ...
Celotno besedilo
18.
  • DIODE: A Dense Indoor and Outdoor DEpth Dataset
    Vasiljevic, Igor; Kolkin, Nick; Zhang, Shanyi ... arXiv (Cornell University), 08/2019
    Paper, Journal Article
    Odprti dostop

    We introduce DIODE, a dataset that contains thousands of diverse high resolution color images with accurate, dense, long-range depth measurements. DIODE (Dense Indoor/Outdoor DEpth) is the first ...
Celotno besedilo
1 2
zadetkov: 18

Nalaganje filtrov