Akademska digitalna zbirka SLovenije - logo

Rezultati iskanja

Osnovno iskanje    Ukazno iskanje   

Trenutno NISTE avtorizirani za dostop do e-virov konzorcija SI. Za polni dostop se PRIJAVITE.

1 2 3 4
zadetkov: 36
11.
  • Scaling Out-of-Distribution Detection for Real-World Settings
    Hendrycks, Dan; Basart, Steven; Mantas Mazeika ... arXiv (Cornell University), 05/2022
    Paper, Journal Article
    Odprti dostop

    Detecting out-of-distribution examples is important for safety-critical machine learning applications such as detecting novel biological phenomena and self-driving cars. However, existing research ...
Celotno besedilo
Dostopno za: NUK, UL, UM, UPUK
12.
  • HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
    Mantas Mazeika; Long, Phan; Yin, Xuwang ... arXiv (Cornell University), 02/2024
    Paper, Journal Article
    Odprti dostop

    Automated red teaming holds substantial promise for uncovering and mitigating the risks associated with the malicious use of large language models (LLMs), yet the field lacks a standardized ...
Celotno besedilo
Dostopno za: NUK, UL, UM, UPUK
13.
  • DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models
    Wang, Boxin; Chen, Weixin; Pei, Hengzhi ... arXiv (Cornell University), 02/2024
    Paper, Journal Article
    Odprti dostop

    Generative Pre-trained Transformer (GPT) models have exhibited exciting progress in their capabilities, capturing the interest of practitioners and the public alike. Yet, while the literature on the ...
Celotno besedilo
Dostopno za: NUK, UL, UM, UPUK
14.
  • What Would Jiminy Cricket Do? Towards Agents That Behave Morally
    Hendrycks, Dan; Mantas Mazeika; Zou, Andy ... arXiv (Cornell University), 02/2022
    Paper, Journal Article
    Odprti dostop

    When making everyday decisions, people are guided by their conscience, an internal sense of right and wrong. By contrast, artificial agents are currently not endowed with a moral sense. As a ...
Celotno besedilo
Dostopno za: NUK, UL, UM, UPUK
15.
  • Measuring Massive Multitask Language Understanding
    Hendrycks, Dan; Burns, Collin; Basart, Steven ... arXiv (Cornell University), 01/2021
    Paper, Journal Article
    Odprti dostop

    We propose a new test to measure a text model's multitask accuracy. The test covers 57 tasks including elementary mathematics, US history, computer science, law, and more. To attain high accuracy on ...
Celotno besedilo
Dostopno za: NUK, UL, UM, UPUK
16.
  • How Would The Viewer Feel? Estimating Wellbeing From Video Scenarios
    Mantas Mazeika; Tang, Eric; Zou, Andy ... arXiv (Cornell University), 10/2022
    Paper, Journal Article
    Odprti dostop

    In recent years, deep neural networks have demonstrated increasingly strong abilities to recognize objects and activities in videos. However, as video understanding becomes widely used in real-world ...
Celotno besedilo
Dostopno za: NUK, UL, UM, UPUK
17.
  • Forecasting Future World Events with Neural Networks
    Zou, Andy; Xiao, Tristan; Jia, Ryan ... arXiv (Cornell University), 10/2022
    Paper, Journal Article
    Odprti dostop

    Forecasting future world events is a challenging but valuable task. Forecasts of climate, geopolitical conflict, pandemics and economic indicators help shape policy and decision making. In these ...
Celotno besedilo
Dostopno za: NUK, UL, UM, UPUK
18.
  • Deep Anomaly Detection with Outlier Exposure
    Hendrycks, Dan; Mantas Mazeika; Dietterich, Thomas arXiv (Cornell University), 01/2019
    Paper, Journal Article
    Odprti dostop

    It is important to detect anomalous inputs when deploying machine learning systems. The use of larger and more complex inputs in deep learning magnifies the difficulty of distinguishing between ...
Celotno besedilo
Dostopno za: NUK, UL, UM, UPUK
19.
  • Using Trusted Data to Train Deep Networks on Labels Corrupted by Severe Noise
    Hendrycks, Dan; Mantas Mazeika; Wilson, Duncan ... arXiv (Cornell University), 01/2019
    Paper, Journal Article
    Odprti dostop

    The growing importance of massive datasets used for deep learning makes robustness to label noise a critical property for classifiers to have. Sources of label noise include automatic labeling, ...
Celotno besedilo
Dostopno za: NUK, UL, UM, UPUK
20.
  • Representation Engineering: A Top-Down Approach to AI Transparency
    Zou, Andy; Long, Phan; Chen, Sarah ... arXiv (Cornell University), 10/2023
    Paper, Journal Article
    Odprti dostop

    In this paper, we identify and characterize the emerging area of representation engineering (RepE), an approach to enhancing the transparency of AI systems that draws on insights from cognitive ...
Celotno besedilo
Dostopno za: NUK, UL, UM, UPUK
1 2 3 4
zadetkov: 36

Nalaganje filtrov