Large Language Models News & Discussions

bnew · Apr 21, 2023

Macallik86 said:
Updated the title to be more inclusive to other Large Language models out there

add chatgpt in the tags and other related keywords.

bnew · Apr 24, 2023

bnew said:
https://arxiv.org/pdf/2302.07459.pdf

The Capacity for Moral Self-Correction in Large Language Models

snippet:

Abstract
We test the hypothesis that language models trained with reinforcement learning from human feedback (RLHF) have the capability to “morally self-correct”—to avoid producing harmful outputs—if instructed to do so. We find strong evidence in support of this hypothesis across three different experiments, each of which reveal different facets of moral self-correction. We find that the capability for moral self-correction emerges at 22B model parameters, and typically improves with increasing model size and RLHF training. We believe that at this level of scale, language models obtain two capabilities that they can use for moral self-correction: (1) they can follow instructions and (2) they can learn complex normative concepts of harm like stereotyping, bias, and discrimination. As such, they can follow instructions to avoid certain kinds of morally harmful outputs. We believe our results are cause for cautious optimism regarding the ability to train language models to abide by ethical principles.

bnew · Apr 29, 2023

MLC LLM | Home

mlc.ai

bnew · Apr 29, 2023

https://web.archive.org/web/20230430005058/https://twitter.com/lupantech/status/1652022897563795456

@lupantech

65B LLaMA-Adapter-V2 code & checkpoint are NOW ready at GitHub - ZrrSkywalker/LLaMA-Adapter: Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters!

Big update enhancing multimodality & chatbot.

LLaMA-Adapter-V2 surpasses #ChatGPT in response quality (102%:100%) & beats #Vicuna in win-tie-lost (50:14).

️Thanks to Peng Gao &
@opengvlab
!

LLaMA-Adapter/llama_adapter_v2_chat65b at main · ZrrSkywalker/LLaMA-Adapter

Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters - LLaMA-Adapter/llama_adapter_v2_chat65b at main · ZrrSkywalker/LLaMA-Adapter

github.com

bnew · Apr 29, 2023

https://web.archive.org/save/https://twitter.com/gdb/status/1652369023609470976

edit:
copied the prompt using

Image-to-Multilingual-OCR

Gradio - a Hugging Face Space by awacke1

Act as a dual PhD in sports psychology and neuroscience: Your job is to design a system to someone addicted to something that will positively impact their life; in this case, starting an exercise habit (running): Create a 60 plan using research-backed principles to have anyone--even someone who hates running--build a running habit if they follow the plan: Incorporate research such as BF Skinner's study of addiction, BJ Fogg's Behavioral Model; and similar research on addiction and compulsion.

Outline a week-by-week plan; but give a detailed day-by-day plan for the first week:

bnew · Apr 29, 2023

https://web.archive.org/web/20230430012709/https://twitter.com/DrEalmutairi/status/1652272468105543681

https://unesdoc.unesco.org/ark:/48223/pf0000385146/PDF/385146eng.pdf.multi

ChatGPT and Artificial Intelligence in higher education
Quick start guide Portrait created by DALL.E 2, an AI system that can create realistic images and art in response to a text description. The AI was asked to produce an impressionist portrait of how artificial intelligence would look going to university. Concept by UNESCO IESALC

bnew · Apr 29, 2023

https://web.archive.org/web/20230430015051/https://twitter.com/jbrowder1/status/1652187049255120897

bnew · Apr 29, 2023

https://web.archive.org/web/20230430015207/https://twitter.com/gdb/status/1652411976713371649

Voice of Reason · Apr 29, 2023

I'm trying to work through the fast.ai course to learn this stuff

bnew · Apr 29, 2023

https://web.archive.org/web/20230430010034/https://twitter.com/emollick/status/1652170706312896512

bnew · Apr 29, 2023

https://web.archive.org/web/20230430022903/https://twitter.com/rasbt/status/1652288118924644352

Understanding Large Language Models

A Cross-Section of the Most Relevant Literature To Get Up to Speed

magazine.sebastianraschka.com

bnew · Apr 29, 2023

https://web.archive.org/web/20230430023005/https://twitter.com/AiBreakfast/status/1652368130684121089

bnew · Apr 29, 2023

https://web.archive.org/web/20230430023719/https://twitter.com/StabilityAI/status/1651984145944547329

bnew · Apr 29, 2023

https://web.archive.org/web/20230430024108/https://twitter.com/madiator/status/1652326887589556224

[2304.12210] A Cookbook of Self-Supervised Learning

A Cookbook of Self-Supervised Learning

Randall Balestriero, Mark Ibrahim, Vlad Sobal, Ari Morcos, Shashank Shekhar, Tom Goldstein, Florian Bordes, Adrien Bardes, Gregoire Mialon, Yuandong Tian, Avi Schwarzschild, Andrew Gordon Wilson, Jonas Geiping, Quentin Garrido, Pierre Fernandez, Amir Bar, Hamed Pirsiavash, Yann LeCun, Micah Goldblum

Self-supervised learning, dubbed the dark matter of intelligence, is a promising path to advance machine learning. Yet, much like cooking, training SSL methods is a delicate art with a high barrier to entry. While many components are familiar, successfully training a SSL method involves a dizzying set of choices from the pretext tasks to training hyper-parameters. Our goal is to lower the barrier to entry into SSL research by laying the foundations and latest SSL recipes in the style of a cookbook. We hope to empower the curious researcher to navigate the terrain of methods, understand the role of the various knobs, and gain the know-how required to explore how delicious SSL can be.

bnew · Apr 30, 2023

https://web.archive.org/web/20230430105553/https://twitter.com/carperai/status/1652025709953716224

CarperAI/stable-vicuna-13b-delta · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co

Large Language Models News & Discussions

More options

bnew

Veteran

bnew

Veteran

bnew

Veteran

MLC LLM | Home

bnew

Veteran

LLaMA-Adapter/llama_adapter_v2_chat65b at main · ZrrSkywalker/LLaMA-Adapter

bnew

Veteran

bnew

Veteran

bnew

Veteran

bnew

Veteran

Voice of Reason

Veteran

bnew

Veteran

bnew

Veteran

Understanding Large Language Models

bnew

Veteran

bnew

Veteran

bnew

Veteran

A Cookbook of Self-Supervised Learning

bnew

Veteran

CarperAI/stable-vicuna-13b-delta · Hugging Face

Large Language Models News & Discussions

Veteran

Veteran

Veteran

Veteran

Veteran

Veteran

Veteran

Veteran

Veteran

Veteran

Veteran

Veteran

Veteran

Veteran

A Cookbook of Self-Supervised Learning​

Veteran

A Cookbook of Self-Supervised Learning