The A.I Megathread (LLM , GPT , Development)

bnew · May 17, 2023

awesome-ml/llm-model-list.md at master · underlines/awesome-ml

Curated list of useful LLM / Analytics / Datascience resources - underlines/awesome-ml

github.com

Llama and derivatives

Curated list of llama and similar models.

Available as a Model Google Sheet

Open LLM Leaderboard - a Hugging Face Space by HuggingFaceH4

Discover amazing ML apps made by the community

huggingface.co

Open LLM Leaderboard

With the plethora of large language models (LLMs) and chatbots being released week upon week, often with grandiose claims of their performance, it can be hard to filter out the genuine progress that is being made by the open-source community and which model is the current state of the art. The Open LLM Leaderboard aims to track, rank and evaluate LLMs and chatbots as they are released. We evaluate models on 4 key benchmarks from the Eleuther AI Language Model Evaluation Harness, a unified framework to test generative language models on a large number of different evaluation tasks. A key advantage of this leaderboard is that anyone from the community can submit a model for automated evaluation on the GPU cluster, as long as it is a Transformers model with weights on the Hub. We also support evaluation of models with delta-weights for non-commercial licensed models, such as LLaMa.

Evaluation is performed against 4 popular benchmarks:

AI2 Reasoning Challenge (25-shot) - a set of grade-school science questions.
HellaSwag (10-shot) - a test of commonsense inference, which is easy for humans (~95%) but challenging for SOTA models.
MMLU (5-shot) - a test to measure a text model’s multitask accuracy. The test covers 57 tasks including elementary mathematics, US history, computer science, law, and more.
TruthfulQA (0-shot) - a benchmark to measure whether a language model is truthful in generating answers to questions.

We chose these benchmarks as they test a variety of reasoning and general knowledge across a wide variety of fields in 0-shot and few-shot settings.
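For anyone wondering what "25-shot" or "0-shot" means in the list above: the model is shown k solved examples before the question it is actually scored on. Here is a minimal sketch of that idea. The function name, the "Question:/Answer:" layout, and the sample questions are all made up for illustration and are not the evaluation harness's actual prompt format.

```python
from typing import List, Tuple


def build_few_shot_prompt(
    examples: List[Tuple[str, str]], question: str, k: int
) -> str:
    """Prepend the first k solved (question, answer) pairs to the target question.

    Hypothetical helper: the layout here is an assumption, not the format
    used by any particular benchmark or harness.
    """
    if k < 0 or k > len(examples):
        raise ValueError("k must be between 0 and the number of examples")
    # Each solved example ("shot") shows the model a question with its answer.
    blocks = [f"Question: {q}\nAnswer: {a}" for q, a in examples[:k]]
    # The final block is the question being scored, with the answer left open.
    blocks.append(f"Question: {question}\nAnswer:")
    return "\n\n".join(blocks)


# Illustrative data only, not taken from any benchmark.
solved = [
    ("Which gas do plants absorb from the air?", "Carbon dioxide"),
    ("What force pulls objects toward Earth?", "Gravity"),
]
zero_shot = build_few_shot_prompt(solved, "What is the boiling point of water in Celsius?", k=0)
two_shot = build_few_shot_prompt(solved, "What is the boiling point of water in Celsius?", k=2)
print(two_shot)
```

With k=0 the model only sees the bare question, which is the TruthfulQA setting listed above; with k=25 it would see 25 solved examples first.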

bnew · May 18, 2023

NEW GPT4All "Snoozy" - Don't Sleep On The Best Local LLM

12,434 views May 9, 2023
In this video, we review the brand new GPT4All Snoozy model as well as look at some of the new functionality in the GPT4All UI. This model is fast and is a significant improvement from just a few weeks ago with GPT4All-J.

nomic-ai/gpt4all-13b-snoozy · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co

bnew · May 18, 2023

GitHub - nomic-ai/gpt4all: GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use. - nomic-ai/gpt4all

github.com

About

gpt4all: an ecosystem of open-source chatbots trained on a massive collection of clean assistant data including code, stories and dialogue

GPT4All

Open-source assistant-style large language models that run locally on your CPU

GPT4All Website

GPT4All Documentation

Discord

Technical Report 3: GPT4All Snoozy and Groovy

Technical Report 2: GPT4All-J

Technical Report 1: GPT4All

Official Python Bindings

Official Typescript Bindings

Official Chat Interface

Official Web Chat Interface

Official Langchain Backend

GPT4All is made possible by our compute partner Paperspace.

Run on an M1 Mac (not sped up!)

GPT4All: An ecosystem of open-source on-edge large language models.

GPT4All is an ecosystem to train and deploy powerful and customized large language models that run locally on consumer grade CPUs.

The goal is simple - be the best instruction tuned assistant-style language model that any person or enterprise can freely use, distribute and build on.

A GPT4All model is a 3GB - 8GB file that you can download and plug into the GPT4All open-source ecosystem software. Nomic AI supports and maintains this software ecosystem to enforce quality and security alongside spearheading the effort to allow any person or enterprise to easily train and deploy their own on-edge large language models.
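A rough way to see where a "3GB - 8GB" file size could come from is to multiply parameter count by bits stored per weight. This is only a back-of-the-envelope sketch: the bits-per-weight values below are assumptions about quantized storage, not figures from the GPT4All docs, and real files also carry some extra metadata.

```python
def estimated_file_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate model file size in GB (1 GB = 1e9 bytes).

    Assumes every parameter is stored with the same number of bits and
    ignores file headers, vocabulary data, and other overhead.
    """
    return n_params * bits_per_weight / 8 / 1e9


# Parameter counts match the model sizes named in this thread (6B, 7B, 13B);
# the 4-bit and 5-bit settings are assumed quantization levels.
for name, n_params in [("6B", 6e9), ("7B", 7e9), ("13B", 13e9)]:
    for bits in (4, 5):
        size = estimated_file_size_gb(n_params, bits)
        print(f"{name} model at {bits} bits/weight: about {size:.1f} GB")
```

For comparison, the same formula at 16 bits per weight gives 26 GB for a 13B model, which is why the compressed weights matter for running on a home desktop.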

Chat Client

Run any GPT4All model natively on your home desktop with the auto-updating desktop chat client. See GPT4All Website for a full list of open-source models you can run with this powerful desktop application.

Ill-Mind · May 18, 2023

OpenAI launches an official ChatGPT app for iOS

The ChatGPT app is now available on iOS, but limited to U.S. users at launch. The app is free to use, free from ads, and allows voice input.

tcrn.ch

bnew · May 18, 2023

I asked Google Bard to explain a chart that was on https://gpt4all.io/

Performance Benchmarks

Model	BoolQ	PIQA	HellaSwag	WinoGrande	ARC-e	ARC-c	OBQA	Avg
GPT4All-J 6B v1.0	73.4	74.8	63.4	64.7	54.9	36	40.2	58.2
GPT4All-J v1.1-breezy	74	75.1	63.2	63.6	55.4	34.9	38.4	57.8
GPT4All-J v1.2-jazzy	74.8	74.9	63.6	63.8	56.6	35.3	41	58.6
GPT4All-J v1.3-groovy	73.6	74.3	63.8	63.5	57.7	35	38.8	58.1
GPT4All-J Lora 6B	68.6	75.8	66.2	63.5	56.4	35.7	40.2	58.1
GPT4All LLaMa Lora 7B	73.1	77.6	72.1	67.8	51.1	40.4	40.2	60.3
GPT4All 13B snoozy	83.3	79.2	75	71.3	60.9	44.2	43.4	65.3

bard:

The chart you provided shows the performance of a large language model called GPT4All 13B snoozy on a variety of natural language processing tasks. The tasks are listed in the left column, and the model's performance on each task is listed in the right column. The performance is measured in terms of accuracy, and the higher the accuracy, the better the model performs.
The following are descriptions of the tasks and the column terms:

BoolQ: This task tests the model's ability to answer questions that can be answered with a Boolean (true or false) answer.
PIQA: This task tests the model's ability to answer questions that require it to identify the correct answer from a set of multiple choices.
HellaSwag: This task tests the model's ability to generate creative text, such as poems, code, scripts, and musical pieces.
WinoGrande: This task tests the model's ability to answer questions about factual topics.
ARC-e: This task tests the model's ability to answer questions that require it to understand the context of the question.
ARC-c: This task tests the model's ability to answer questions that require it to understand the relationship between different concepts.
OBQA: This task tests the model's ability to answer questions that require it to understand the structure of text.
Avg: This column shows the average accuracy of the model on all of the tasks.

The chart shows that GPT4All 13B snoozy performs well on all of the tasks, with an average accuracy of 65.3%. This suggests that the model is capable of understanding and responding to a wide range of natural language queries.
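A quick sanity check on the Avg column, assuming it is the plain unweighted mean of the seven task scores in each row (the chart itself doesn't say how it was computed). The numbers are copied from the table above; the variable and function names are just for this sketch.

```python
# Task columns in table order: BoolQ, PIQA, HellaSwag, WinoGrande, ARC-e, ARC-c, OBQA
TASK_SCORES = {
    "GPT4All-J 6B v1.0": [73.4, 74.8, 63.4, 64.7, 54.9, 36, 40.2],
    "GPT4All-J v1.1-breezy": [74, 75.1, 63.2, 63.6, 55.4, 34.9, 38.4],
    "GPT4All-J v1.2-jazzy": [74.8, 74.9, 63.6, 63.8, 56.6, 35.3, 41],
    "GPT4All-J v1.3-groovy": [73.6, 74.3, 63.8, 63.5, 57.7, 35, 38.8],
    "GPT4All-J Lora 6B": [68.6, 75.8, 66.2, 63.5, 56.4, 35.7, 40.2],
    "GPT4All LLaMa Lora 7B": [73.1, 77.6, 72.1, 67.8, 51.1, 40.4, 40.2],
    "GPT4All 13B snoozy": [83.3, 79.2, 75, 71.3, 60.9, 44.2, 43.4],
}

# Avg column exactly as reported in the table.
REPORTED_AVG = {
    "GPT4All-J 6B v1.0": 58.2,
    "GPT4All-J v1.1-breezy": 57.8,
    "GPT4All-J v1.2-jazzy": 58.6,
    "GPT4All-J v1.3-groovy": 58.1,
    "GPT4All-J Lora 6B": 58.1,
    "GPT4All LLaMa Lora 7B": 60.3,
    "GPT4All 13B snoozy": 65.3,
}


def mean_score(scores):
    """Unweighted mean of the task scores, rounded to one decimal place."""
    return round(sum(scores) / len(scores), 1)


computed = {model: mean_score(scores) for model, scores in TASK_SCORES.items()}
for model, avg in computed.items():
    status = "matches" if avg == REPORTED_AVG[model] else "differs from"
    print(f"{model}: computed {avg}, {status} reported {REPORTED_AVG[model]}")
```

Note that each row of the chart is a different model and each column is a task, so the 65.3 figure is the snoozy row's mean across the seven tasks.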

bnew · May 18, 2023

Someone just made a ChatGPT plugin that lets AI take over your PC

Someone's just made a ChatGPT plugin that gives the AI control over your PC. Could this be the beginning of the end?

www.videogamer.com

Updated on May 18, 2023 by Amaar Chowdhury
Posted in News

After ChatGPT was released late last year, people began sign-posting the beginning of the end. While many laughed off these concerns, partly because of the restrictions that OpenAI had imposed on the chatbot, they have recently become a very real issue.

OpenAI began opening up access for web-browsing on ChatGPT recently, while also giving more developers tools to work on plugins. One particular developer immediately began working on a way for ChatGPT to get full access to your PC using JavaScript – and the results are pretty worrying.

Reddit user marcocastignoli posted the following thread on /r/ChatGPT. It documented not just how the plugin could be used to access all of the documents and files on a system, but how it could give the artificial intelligence total control of your PC too.

The plugin can do a few things such as access all local files, control keyboard and mouse input, open applications, and much more.

After publishing the experiment online, Marco tweeted that it is just an experiment showing off the possibilities of AI and that, because safety is the absolute priority, the plugin will not be published to GitHub.

The comments reacting to the original Reddit post were obviously filled with fear and concern. The number one saving grace of the current state of artificial intelligence is that it has virtually no agency. The ability to actually act makes it an extremely dangerous technology, and it brings AI much closer to technological singularity (the point in time after which technology far surpasses humankind).

One Redditor commented “Found the beginning of the end ^”, while others responded slightly more level-headed:

“The way OP is using ChatGPT here is like all the behind-the-scenes you take for granted when you hit the power button on your computer or launch an application. Unprompted action by the AI is vastly different than prompted.”

While on the surface this isn’t a particularly harmful exploit, and at its heart it’s scientific and experimental, it rings true the idea that if someone can do something, someone will do something. At some point, it seems as though AI is going to be a lot more dangerous than it is now. Is this really the beginning of the end?

https://archive.is/F0ub9

bnew · May 19, 2023

https://archive.is/CP54U

TheBloke/Manticore-13B-GPTQ · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co

TheBloke/Manticore-13B-GGML · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co

openaccess-ai-collective/manticore-13b · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co

Manticore 13B GGML - a Hugging Face Space by openaccess-ai-collective

Discover amazing ML apps made by the community

huggingface.co

Morethan1 · May 19, 2023

bnew · May 19, 2023

https://archive.is/NeGom

bnew · May 19, 2023

Morethan1 said:

my goodness! it opened its mouth. :mindblown:

edit:

whoa

https://s3.amazonaws.com/moonup/production/uploads/60a551a34ecc5d054c8ad93e/asSF_QiOtZ-Iqv2fV2-QE.mp4

project page:

Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold

Synthesizing visual content that meets users' needs often requires flexible and precise controllability of the pose, shape, expression, and layout of the generated objects. Existing approaches gain controllability of generative adversarial networks (GANs) via manually annotated training data or...

vcai.mpi-inf.mpg.de

Morethan1 · May 19, 2023

bnew said:
my goodness! it opened its mouth.

I try not to post in this thread, just follow and learn, but I believe this will be an option on our phones in a bit and women will really be lying, men too I guess.

bnew · May 19, 2023

Morethan1 said:
I try not to post in this thread, just follow and learn, but I believe this will be an option on our phones in a bit and women will really be lying, men too I guess.

DragGAN also allows users to optionally draw a region of interest to perform region-specific editing. Since DragGAN does not rely on any additional networks like RAFT [Teed and Deng 2020], it achieves efficient manipulation, only taking a few seconds on a single RTX 3090 GPU in most cases. This allows for live, interactive editing sessions, in which the user can quickly iterate on different layouts till the desired output is achieved.

once there's a mobile equivalent of that GPU, sure. :ld:

bnew · May 19, 2023

awesome-ml/llm-model-list.md at master · underlines/awesome-ml

Curated list of useful LLM / Analytics / Datascience resources - underlines/awesome-ml

github.com
