bnew


Mostofi Lab’s latest research makes significant progress in imaging still objects with WiFi by exploiting the interaction of object edges with the incoming waves, using the Geometrical Theory of Diffraction (GTD). This further enables the first demonstration of WiFi reading through walls.


SCIENCE + TECHNOLOGY

September 11, 2023

WiFi can read through walls

Sonia Fernandez

Researchers in UC Santa Barbara professor Yasamin Mostofi’s lab have proposed a new foundation that can enable high-quality imaging of still objects with only WiFi signals. Their method uses the Geometrical Theory of Diffraction and the corresponding Keller cones to trace edges of the objects. The technique has also enabled, for the first time, imaging, or reading, the English alphabet through walls with WiFi, a task deemed too difficult for WiFi due to the complex details of the letters.

For more details on this technology, see the team’s video.
“Imaging still scenery with WiFi is considerably challenging due to the lack of motion,” said Mostofi, a professor of electrical and computer engineering. “We have then taken a completely different approach to tackle this challenging problem by focusing on tracing the edges of the objects instead.” The proposed methodology and experimental results appeared in the Proceedings of the 2023 IEEE National Conference on Radar (RadarConf) on June 21, 2023.

[Image: The letters BELIEVE as imaged by WiFi. Photo credit: Courtesy Mostofi Lab]

To showcase the capabilities of the proposed pipeline in imaging complex details, the researchers have shown how WiFi can image the English alphabet, even through walls.

This innovation builds on previous work in the Mostofi Lab, which since 2009 has pioneered sensing with everyday radio frequency signals such as WiFi for several different applications, including crowd analytics, person identification, smart health and smart spaces.
“When a given wave is incident on an edge point, a cone of outgoing rays emerges according to Keller’s Geometrical Theory of Diffraction (GTD), referred to as a Keller cone,” Mostofi explained. The researchers note that this interaction is not limited to visibly sharp edges but applies to a broader set of surfaces with a small enough curvature.
“Depending on the edge orientation, the cone then leaves different footprints (i.e., conic sections) on a given receiver grid. We then develop a mathematical framework that uses these conic footprints as signatures to infer the orientation of the edges, thus creating an edge map of the scene,” Mostofi continued.
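The cone condition is compact enough to sketch numerically. The following minimal Python snippet (an illustration under assumed geometry, not the authors’ code) marks the Keller-cone footprint of a single edge point on a planar receiver grid; the function name, the grid on the plane y = grid_y, and the tolerance are all assumptions:

```python
import numpy as np

def keller_cone_footprint(edge_point, edge_dir, incident_dir,
                          grid_x, grid_z, grid_y=1.0, atol=0.01):
    """Mark the grid points hit by the Keller cone of one edge point.

    GTD condition: an outgoing (diffracted) ray r makes the same angle
    with the edge tangent e as the incident ray d, i.e. r.e == d.e.
    The surviving points trace a conic section on the plane y = grid_y.
    """
    e = np.asarray(edge_dir, float)
    e /= np.linalg.norm(e)
    d = np.asarray(incident_dir, float)
    d /= np.linalg.norm(d)
    cos_beta = d @ e                       # cone half-angle set by the incident ray

    X, Z = np.meshgrid(grid_x, grid_z)     # receiver grid on plane y = grid_y
    P = np.stack([X, np.full_like(X, grid_y), Z], axis=-1)
    R = P - np.asarray(edge_point, float)  # outgoing ray to each grid point
    R /= np.linalg.norm(R, axis=-1, keepdims=True)
    return np.isclose(R @ e, cos_beta, atol=atol)

# Tilting edge_dir changes the conic section the mask traces on the grid,
# which is the kind of signature the method matches against.
mask = keller_cone_footprint(edge_point=[0, 0, 0], edge_dir=[0, 0, 1],
                             incident_dir=[1, 1, 0],
                             grid_x=np.linspace(-2, 2, 200),
                             grid_z=np.linspace(-2, 2, 200))
```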

[Image: Method for imaging the letter P. Photo credit: Courtesy Mostofi Lab]

Sample imaging in non-through-wall settings: their method can image details of the letter P in ways not previously possible.

More specifically, the team proposed a Keller cone-based imaging projection kernel. This kernel is implicitly a function of the edge orientations, a relationship that is then exploited to infer the existence and orientation of the edges via hypothesis testing over a small set of possible edge orientations. In other words, if the existence of an edge is determined, the edge orientation that best matches the resulting Keller cone-based signature is chosen for the point being imaged.
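A hedged sketch of that hypothesis test, reusing a footprint predictor like the one above; the threshold and the normalized-correlation score are illustrative choices, not the paper’s exact statistic:

```python
import numpy as np

def infer_edge(measured_power, candidate_dirs, predict_footprint, threshold=0.5):
    """Score each candidate edge orientation by correlating its predicted
    Keller-cone footprint with the measured receiver-grid power, then keep
    the best-matching orientation only if the match is confident enough."""
    m = (measured_power - measured_power.mean()) / (measured_power.std() + 1e-9)
    scores = []
    for e in candidate_dirs:
        s = predict_footprint(e).astype(float)
        s = (s - s.mean()) / (s.std() + 1e-9)
        scores.append(float((m * s).mean()))   # normalized correlation
    best = int(np.argmax(scores))
    if scores[best] < threshold:
        return None                            # no edge detected at this point
    return candidate_dirs[best], scores[best]
```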
“Edges of real-life objects have local dependencies,” said Anurag Pallaprolu, the lead Ph.D. student on the project. “Thus, once we find the high-confidence edge points via the proposed imaging kernel, we then propagate their information to the rest of the points using Bayesian information propagation. This step can further help improve the image, since some of the edges may be in a blind region, or can be overpowered by other edges that are closer to the transmitters.” Finally, once an image is formed, the researchers can further refine it using image completion tools from computer vision.
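The paper’s propagation step is Bayesian; as a loose, toy stand-in in the same spirit, high-confidence edge points can push their orientation estimates into weaker neighboring points:

```python
import numpy as np

def propagate(orientations, confidence, n_iters=5):
    """Toy stand-in for the Bayesian propagation step: each pixel adopts
    the orientation of a 4-neighbor when that neighbor is more confident
    than the pixel itself, with a small decay so influence fades with
    distance from the original high-confidence edge points."""
    ori, conf = orientations.copy(), confidence.copy()
    for _ in range(n_iters):
        for shift in [(-1, 0), (1, 0), (0, -1), (0, 1)]:
            nb_ori = np.roll(ori, shift, axis=(0, 1))
            nb_conf = 0.9 * np.roll(conf, shift, axis=(0, 1))  # decayed neighbor confidence
            take = nb_conf > conf
            ori[take], conf[take] = nb_ori[take], nb_conf[take]
    return ori, conf
```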
“It is worth noting that traditional imaging techniques result in poor imaging quality when deployed with commodity WiFi transceivers,” added Pallaprolu, “as the surfaces can appear near-specular at lower frequencies, thus not leaving enough signature on the receiver grid.”

The researchers have also extensively studied the impact of several different parameters, such as curvature of a surface, edge orientation, distance to the receiver grid, and transmitter location on the Keller cones and their proposed edge-based imaging system, thereby developing a foundation for a methodical imaging system design.
"We have then taken a completely different approach to tackle this challenging problem by focusing on tracing the edges of the objects instead."

In the team’s experiments, three off-the-shelf WiFi transmitters send wireless waves into the area. A WiFi receiver mounted on an unmanned vehicle emulates a receiver grid as it moves, measuring the received signal power, which is then used for imaging based on the proposed methodology.

The researchers have extensively tested this technology with several experiments in three different areas, including through-wall scenarios. In one example application, they developed a WiFi Reader to showcase the capabilities of the proposed pipeline.

This application is particularly informative, as the English alphabet presents complex details that can be used to test the performance of the imaging system. Along these lines, the group has shown how they can successfully image several alphabet-shaped objects. In addition to imaging, they can further classify the letters. Finally, they have shown how their approach enables WiFi to image and read through walls by imaging the details and reading the letters of the word “BELIEVE” through walls. They have also imaged a number of other objects, showing that they can capture details previously not possible with WiFi.

Overall, the proposed approach can open up new directions for RF imaging.

[Image: Profile photos of researchers in the Mostofi Lab. Photo credit: Courtesy Mostofi Lab]

From left to right: Ph.D. student Anurag Pallaprolu, former Ph.D. student Belal Korany, and Professor Yasamin Mostofi


More information about the project can be found at Reading Through Walls With WiFi

Additional information about Mostofi’s research is available at http://www.ece.ucsb.edu/~ymostofi/.

Mostofi can be reached at ymostofi@ece.ucsb.edu.
 

bnew



About

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

sites.google.com/view/medusa-llm

Introduction

Medusa is a simple framework that democratizes the acceleration techniques for LLM generation with multiple decoding heads.

[medusa_demo.gif: Medusa on Vicuna-7b.]
We aim to tackle the three pain points of popular acceleration techniques like speculative decoding:

  • Requirement of a good draft model.
  • System complexity.
  • Inefficiency when using sampling-based generation.
medusa_pipeline.jpg

Medusa adds extra "heads" to LLMs to predict multiple future tokens simultaneously. When augmenting a model with Medusa, the original model stays untouched, and only the new heads are fine-tuned during training. During generation, these heads each produce multiple likely words for the corresponding position. These options are then combined and processed using a tree-based attention mechanism. Finally, a typical acceptance scheme is employed to pick the longest plausible prefix from the candidates for further decoding.
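A rough PyTorch sketch of the extra-heads idea (the head structure, sizes, and usage here are illustrative assumptions, not Medusa's published code):

```python
import torch
import torch.nn as nn

class MedusaHead(nn.Module):
    """One extra decoding head: a small residual block plus a linear
    LM head that predicts the token k steps ahead from the backbone's
    last hidden state."""
    def __init__(self, hidden_size, vocab_size):
        super().__init__()
        self.proj = nn.Linear(hidden_size, hidden_size)
        self.lm_head = nn.Linear(hidden_size, vocab_size, bias=False)

    def forward(self, hidden):                       # hidden: (batch, hidden_size)
        h = hidden + torch.relu(self.proj(hidden))   # residual block
        return self.lm_head(h)                       # logits for step t + k

# The backbone stays frozen; only the heads are trained.
heads = nn.ModuleList([MedusaHead(4096, 32000) for _ in range(4)])
hidden = torch.randn(1, 4096)                        # backbone's last hidden state
candidates = [head(hidden).topk(5).indices for head in heads]  # top-5 tokens per step
```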
In a nutshell, we solve the challenges of speculative decoding with the following ideas:

  • Instead of introducing a new model, we train multiple decoding heads on the same model.
  • The training is parameter-efficient, so that even the GPU-poor can do it. And since there is no additional model, there is no need to adjust the distributed computing setup.
  • Relaxing the requirement of matching the distribution of the original model makes non-greedy generation even faster than greedy decoding (see the acceptance-rule sketch below).
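As referenced in the last point above, the typical acceptance rule can be sketched as follows; the thresholds eps and delta are illustrative hyperparameters:

```python
import torch

def typical_accept(logits, token, eps=0.3, delta=0.09):
    """Accept a candidate token if its probability under the original
    model clears an entropy-dependent threshold: low-entropy (confident)
    steps demand high probability, while high-entropy steps are lenient."""
    probs = torch.softmax(logits, dim=-1)
    entropy = -(probs * torch.log(probs + 1e-9)).sum(dim=-1)
    threshold = torch.minimum(torch.tensor(eps), delta * torch.exp(-entropy))
    return probs[..., token] > threshold
```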
size_speedup.png

In this initial release, our primary focus is on optimizing Medusa for a batch size of 1—a setting commonly utilized for local model hosting. In this configuration, Medusa delivers approximately a 2x speed increase across a range of Vicuna models. We are actively working to extend Medusa's capabilities by integrating it into additional inference frameworks, with the aim of achieving even greater performance gains and extending Medusa to broader settings.
 

bnew
Another resource

 

bnew

Textbooks Are All You Need II: phi-1.5 technical report


Yuanzhi Li, Sébastien Bubeck, Ronen Eldan, Allie Del Giorno, Suriya Gunasekar, Yin Tat Lee
We continue the investigation into the power of smaller Transformer-based language models as initiated by TinyStories (a 10 million parameter model that can produce coherent English) and the follow-up work on phi-1, a 1.3 billion parameter model with Python coding performance close to the state of the art. The latter work proposed to use existing Large Language Models (LLMs) to generate "textbook quality" data as a way to enhance the learning process compared to traditional web data. We follow the "Textbooks Are All You Need" approach, focusing this time on common sense reasoning in natural language, and create a new 1.3 billion parameter model named phi-1.5, with performance on natural language tasks comparable to models 5x larger, and surpassing most non-frontier LLMs on more complex reasoning tasks such as grade-school mathematics and basic coding. More generally, phi-1.5 exhibits many of the traits of much larger LLMs, both good (such as the ability to "think step by step" or perform some rudimentary in-context learning) and bad, including hallucinations and the potential for toxic and biased generations; encouragingly though, we are seeing improvement on that front thanks to the absence of web data. We open-source phi-1.5 to promote further research on these urgent topics.

 

voiture
Does anyone know which AI program is used to create these types of videos? I understand even the voice is AI.
 

bnew
Does anyone know which AI program is used to create these types of videos? I understand even the voice is AI.


Probably Midjourney for the images, but I don't know what they're using for the video effects that make the still images more lively with the commentary. If the voice is AI, then they likely used so-vits-svc or Whisper.
 

bnew

Using AI to Decode Animal Communication with Aza Raskin




Aug 1, 2023
From crows to dolphins, gelada monkeys to primrose flowers - Aza Raskin, co-founder of Earth Species Project, shares how the latest advances in AI help us to better understand and learn from other species. In this talk, learn how our ability to communicate with other species could transform the way humans relate to the rest of nature. This talk was recorded at Summit At Sea in May 2023.
 

bnew


Roblox is about to onboard over 200M people to AI.

They just revealed a new AI chatbot called Roblox Assistant, allowing creators to build virtual worlds just by typing prompts.

What's even more fascinating is that over 43% of Roblox users are under the age of 13.

The next generation of builders will be learning how to talk with AI at a VERY young age.

We're headed into a new era.
 

bnew

Are Emergent Abilities in Large Language Models just In-Context Learning?


Large language models have exhibited emergent abilities, demonstrating exceptional performance across diverse tasks for which they were not explicitly trained, including those that require complex reasoning abilities. The emergence of such abilities carries profound implications for the future direction of research in NLP, especially as the deployment of such models becomes more prevalent. However, one key challenge is that the evaluation of these abilities is often confounded by competencies that arise in models through alternative prompting techniques, such as in-context learning and instruction following, which also emerge as the models are scaled up. In this study, we provide the first comprehensive examination of these emergent abilities while accounting for various potentially biasing factors that can influence the evaluation of models. We conduct rigorous tests on a set of 18 models, encompassing a parameter range from 60 million to 175 billion parameters, across a comprehensive set of 22 tasks. Through an extensive series of over 1,000 experiments, we provide compelling evidence that emergent abilities can primarily be ascribed to in-context learning. We find no evidence for the emergence of reasoning abilities, thus providing valuable insights into the underlying mechanisms driving the observed abilities and thereby alleviating safety concerns regarding their use.




 
