The A.I Megathread (LLM , GPT , Development)

bnew · Feb 7, 2024

GitHub - argmaxinc/WhisperKit: Swift native on-device speech recognition with Whisper for iOS and macOS

Swift native on-device speech recognition with Whisper for iOS and macOS - GitHub - argmaxinc/WhisperKit: Swift native on-device speech recognition with Whisper for iOS and macOS

github.com

bnew · Feb 7, 2024

Alapaca-Eval is a benchmark that uses a reference model to emulate human interactions and determine the extent to which an A.I. model being tested delivers results in line with the baseline. It also provides users with a leaderboard to track their tests, and today's benchmarks show that Qwen 1.5's Alapaca-Eval performance only lags behind GPT-4 Turbo and New York based HuggingFace's Yi-34B.

Qwen1.5 is one of the largest open source models of its kind, and it's backed by Alibaba's massive computing resources. An open source A.I., like open source software, makes its code available to users and developers so that they can understand the model and make their own variants. Meta's Llama, also present in today's scores, is also an open source model.

The start of 2024 has seen renowned focus from Wall Street and companies on A.I. Earnings reports of mega cap technology giants such as Meta, Microsoft and Alphabet have all focused on A.I. Meta's chief Mark Zuckerberg aims to buy hundreds of thousands of GPUs this year to power up Llama, and at the firm's earnings call the executive explained that his decision to beef up computing capacity at Meta follows earlier oversights that led to the firm being under capacity.

Similarly, earnings from chip makers and designers TSMC and AMD have also seen their managements express optimism for the future of A.I. TSMC's management is confident that the firm has stable footing to capture any A.I. demand, while AMD is of the view that A.I. can end up becoming worth hundreds of billions of dollars by the end of the decade.[/SIZE]

bnew · Feb 7, 2024

YouTube

Enjoy the videos and music that you love, upload original content and share it all with friends, family and the world on YouTube.

www.youtube.com

bnew · Feb 7, 2024

bnew · Feb 7, 2024

[2401.06118] Extreme Compression of Large Language Models via Additive Quantization

GitHub - Vahe1994/AQLM: Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf

Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf - GitHub - Vahe1994/AQLM: Official Pytorch repository for...

github.com

bnew · Feb 7, 2024

GitHub - OpenGVLab/InternVL: InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks —— An Open-Source Alternative to ViT-22B

InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks —— An Open-Source Alternative to ViT-22B - GitHub - OpenGVLab/InternVL: InternVL: Scaling up Vision Fo...

github.com

About

InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks —— An Open-Source Alternative to ViT-22B