‘Jobs may disappear’: Nearly 40% of global employment could be disrupted by AI, IMF says

O.T.I.S. · Jul 1, 2024

bnew said:
they have thousands, possibly hundreds of thousands of hours of recorded phone calls, email correspondences, documentation and other digitized work product to train from.

a lot of companies are sitting on a ton of data they can train AI models on to tailor to their specific needs.

Good for them

Doesn't mean that shyt will still work as people with this AI doom and gloom expects

It’s funny because AI has BEEN around.. why didnt it take over tech jobs decades ago?

bnew · Jul 1, 2024

O.T.I.S. said:
Good for them

Doesn't mean that shyt will still work as people with this AI doom and gloom expects

It’s funny because AI has BEEN around.. why didnt it take over tech jobs decades ago?

cars been around for over a hundred years, a car made in 2024 is superior to a car made in 1920.

AI models made today are way better than models made just a year ago. do you think this technology has peaked?

Huda2daf · Jul 1, 2024

O.T.I.S. said:
Thats nowhere near tech though

Data entry at best

Huh only tech jobs count? I am confused by your response

TM101 · Jul 1, 2024

bnew said:
cars been around for over a hundred years, a car made in 2024 is superior to a car made in 1920.

AI models made today are way btter than models made just a year ago. do you think this technology has peaked?

I wouldn't say it's peaked, more so the transformer model will never be intelligent enough to do work on its own. They've fed it all the data we have on the Internet and it still hallucinates. It's a black box that spouts out random stuff that can't be fixed by adding more CPU and chips.

bnew · Jul 1, 2024

bnew · Jul 1, 2024

TM101 said:
I wouldn't say it's peaked, more so the transformer model will never be intelligent enough to do work on its own. They've fed it all the data we have on the Internet and it still hallucinates. It's a black box that spouts out random stuff that can't be fixed by adding more CPU and chips.

it hasn't been fed nearly all the information on the internet, not even close. there are sites and forums thats aren't indexable by the web like parts of thecoli that are not available to non-registered users.

A.I is increasingly being trained on high quality textbooks generated by A.I.

edit:

the transformer architecture isn't the only one thats been developed...

[2312.00752] Mamba: Linear-Time Sequence Modeling with Selective State Spaces

arxiv.org

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Albert Gu, Tri Dao

Foundation models, now powering most of the exciting applications in deep learning, are almost universally based on the Transformer architecture and its core attention module. Many subquadratic-time architectures such as linear attention, gated convolution and recurrent models, and structured state space models (SSMs) have been developed to address Transformers' computational inefficiency on long sequences, but they have not performed as well as attention on important modalities such as language. We identify that a key weakness of such models is their inability to perform content-based reasoning, and make several improvements. First, simply letting the SSM parameters be functions of the input addresses their weakness with discrete modalities, allowing the model to selectively propagate or forget information along the sequence length dimension depending on the current token. Second, even though this change prevents the use of efficient convolutions, we design a hardware-aware parallel algorithm in recurrent mode. We integrate these selective SSMs into a simplified end-to-end neural network architecture without attention or even MLP blocks (Mamba). Mamba enjoys fast inference (5× higher throughput than Transformers) and linear scaling in sequence length, and its performance improves on real data up to million-length sequences. As a general sequence model backbone, Mamba achieves state-of-the-art performance across several modalities such as language, audio, and genomics. On language modeling, our Mamba-3B model outperforms Transformers of the same size and matches Transformers twice its size, both in pretraining and downstream evaluation.

https://www.researchgate.net/publication/381009719_Hydra_Enhancing_Machine_Learning_with_a_Multi-head_Predictions_Architecture

A recurrent network model of planning explains hippocampal replay and human behavior

When faced with a novel situation, humans often spend substantial periods of time contemplating possible futures. For such planning to be rational, the benefits to behavior must compensate for the time spent thinking. Here we capture these features of human behavior by developing a neural...

www.biorxiv.org

Monarch Mixer: A new model architecture for increased efficiency

www.together.ai

[2310.12109] Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture

Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture

Daniel Y. Fu, Simran Arora, Jessica Grogan, Isys Johnson, Sabri Eyuboglu, Armin W. Thomas, Benjamin Spector, Michael Poli, Atri Rudra, Christopher Ré

Machine learning models are increasingly being scaled in both sequence length and model dimension to reach longer contexts and better performance. However, existing architectures such as Transformers scale quadratically along both these axes. We ask: are there performant architectures that can scale sub-quadratically along sequence length and model dimension? We introduce Monarch Mixer (M2), a new architecture that uses the same sub-quadratic primitive along both sequence length and model dimension: Monarch matrices, a simple class of expressive structured matrices that captures many linear transforms, achieves high hardware efficiency on GPUs, and scales sub-quadratically. As a proof of concept, we explore the performance of M2 in three domains: non-causal BERT-style language modeling, ViT-style image classification, and causal GPT-style language modeling. For non-causal BERT-style modeling, M2 matches BERT-base and BERT-large in downstream GLUE quality with up to 27% fewer parameters, and achieves up to 9.1× higher throughput at sequence length 4K. On ImageNet, M2 outperforms ViT-b by 1% in accuracy, with only half the parameters. Causal GPT-style models introduce a technical challenge: enforcing causality via masking introduces a quadratic bottleneck. To alleviate this bottleneck, we develop a novel theoretical view of Monarch matrices based on multivariate polynomial evaluation and interpolation, which lets us parameterize M2 to be causal while remaining sub-quadratic. Using this parameterization, M2 matches GPT-style Transformers at 360M parameters in pretraining perplexity on The PILE--showing for the first time that it may be possible to match Transformer quality without attention or MLPs.

RWKV Language Model

The RWKV Language Model

www.rwkv.com

RWKV (pronounced RwaKuv) is an RNN with GPT-level LLM performance, and can also be directly trained like a GPT transformer (parallelizable). We are at RWKV v6.
So it's combining the best of RNN and transformer - great performance, fast inference, fast training, saves VRAM, "infinite" ctxlen, and free text embedding. Moreover it's 100% attention-free, and a LFAI project.

[2404.05892] Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence

duncanthetall · Jul 1, 2024

I’ll be dead way way way before some AI powered android can rewire a house or troubleshoot some bullshyt in an attic. Y’all enlightened desk individuals needa watch out tho

bnew · Jul 1, 2024

duncanthetall said:
I’ll be dead way way way before some AI powered android can rewire a house or troubleshoot some bullshyt in an attic. Y’all enlightened desk individuals needa watch out tho

3rdWorld · Jul 1, 2024

Cacs screaming that ethnic minorities will not replace them, but created androids to replace them instead :mjlol:

bnew · Jul 1, 2024

Luke Cage said:
Everytime new technology becomes prominent people get concerned about it taking jobs.
And often it does do that, but what people seem to forget, is that it usually leads to the creation of new jobs as well.

https://www.weforum.org/agenda/2023/09/jobs-ai-will-create/

https://archive.is/seCDz

https://www3.weforum.org/docs/WEF_Jobs_of_Tomorrow_Generative_AI_2023.pdf

I read it and I think the trainers, explainers and sustainers it talks about can all inevitably be replace by AI. I mean people are using AI today to explain a myriad of subjects and Nvidia has already stated how their last few chips wouldn't be possible without AI.

BaggerofTea · Jul 1, 2024

duncanthetall said:
I’ll be dead way way way before some AI powered android can rewire a house or troubleshoot some bullshyt in an attic. Y’all enlightened desk individuals needa watch out tho

with infrared camera and ai reading imagines, humans would be able to do most repairs themselves with maybe some robotic assistance

Luke Cage · Jul 1, 2024

bnew said:
https://archive.is/seCDz

https://www3.weforum.org/docs/WEF_Jobs_of_Tomorrow_Generative_AI_2023.pdf

I read it and I think the trainers, explainers and sustainers it talks about can all inevitably be replace by AI. I mean people are using AI today to explain a myriad of subjects and Nvidia has already stated how their last few chips wouldn't be possible without AI.

I work in accounting, and for every feature that enables us to automated the work we do, there is a need for someone to regulate and manage the automation. Expand the scope, or shrink it depending on the goals you have for a particular fiscal period. This evitably happens.
Not to mention giving birth to entirely new industries as a side effect. Newpapers didn't didn't disappear, they were replaced by billion dollar social media industries and the like.

duncanthetall · Jul 1, 2024

BaggerofTea said:
with infrared camera and ai reading imagines, humans would be able to do most repairs themselves with maybe some robotic assistance

Your average human is scared to change a wall plate cover and thinks they’ll get shocked from the ground. I’m not worried :skip:

duncanthetall · Jul 1, 2024

bnew said:

Oh you think they’re gonna be using androids in the next 30 years more cost effective than flesh and blood humans? You nikkas are unhinged. :laff:

bnew · Jul 1, 2024

duncanthetall said:
Oh you think they’re gonna be using androids in the next 30 years more cost effective than flesh and blood humans? You nikkas are unhinged.

back in 2000 a compaq PC with 256MB ram and 20GB HDD cost $2400, today you can go on amazon and find a mobile device with way better specs for $50.

a lot can happen in 30 years. :manny:

‘Jobs may disappear’: Nearly 40% of global employment could be disrupted by AI, IMF says

More options

O.T.I.S.

Veteran

bnew

Veteran

Huda2daf

Superstar

TM101

All Star

bnew

Veteran

bnew

Veteran

[2312.00752] Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

A recurrent network model of planning explains hippocampal replay and human behavior

Monarch Mixer: A new model architecture for increased efficiency

Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture

RWKV Language Model

duncanthetall

Veteran

bnew

Veteran

3rdWorld

Veteran

bnew

Veteran

BaggerofTea

Veteran

Luke Cage

Coffee Lover

duncanthetall

Veteran

duncanthetall

Veteran

bnew

Veteran

Similar threads

‘Jobs may disappear’: Nearly 40% of global employment could be disrupted by AI, IMF says

Veteran

Veteran

Superstar

All Star

Veteran

Veteran

Mamba: Linear-Time Sequence Modeling with Selective State Spaces​

Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture​

Veteran

Veteran

Veteran

Veteran

Veteran

Coffee Lover

Veteran

Veteran

Veteran

Similar threads

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture