1/3
@rohanpaul_ai
LLMs gain human-like awareness of word positions through numbered tracking.
Adding position markers to LLM inputs enables exact length control and accurate text manipulation.
**Original Problem**
:
LLMs struggle with length control and precise copy-paste operations due to lack of positional awareness.
The authors identify a lack of positional awareness as the root cause of LLMs' inability to effectively control text length. This stems from token-level operations and insufficient training on data with strict length limitations.
-----
**Solution in this Paper**
:
• PositionID Prompting: Assigns sequential IDs to words/sentences/paragraphs during generation
• PositionID Fine-Tuning: Trains models on mixed normal and PositionID modes
• PositionID CP Prompting: Enables accurate copy-paste using a three-stage tool-use mechanism
-----
**Key Insights from this Paper**
:
• Explicit positional awareness enhances LLMs' length control and copy-paste abilities
• PositionID techniques work for both closed-source and open-source models
• Mixed-mode training transfers positional awareness to normal generation mode
-----
**Results**
:
• PositionID Prompting: Best Rouge-L (23.2) and MAE scores across all levels
• PositionID Fine-Tuning: Outperforms CFT and InstructCTG in MAE metrics
• PositionID CP Prompting: 80.8% CP Success Rate, 18.4 Rouge-L, 8.4 PPL
2/3
@rohanpaul_ai
LenCtrl-Bench Details
This component has three workflow variants:
Vanilla Prompting:
- Takes user query and length constraint
- Directly generates text without position tracking
- Less accurate length control
PositionID Prompting:
- Adds sequential position IDs to each word/token
- Helps model track length during generation
- More precise length control
- Example: "Three 1 -word 2 text 3"
PositionID Fine-Tuning:
- Trains model in two modes:
- Normal mode (without position IDs)
- PositionID mode (with position IDs)
- Infers in normal mode while retaining positional awareness
- Most effective for length control
3/3
@rohanpaul_ai
[2410.07035] PositionID: LLMs can Control Lengths, Copy and Paste with Explicit Positional Awareness
To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196
@rohanpaul_ai
LLMs gain human-like awareness of word positions through numbered tracking.
Adding position markers to LLM inputs enables exact length control and accurate text manipulation.
**Original Problem**
data:image/s3,"s3://crabby-images/8476d/8476d58b8a576d7f3dec2c0472b1857fa85bbc47" alt="Magnifying glass tilted left :mag: 🔍"
LLMs struggle with length control and precise copy-paste operations due to lack of positional awareness.
The authors identify a lack of positional awareness as the root cause of LLMs' inability to effectively control text length. This stems from token-level operations and insufficient training on data with strict length limitations.
-----
**Solution in this Paper**
data:image/s3,"s3://crabby-images/d9412/d941242f6b26abdd3e7a4f12e27d521e2382c365" alt="Hammer and wrench :tools: 🛠️"
• PositionID Prompting: Assigns sequential IDs to words/sentences/paragraphs during generation
• PositionID Fine-Tuning: Trains models on mixed normal and PositionID modes
• PositionID CP Prompting: Enables accurate copy-paste using a three-stage tool-use mechanism
-----
**Key Insights from this Paper**
data:image/s3,"s3://crabby-images/85456/85456b39d572a2920ce230866856306b113f1baf" alt="Light bulb :bulb: 💡"
• Explicit positional awareness enhances LLMs' length control and copy-paste abilities
• PositionID techniques work for both closed-source and open-source models
• Mixed-mode training transfers positional awareness to normal generation mode
-----
**Results**
data:image/s3,"s3://crabby-images/25ef7/25ef702f8f8d094f600a6bd7b77ea897cd3deb89" alt="Bar chart :bar_chart: 📊"
• PositionID Prompting: Best Rouge-L (23.2) and MAE scores across all levels
• PositionID Fine-Tuning: Outperforms CFT and InstructCTG in MAE metrics
• PositionID CP Prompting: 80.8% CP Success Rate, 18.4 Rouge-L, 8.4 PPL
data:image/s3,"s3://crabby-images/1bc3d/1bc3dcb5e0f1b580025d8b0fc3aa026089b277d8" alt="GarS0HTWkAE-f6S.png"
2/3
@rohanpaul_ai
data:image/s3,"s3://crabby-images/9fac2/9fac2c27c1df678815296f71063271440f28114f" alt="Memo :pencil: 📝"
This component has three workflow variants:
data:image/s3,"s3://crabby-images/58a4f/58a4fe1c5a2803e1a818b134600eb46ea873212c" alt="Backhand index pointing right :point_right: 👉"
- Takes user query and length constraint
- Directly generates text without position tracking
- Less accurate length control
data:image/s3,"s3://crabby-images/58a4f/58a4fe1c5a2803e1a818b134600eb46ea873212c" alt="Backhand index pointing right :point_right: 👉"
- Adds sequential position IDs to each word/token
- Helps model track length during generation
- More precise length control
- Example: "Three 1 -word 2 text 3"
data:image/s3,"s3://crabby-images/58a4f/58a4fe1c5a2803e1a818b134600eb46ea873212c" alt="Backhand index pointing right :point_right: 👉"
- Trains model in two modes:
- Normal mode (without position IDs)
- PositionID mode (with position IDs)
- Infers in normal mode while retaining positional awareness
- Most effective for length control
data:image/s3,"s3://crabby-images/52f78/52f78a35f6695c9f8d9409340d9476da52a5c5d5" alt="GarUsGOWMAA0KBH.png"
3/3
@rohanpaul_ai
data:image/s3,"s3://crabby-images/4b6c6/4b6c6ec45f9e314990d08db4c10651743a1c8db5" alt="Books :books: 📚"
To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196
1/1
@arXivGPT
:PositionID: LLMs can Control Lengths, Copy and Paste with Explicit Positional Awareness
:https://arxiv.org/pdf/2410.07035.pdf
To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196
@arXivGPT
data:image/s3,"s3://crabby-images/c48fc/c48fcaf0e9b006ed3b17d737e149446d2bcf7721" alt="Label :label: 🏷️"
data:image/s3,"s3://crabby-images/4952a/4952a9c381782f2caca6371d496f33f14a5ef899" alt="Link :link: 🔗"
data:image/s3,"s3://crabby-images/cdb8d/cdb8d94c80a3348fe961a9cd4ae4477b723b844f" alt="GZ9N37aWwAATYoq.jpg"
To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196