GPU OOM during the embedding process #13428

Open · 1 task done
mistoFENG opened this issue May 11, 2024 · 1 comment
Labels: question (Further information is requested)

Comments

@mistoFENG

Question Validation

  • I have searched both the documentation and discord for an answer.

Question

As the files being embedded get larger, GPU memory usage keeps increasing during the embedding process, which eventually causes a GPU OOM error. How can I avoid this? Thanks


dosubot bot commented May 11, 2024

To manage GPU memory and avoid Out of Memory (OOM) errors during the embedding process, consider implementing the following strategies:

  1. Limit Batch Size: Ensure your data is processed in smaller batches that do not exceed the maximum batch size limit, which is set to 259 in the NVIDIAEmbedding class. Smaller batches keep peak GPU memory usage down (see the first sketch after this list).

  2. Truncate Input Text: Use the truncation feature in the NVIDIAEmbedding class to keep the input text within the model's maximum token length, reducing memory usage.

  3. Optimize Memory Usage: After each embedding operation, call torch.cuda.empty_cache() to release PyTorch's cached GPU memory and gc.collect() to trigger Python garbage collection (see the second sketch below the list).

  4. Adjust Configurable Parameters: Experiment with adjusting parameters such as DEFAULT_EMBED_BATCH_SIZE, DEFAULT_CONTEXT_WINDOW, and others to find a balance that works best for your specific use case and helps in managing GPU memory usage.

  5. Utilize Error Handling Mechanisms: The codebase includes error handling for embedding processes, which can be extended or adapted to manage GPU OOM errors by logging or taking recovery actions through the callback system.
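
The first two points can be applied when the embedding model is constructed. Here is a minimal sketch, assuming the model is configured through LlamaIndex's embed_batch_size option; the model name and batch size below are illustrative assumptions, not values taken from this issue:

```python
# Minimal sketch: cap the embedding batch size so fewer texts are tokenized
# and embedded on the GPU at once. Model name and batch size are assumptions.
from llama_index.core import Settings
from llama_index.embeddings.huggingface import HuggingFaceEmbedding

Settings.embed_model = HuggingFaceEmbedding(
    model_name="BAAI/bge-small-en-v1.5",  # assumed local model, swap in your own
    embed_batch_size=8,                   # smaller batches -> lower peak GPU memory
)
```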

By carefully managing batch sizes, input text length, and memory usage, and by adjusting configurable parameters, you can mitigate the risk of GPU OOM errors during the embedding process.
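
Point 3 can be wired into a manual embedding loop. A rough sketch, assuming any LlamaIndex embedding instance that exposes get_text_embedding_batch; the helper name and chunk size are illustrative:

```python
# Minimal sketch: embed texts in small chunks and release cached GPU memory
# between chunks. embed_model is any LlamaIndex embedding instance.
import gc
import torch

def embed_in_chunks(embed_model, texts, chunk_size=64):
    embeddings = []
    for start in range(0, len(texts), chunk_size):
        chunk = texts[start:start + chunk_size]
        embeddings.extend(embed_model.get_text_embedding_batch(chunk))
        if torch.cuda.is_available():
            torch.cuda.empty_cache()  # free PyTorch's cached GPU allocations
        gc.collect()                  # collect unreachable Python objects
    return embeddings
```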


