What specific data exactly will be sent to Copilot? #59630
-
Parth Thakkar has reverse engineered the plugin in VSCode and released a blog post, Copilot Internals, which explains what data is sent to Copilot. However, I would prefer official documentation on exactly what data is sent to Copilot (or is specified to be sent), which can be used in discussions.
-
Select Topic Area
Question
Body
To understand the range of possible suggestions generated by Copilot, I would like a detailed technical description of exactly which data is sent to Copilot. The features list on https://github.com/features/copilot gives only a very vague description of the data being sent.
However, it doesn't specify exactly what is sent.
The Privacy Statement on https://docs.github.com/en/site-policy/privacy-policies/github-copilot-for-business-privacy-statement explains, again vaguely, that "Code Snippets" are sent to Copilot.
Again, there is no exact definition of the "Code Snippets" that are sent to Copilot.
The official documentation for "Enabling or disabling duplication detection" on https://docs.github.com/en/copilot/configuring-github-copilot/configuring-github-copilot-settings-on-githubcom#enabling-or-disabling-duplication-detection says that "about 150 characters" around the current location are checked.
But this might apply only to the specific case of detecting code that duplicates public GitHub repositories.
So the question is: What exact data is sent to Copilot to generate suggestions?
It looks like neither the official documentation on https://docs.github.com/en/copilot nor the feature list on https://github.com/features/copilot explains exactly what data is sent to Copilot. Usage experience suggests that some content from other open tabs is sent, but I'm not sure about that. Alternatively, it may have some other way to "remember" what code was previously seen or used.