
Reduce string allocations when formatting file system objects. #8831

Merged 3 commits into PowerShell:master on Feb 7, 2019

Conversation

@powercode (Collaborator) commented Feb 5, 2019

PR Summary

Cache the FullName property in the ProviderInfo class. Recomputing it causes extra string allocations for every item in the formatting pipeline.
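The caching described above can be sketched roughly like this (a minimal illustration, not the actual ProviderInfo source; `ProviderInfoSketch` and `SetModuleName` are made-up names standing in for the real members):

```csharp
using System;

// Sketch: a property whose value is composed from other strings allocates a
// new string on every access. Caching the composed value in a backing field
// avoids that, as long as the cache is invalidated when an input changes.
public class ProviderInfoSketch
{
    private string _fullName; // cached composed name

    public string Name { get; }
    public string ModuleName { get; private set; }

    public ProviderInfoSketch(string moduleName, string name)
    {
        ModuleName = moduleName;
        Name = name;
    }

    // Compose once, then return the cached instance on later accesses.
    public string FullName =>
        _fullName ??= string.IsNullOrEmpty(ModuleName)
            ? Name
            : ModuleName + "\\" + Name;

    public void SetModuleName(string moduleName)
    {
        ModuleName = moduleName;
        _fullName = null; // invalidate the cache, as SetModule does in the PR
    }
}
```

With this pattern, formatting 50,000 items reads the cached string 50,000 times instead of composing 50,000 fresh ones.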


@adityapatwardhan (Member)

@powercode Please have a look at the test failure.

@@ -136,6 +143,7 @@ public string ModuleName
internal void SetModule(PSModuleInfo module)
{
Module = module;
_fullName = null; // clear the cached FullName
@iSazonov (Collaborator):

Please remove the comment. It would be better to use the name _cacheFullModuleName.

@powercode (Collaborator, Author):

But it isn't a full module name. That name would be misleading. But sure, I can remove the comment.

@iSazonov (Collaborator), Feb 6, 2019:

If it is not a full name, what about _cachedModuleName?

I also see the comment "Gets the name of the provider.", so _cachedProviderName could be appropriate as well.

@powercode (Collaborator, Author):

The property is FullName, so it makes sense to me that the backing field is _fullName.

@iSazonov (Collaborator):

I was trying to fold your comment into the name, so that the name itself says it is a cached value :-)

@bergmeister (Contributor) commented Feb 6, 2019

Awesome. Out of curiosity: Do you have time measurements of cases where this improvement is noticeable?

@powercode (Collaborator, Author) commented Feb 6, 2019

@bergmeister It's mostly about allocations: it generates lots of extra strings, roughly two per item in the pipeline.

My reasoning is to steadily remove needless allocations and fix the worst performance offenders; eventually we will have a snappier shell.

The image below shows the dotMemory view of allocated strings, grouped by call stack, when looking at the first 50,000 items in the Windows directory. It wasn't the biggest offender, but it was one of the bigger ones, and it was low-hanging fruit. Does it make sense?

(image: dotMemory view of string allocations, grouped by call stack)

@bergmeister (Contributor)

Thanks @powercode. Yes, that makes sense. I agree that continuous improvement will make it better over time. :)

@powercode (Collaborator, Author) commented Feb 6, 2019

@bergmeister It may seem like I'm just doing random changes, but I actually measure things now and then :)

dotTrace and dotMemory are almost always running on my machine.

@iSazonov iSazonov self-assigned this Feb 7, 2019
@iSazonov iSazonov added the CL-CodeCleanup Indicates that a PR should be marked as a Code Cleanup change in the Change Log label Feb 7, 2019
@iSazonov iSazonov merged commit ec88045 into PowerShell:master Feb 7, 2019
@bergmeister (Contributor) commented Feb 7, 2019

@powercode I was not questioning it, I was just curious for my own education (knowing this also means knowing the scenarios where PSCore is stronger than Windows PowerShell, so I could recommend an upgrade to clients).
Would you mind having a look at issue #7603 about Import-Csv, please? It is causing actual OutOfMemory problems in some workflows where CSV files are a couple of gigabytes large.

@iSazonov (Collaborator) commented Feb 8, 2019

@bergmeister Windows PowerShell is still more efficient than PowerShell Core in many scenarios. The main reason is that the .NET Framework engine works differently from the .NET Core one. Optimizing individual cmdlets would be fine, but maybe we need something more general.

@powercode (Collaborator, Author) commented Feb 10, 2019

@bergmeister I'm currently working on optimizations for the formatting system, the filesystem provider, etc. that make it outperform Windows PowerShell by a huge margin, often around 4x, with a memory footprint reduction of similar size.

Import-Csv is problematic, since our property abstraction is heavier than I would like it to be. Especially in cases like that, where we actually have a table, we wouldn't need to store all the metadata for each object. I haven't given it much thought; I just passed by it last fall while doing related work.

I have an idea I'd like to try out for Import-Csv that could drastically reduce the memory footprint: generate a dynamic assembly with a class containing the fields of the CSV.
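That dynamic-assembly idea could be sketched with System.Reflection.Emit, roughly as follows (a minimal illustration of the approach, not code from the PR; `CsvTypeBuilder` and `CsvRow` are made-up names):

```csharp
using System;
using System.Reflection;
using System.Reflection.Emit;

// Sketch: build one dynamic type whose public string fields match the CSV
// header, so each row can be a compact typed object instead of carrying
// per-row property metadata.
public static class CsvTypeBuilder
{
    public static Type Build(string[] header)
    {
        var asm = AssemblyBuilder.DefineDynamicAssembly(
            new AssemblyName("CsvRows"), AssemblyBuilderAccess.Run);
        var mod = asm.DefineDynamicModule("CsvRows");
        var type = mod.DefineType("CsvRow", TypeAttributes.Public);

        // One public string field per CSV column.
        foreach (var column in header)
        {
            type.DefineField(column, typeof(string), FieldAttributes.Public);
        }

        return type.CreateType();
    }
}
```

Each parsed row would then be an instance of the generated type (created once per file via Activator.CreateInstance and populated through FieldInfo.SetValue or a compiled setter), so the column names are stored once on the type rather than once per row.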

@iSazonov (Collaborator):

I was thinking about compiling to classes too, and found some problems with the approach. One problem could be solved by #8852 (not trivial). After finding IDataView (#8855), I suppose that is the preferred way; we should start researching there.
