Pytorch 1.12 #122

hmaarrfk · 2022-07-16T15:56:26Z

Closes #123 <--- Maybe merge that one instead
Closes #120
Closes #107 (Thank you @ngam)

Builds locally for:

CPU and linux64 bit.
GPU and linux64 (cuda 11.2)

Checklist

Used a personal fork of the feedstock to propose changes
Bumped the build number (if the version is unchanged)
Reset the build number to 0 (if the version changed)
Re-rendered with the latest conda-smithy (Use the phrase @conda-forge-admin, please rerender in a comment in this PR for automated rerendering)
Ensured the license file is being packaged.

conda-forge-linter · 2022-07-16T15:56:29Z

Hi! This is the friendly automated conda-forge-linting service.

I just wanted to let you know that I linted all conda-recipes in your PR (recipe) and found it was in an excellent condition.

ngam · 2022-07-16T18:20:15Z

Please see #118 for earlier attempt(s). I am happy to help here, though I will likely be limited in my ability to diagnose the problem I faced in #118 ...

Thank you for opening this!

(Also, people like isuruf and others pointed out that we could try to build without differentiating mps variants --- I could not get that to work, but it should be doable. I personally vote for differentiating them until we figure out a better way...)

Edit 2: I expect all these builds to fail with the same issue as in #118, except for the MPS ones in your matrix. That's exactly where I got stuck...

ngam · 2022-07-16T18:36:51Z

conda-forge.yml

    pool:
-      vmImage: windows-2019
-  store_build_artifacts: true
+      vmImage: macos-12


Because of this change to macos-12, the image exposes a collection of the most recent macOS SDKs. The upstream logic ignores your USE_MPS flag in build_pytorch.sh and turns MPS on regardless. Hence, if you go through the logs, you will see MPS is on for all the builds...

    -- USE_LEVELDB : OFF
    -- USE_LITE_PROTO : OFF
    -- USE_LMDB : OFF
    -- USE_METAL : OFF
    -- USE_PYTORCH_METAL : OFF
    -- USE_PYTORCH_METAL_EXPORT : OFF
    -- USE_MPS : ON
    -- USE_FFTW : OFF
    -- USE_MKL : ON
    -- USE_MKLDNN : ON
    -- USE_NCCL : OFF
    -- USE_NNPACK : ON
    -- USE_NUMPY
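To check a flag without scanning the whole log by eye, the summary can be parsed mechanically. This is a minimal sketch, assuming the entries keep the `-- NAME : VALUE` shape shown above; `parse_cmake_summary` is a hypothetical helper name, not part of PyTorch or conda-build.

```python
from typing import Dict, Optional


def parse_cmake_summary(text: str) -> Dict[str, Optional[str]]:
    """Parse '-- NAME : VALUE' entries from a CMake summary.

    Works whether the entries are on separate lines or flattened onto one.
    A blank value becomes '', and an entry with no ':' (for example a
    truncated log line) becomes None.
    """
    flags: Dict[str, Optional[str]] = {}
    for chunk in text.split("--"):
        chunk = chunk.strip()
        if not chunk:
            continue
        name, sep, value = chunk.partition(":")
        flags[name.strip()] = value.strip() if sep else None
    return flags


# Excerpt of the summary quoted in this thread.
summary = "-- USE_METAL : OFF -- USE_MPS : ON -- USE_MKL : ON -- USE_NUMPY"
flags = parse_cmake_summary(summary)
print("USE_MPS is", flags["USE_MPS"])
```

Run against each job's log, this makes it quick to confirm whether USE_MPS ended up ON even where the recipe asked for it to be off.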

The logic upstream is here: https://github.com/pytorch/pytorch/blob/b370959da11fe1c5fe9a6a6335a9b10ab6ad0a39/CMakeLists.txt#L90-L124. I cannot follow it all.

I can try to help you with this OSX stuff as we go forward, but the point is: if we build them all with macos-12, we no longer need to differentiate them. Eventually, this means that we are building for MPS, and on the user's device it will work if the user has macOS 12.3 and the associated SDKs.

However, the issue I faced was that the MPS flag gets activated, but the build then fails if the correct SDK is not exposed (note this is different from the initial exposure, because we control this in our builds by setting exactly the SDK expected). So somehow, the macos-12 image signals to the pytorch logic that the recent SDKs (12+) are available, and if we don't have them exposed in our matrix, the build fails.
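A mismatch like this could be caught before the build starts by comparing the exposed SDK version against a minimum. This is a minimal sketch, assuming MPS needs an SDK of at least 12.3; that figure is borrowed from the macOS 12.3 requirement mentioned above, and the exact threshold the upstream logic checks is an assumption here. `MIN_MPS_SDK`, `parse_version` and `sdk_supports_mps` are hypothetical names. On a macOS builder, the version string could come from `xcrun --sdk macosx --show-sdk-version`.

```python
# Assumption: minimum SDK for MPS, taken from the macOS 12.3 figure in this thread.
MIN_MPS_SDK = (12, 3)


def parse_version(text):
    """Turn '12.3' or '11.0.1' into a tuple of ints so versions compare numerically."""
    return tuple(int(part) for part in text.strip().split("."))


def sdk_supports_mps(sdk_version):
    """Return True if the given SDK version string meets the assumed minimum."""
    return parse_version(sdk_version) >= MIN_MPS_SDK


for version in ("10.9", "11.0", "12.3", "13.1"):
    state = "ok for MPS" if sdk_supports_mps(version) else "too old for MPS"
    print(version, state)
```

Comparing integer tuples rather than raw strings avoids "10.9" sorting after "10.15".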

Obviously, I could've been doing something so wrong, so maybe you will have better luck.

We currently have no mechanism to set different macos images for different builds. At least not that I am aware of... We potentially need to edit the pipelines yaml file ...

Thank you for pointing this out. I think maybe we can set MPS_FOUND=0 or something like that. I'm going to let these builds run since I think linux should now pass. I think we had been disabling MKLDNN for a while now.... unintentionally.

We will have to start building with macos-12 anyway (see conda-forge/conda-forge.github.io#1798 (comment))....

I think once I fixed the string parsing, it claims to follow the flag:

    -- USE_PYTORCH_METAL_EXPORT : OFF
    -- USE_MPS : OFF
    -- USE_FFTW : OFF
    -- USE_MKL :
    -- USE_MKLDNN : 0
    -- USE_NCCL : OFF
    -- USE_NNPACK : ON
    -- USE_NUMPY : ON
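Comparing this summary with the earlier one shows which flags actually moved. This is a minimal sketch, with the two strings below being excerpts of the logs quoted in this thread; `flags` and `changed` are hypothetical helper names.

```python
import re


def flags(summary):
    """Map each '-- NAME : VALUE' entry to its value ('' when the value is blank)."""
    return dict(re.findall(r"--\s+(\w+)\s*:[ \t]*(\w*)", summary))


def changed(before, after):
    """Return {name: (old, new)} for flags in both summaries whose values differ."""
    old, new = flags(before), flags(after)
    return {k: (old[k], new[k]) for k in old.keys() & new.keys() if old[k] != new[k]}


before = "-- USE_MPS : ON -- USE_MKL : ON -- USE_MKLDNN : ON -- USE_NNPACK : ON"
after = "-- USE_MPS : OFF -- USE_MKL : -- USE_MKLDNN : 0 -- USE_NNPACK : ON"

for name, (was, now) in sorted(changed(before, after).items()):
    print(f"{name}: {was!r} -> {now!r}")
```

Besides USE_MPS, a comparison like this also surfaces the blank USE_MKL and the `0` for USE_MKLDNN, which may be worth a second look.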

Ah! So the problem was that one needed to negate the default. Okay, that makes more sense. I didn't think of that before 😳

The main section said you built it locally for linux cpu? Did the find_path thing fix it? Wow!!!

I didn't think of that before

You did think of that. You just had your code two lines too early.

The main section said you built it locally for linux cpu? Did the find_path thing fix it? Wow!!!

Yeah, builds are quicker and I think it worked through the docker process.


hmaarrfk requested review from benjaminrwilson and sodre as code owners July 16, 2022 15:56

hmaarrfk force-pushed the pytorch_1.12 branch from 923bf8b to 7f4b486 Compare July 16, 2022 17:35

ngam reviewed Jul 16, 2022


hmaarrfk force-pushed the pytorch_1.12 branch from 11b8fb1 to fa8e7f8 Compare July 16, 2022 20:20

hmaarrfk and others added 4 commits July 16, 2022 16:33

Update to 1.12.0

1010873

Duplicate deps in pytorch meta.yml

433639b

Build osx with MSP support

75c7709

MNT: Re-rendered with conda-build 3.21.9, conda-smithy 3.21.0, and conda-forge-pinning 2022.07.16.15.49.25

9eabe2a

hmaarrfk force-pushed the pytorch_1.12 branch from fa8e7f8 to 9eabe2a Compare July 16, 2022 20:33

hmaarrfk marked this pull request as draft July 16, 2022 20:35

hmaarrfk mentioned this pull request Jul 16, 2022

Pytorch 1.12 mps single build #123

Merged

5 tasks

hmaarrfk closed this Jul 17, 2022
