-
-
Notifications
You must be signed in to change notification settings - Fork 204
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CLBlast failed to compile with CLTune enabled #61
Comments
Hello, some progress. Installing cuda toolkit 7.5. CLTune compiled successfully. Only other issue was
Deleting dup definitions resolved them. Not sure about root cause, python script or dup entries in json files? Finally, see some speedup. GEMM 8192x8192 CPU: "Elapsed time: 11074.600465 msecs" |
I am not sure how your previous issues were caused and solved, but if you think CLBlast can be improved to prevent such issues in the future let me know how you solved them and what I can do to help out. About your duplicate definitions issue: which branch did you check out? The master branch? I cannot find the duplicate definition, but maybe I am looking at the wrong place. About the speed-up results: did you also actually run the tuner first? If you did, can you upload the results here: #1. That way it will be fast out-of-the-box in the next version for your GPU! If not, can you run them as well? Also, are the CPU times also using CLBlast or using some other BLAS library? |
The root cause of previous issues, undefined reference to "...@OPENCL_1.0" symbol in my libclblast.so build, was that I upgraded nvidia driver in aws g2 ami, but didn't know I need to upgrade old cuda sdk, which has OPENCL_1.0 headers in /opt/nvidia/cuda. Installing cuda sdk 7.5 resolved that issue as your README mentions.
Yes, I checked out master branch. Duplicate definitions only showed up after running below two steps:
I saw NVIDIA GRID K520 device got inserted into database/*.h files. However, there was new Database::*Single also inserted. Here is a sample snippet. Could it be an issue in python json parsing or code generation? I can reproduce it in my laptop and aws g2 environment. Let me know if more detail would help.
Yes, I did run "make alltuners; python ...; make". Though I noticed a few tuner rules with certain parameters output "FAILED", overall build completed successfully. I'll upload the result soon. Thank you very much for the help! Regards, |
Thanks for the details Feng! The issue you are seeing is a mistake on my side, sorry for that. The 'database' of tuner results ( To fix it for now (if you want), you can replace I will add your tuning results (after you've uploaded the .JSON files) to CLBlast such that it is included in the next release (also of Neanderthal). About the tuner failing once a while: that's OK. It will automatically filter out those results. |
Thank you for looking into the issue! I'll make a clean build today, and upload tuner .json files at #1 . Regards, |
@CNugteren, tried what you suggested, still getting the same issue. Not urgent as editing database/*.h seems working fine for me to make a local build. Since you are working on fp16 dev branch right now, I'll wait and run it again when you have next release ready. Just to log some details here. First, removed everything, and checkout a fresh copy of CLBlast master branch. Modify DATABASE_SERVER_URL
Ran cmake ...; make; make alltuners while running python ... did see it was downloading previous db
Still getting redefinition errors
Here is git diff at this point. |
Sorry, I didn't test it myself but guessed that that version would work based on the date. I tested myself and indeed, it doesn't. This one does not include the fp16 results, I just tested it: Alternatively, you can just checkout the By the way, it will probably take another few weeks before a new release (0.8.0) will be available. |
Built master branch successful again 2016_05_08_clblast.db. Thanks! Here are some quick benchmark. CPU number is from single-thread ATLAS compiled on the same aws g2 instance. 20x speed-up!
|
OK, I guess we can close this issue then. The database issue will remain until a new version of CLBlast is released, but this will not impact a regular user of the library. |
Hello, Please see for detail.
uncomplicate/neanderthal#17
The text was updated successfully, but these errors were encountered: