-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
File descriptors are not all released in vbs2file #37
Comments
Thank you for reporting this issue! We will look into it. If I am correct you want to create single files for each of the "vbs"-style scattered recordings, right? After compiling the Let me know if this helps you. |
Thank you very much for your quick response.
We have tested the VBS tools you suggested and was successfully worked!
Since the usage of the tool was a little bit different from m5copy which is the one currently using, we will compare and check which would suit more for Ishioka operation.
Thank you for your helpful advice!
Please stay healthy, Merry Christmas and a happy new year!
|
HarunaF: After scan_set= + disk2file one (1) file descriptor was leaked, after many disk2files run out of max open files! Indeed, this is caused by scan_set= caching the open file for later use; vbs_read() will start /really/ opening the file(s). Probably better to close the whole file after use; this means that after a disk2file the scan as set by "scan_set=" is invalidated/has become unusable for e.g. scan_check? Whether that is desirable needs to be investigated - if the scan_set= scan should remain accessible after disk2file= then we need to come up with a different solution to prevent leaking file descriptors.
Hi HF-vlbi, I have created a new branch issue-37. Can you try that version? Could you verify that this fixes the issue? Maybe it provokes other usability problems that I have not foreseen. |
We are also seeing this with transfer vbs: ===> file: Unacceptable return code 4 from query 'runtime=4iS5QPxG:transient' - !runtime = 4 : /home/observer/jive5ab_src/jive5ab/src/mountpoint.cc@837 [(mtab=::setmntent("/etc/mtab", "r"))] fails - Too many open files(24); [acceptable: [0]] At the time, reading for transfer, and record writing concurrently. |
We tried issue-37 branch version. Still the same behaviour with m5copy vbs ==> file transfers, open files just keep piling up, after transfer has completed. Number of open jive5ab process files going up in order of tens per scan transfer, after 15 minutes, hundreds of files, until a few thousand after hours and then problematic ~ 10,000 ; jive5ab can't access some disks; fails - Too many open files(24) We notice that e.g 11 files open during scan transfer, when transfer complete, some get get closed, then minutes later, 11 open again. We also see "too many open files" behaviour when we mount vbs_fs and try m5copy file ==> file transfer. |
We notice that it is a single vbs chunk, with the same file descriptor, open for read many times, by multiple threads. |
Hi ops-obs, Thanks for your test results! The oscillating between 7 and 11 open files seems OK ("seems root cause solved") - but if In discussions with other users it would seem that the current "fix" has unwanted side effects, so this definitely will not be the final solution. Stay tuned, but don't hold breath ... I wonder if it's coincidence or definite hint of something fundamentally wrong if it's the same file that remains open. |
Dear all, It seems that the previous issue was cleared and i worked well, thank you. jive5ab_error_log_20250219.txt However, since the VDIF file name was changed in the new version, we decided to use the old version until we are ready to change our local scripts which refers file names. EX. Thank you for your kind and quick support. |
Thanks for the feedback! |
I have one more request, if that's ok. After discussing the current approach with another user it became clear that whilst the current fix is OK for this specific use case, it would impact other use cases badly. Would you be kind enough to Note: for details about why another approach see the commit message |
After we updated jive5ab from 3.0.0 to 3.1.0, an error message [Too many open files] became to be popped out when we try to combine the observed VLBI data at FlexBuff.
SRC::VBS [127.0.0.1:2620] r11185_is_345-1659* ===> DST::FILE [127.0.0.1:2620:] /mnt/nfs_banana/vlbi/se_test/ Unacceptable return code 4 from query 'runtime=lkBFpvQi:transient' - !runtime = 4 : /usr/local/src/jive5ab/jive5ab-3.1.0/src/mountpoint.cc@837 [(mtab=::setmntent("/etc/mtab", "r"))] fails - Too many open files(24); [acceptable: [0]]
jive5ab_error_log.txt
In the attached log, the data [1647][1651][1654] were combined, but at the same time the file descriptors were left there and thus jive5ab kept holding those three files at last.
This problem does not occur in ver. 3.0.0.
There should be some bug in jive5ab-3.1.0.
I would be happy if someone could check this issue and fix the bug in jive5ab-3.1.0.
Thank you, and Merry Christmas!!
The text was updated successfully, but these errors were encountered: