1170115208 M * Bertl in both cases I see at least partially undefined behaviour
1170115225 M * Bertl in the first case, the mnt is cleared out of the nsproxy
1170115271 M * Bertl in the second case the nsproxy is disposed before the exit_notify
1170115291 M * daniel_hozac yeah, i see your point.
1170115294 M * Bertl or what am I missing here?
1170115375 M * daniel_hozac AFAICT, those cases are certainly possible...
1170115378 M * Bertl I don't see any locks or semaphores keeping that from happening
1170115410 M * Bertl so, IMHO, with the current concept of nsproxy potentially being shared
1170115445 M * Bertl we _must_not_ manipulate an nsproxy at any time (except at creation and destruction time) 
1170115464 M * Bertl i.e. before the first reference is taken and after the last reference is dropped
1170115481 M * Bertl (note: this is true for the current nfs patch)
1170115534 M * Bertl but, what the patch doesn't handle is that we also want to keep the mnt from doing evil stuff to the tasks
1170115559 M * Bertl (by getting released and hitting already partially tasks)
1170115564 M * Bertl +disposed
1170115639 M * Bertl my question now is, could we postpone the mnt disposal until all tasks are really gone?
1170115654 M * Bertl and more important, would that help?
1170115658 M * daniel_hozac how would we accomplish that? a kernel thread?
1170115687 M * Bertl we could simply keep a reference to the mnt
1170115709 M * daniel_hozac yeah, but what would cause it to finally be freed?
1170115709 M * Bertl wouldn't hurt to take one for each task, and drop it very late
1170115737 M * Bertl i.e. after tsk->state = TASK_DEAD 
1170115755 M * daniel_hozac the problem with the NFS code is that it's locking current->sighand, so current would have to refer to another process...
1170115804 M * Bertl okay, different approach, could we fix the nfs code?
1170115823 M * Bertl i.e. make the lock depend on sighand actually being there?
1170115832 M * daniel_hozac that was my initial thought too.
1170115854 M * Bertl my issues with the current patch are basically those:
1170115876 M * Bertl - the code is extremely hard to read/understand (at least for me)
1170115879 M * daniel_hozac TBH, i don't understand why the NFS code takes the lock.
1170115899 M * Bertl - it doesn't work for OUR refcounting at all
1170115911 M * Bertl - it doesn't fix the issue for cornercases :)
1170115945 M * Bertl okay, could you point me to the lock/code?
1170115955 M * daniel_hozac fs/lockd/svc.c:lockd_down
1170115985 Q * duckx Quit: Client exiting
1170116023 M * daniel_hozac (Trond Myklebust seemed to suggest NFS does this in more places though)
1170116044 M * Bertl and the issue happens around the recalc_sigpending(), right?
1170116066 M * daniel_hozac right.
1170116158 M * Bertl was there an explanation why this locking is required?
1170116171 M * Bertl (or why it does the sigpending stuff at all?)
1170116204 M * daniel_hozac not that i saw.
1170116249 M * daniel_hozac i don't understand what signal it's expecting current to get.
1170116262 M * Bertl okay, I think I'm going to write an answer, and we can then think/discuss a solution with the involved parties
1170116290 M * Bertl IMHO this is some kind of notification between userspace and kernel thread or between kernel and thread
1170116323 M * Bertl this is definitely older than the kthread api which has other mechanisms in place
1170116339 M * daniel_hozac yeah
1170116350 M * Bertl do you agree that the second case is not handled by the patch?
1170116361 M * Bertl and that it might cause the same issues?
1170116369 M * daniel_hozac definitely. that would cause an Oops, IMHO.
1170116391 M * Bertl okay, I guess that isn't really easy to trigger though ...
1170116426 M * daniel_hozac yeah, i guess that would be the hard part.
1170116454 M * Bertl k, np
1170117098 M * PowerKe Bertl: Did you find anything in that tracefile I made?
1170117375 M * Bertl PowerKe: good question, let me ask a simple one, which tracefile? :)
1170117394 M * Bertl daniel_hozac: actually, I think we are wrong, the B case should be fine
1170117424 M * PowerKe the permission denied problem on my UI files
1170117425 M * daniel_hozac hmm, how so?
1170117430 M * Bertl daniel_hozac: the important part is to have a current->sighand when disposing the mnt
1170117461 M * PowerKe 2.6.17-vs2.0.2.1-gentoo, jfs
1170117471 M * daniel_hozac i was under the impression we needed the pid namespace for exit_notify, from some of ebiederm's comments.
1170117479 M * Bertl PowerKe: please give me a short reminder and point me to the trace
1170117530 M * daniel_hozac PowerKe: as i told you, that's expected. the file is immutable.
1170117566 M * PowerKe odd, it doesn't do that on my other vserver
1170117646 M * Bertl there shouldn't be any inconsistencies, that's right
1170117655 M * Bertl do you have the trace(s) at hand?
1170117670 M * PowerKe http://powerpnt.dyndns.org:8080/vserver/
1170117686 M * daniel_hozac PowerKe: same kernel? files are UI?
1170117691 M * PowerKe older kernel
1170117704 M * Bertl ah, the utimes() issue
1170117708 M * PowerKe 2.6.14-vs2.0.1-gentoo
1170117751 M * Bertl I remember that we had a few versions which didn't check the permissions properly
1170117772 M * Bertl so I suspect the older gentoo kernel is just doing the wrong thing
1170117772 M * PowerKe So what I'm seeing on the older kernel would be the wrong behaviour
1170117774 M * daniel_hozac immutable is a vanilla flag though, so that should be checked everywhere.
1170117791 M * Bertl not necessarily for jfs
1170117810 M * daniel_hozac ah right... JFS only grew that support in 2.6.17, right?
1170117819 M * daniel_hozac (it's missing from 2.6.16)
1170117822 M * Bertl PowerKe: let's do a few simple tests in parallel on both systems, shall we?
1170117902 M * PowerKe ok
1170117961 M * Bertl first one: touch xx; chattr +i xx; touch xx  (on the jfs partition)
1170118007 M * PowerKe nexus vservers # touch xx; chattr +i xx; touch xx
1170118007 M * PowerKe chattr: Inappropriate ioctl for device while reading flags on xx
1170118105 M * Bertl and on the other one?
1170118106 M * daniel_hozac on the 2.6.17 box?
1170118155 M * PowerKe laptoptom vservers # touch xx; chattr +i xx; touch xx
1170118156 M * PowerKe touch: cannot touch `xx': Permission denied
1170118180 M * Bertl okay, so we now know that the first one is missing generic xattr support
1170118207 M * Bertl IIRC, I implemented it around 2.6.13 or so and it was included in mainline around 2.6.16
1170118224 M * PowerKe So I should consider the new behaviour as being correct
1170118234 M * Bertl yes, definitely for 2.0x
1170118252 M * Bertl with the 2.2 branch, you can enable the CoW link breaking
1170118266 M * Bertl in which case the utimes change 'should' break the link
1170118367 M * PowerKe What I did was unify 2 guests, but since they both have different mtimes, every file gets 2 hardlinks. One to the hash directory, one in the guest.
1170118369 M * Bertl IIRC, the reason you did that was to sync the times, yes?
1170118401 M * Bertl daniel_hozac: would it be hard to extend the vhashify/unify by an option which allows to completely ignore the times?
1170118402 M * PowerKe What I used to do was then update the mtimes so they match and run vhashify again and delete the copies with 1 hardlink in the hashing folder
1170118413 M * PowerKe Bertl: He already made that one
1170118413 M * daniel_hozac Bertl: already done :)
1170118430 M * Bertl excellent, so that should be a nonissue then? good work!
1170118457 M * PowerKe daniel_hozac: which mtime is taken in that case? Is there a rule or is it random?
1170118476 M * daniel_hozac the first one to put the link in the hash directory.
1170118524 M * Bertl what if the hash directory already contains that entry?
1170118557 M * daniel_hozac as the mtime is part of the hash, that's rather unlikely.
1170118559 M * Bertl i.e. real world example:
1170118575 M * Bertl unify guests A and B
1170118592 M * Bertl then run touch on the hash
1170118608 M * Bertl (with 2.2 this will break the links there, right?)
1170118611 M * daniel_hozac that should break the link.
1170118627 M * Bertl okay, now I re-unify A and B, what happens?
1170118652 M * Bertl do I get new hash files? is the hash now twice as large?
1170118669 M * Bertl or will they be 'just' overwritten?
1170118722 M * daniel_hozac by default, already unified files are just skipped.
1170118735 M * PowerKe if the mtime is part of the hash, then how do you match them?
1170118752 M * daniel_hozac the mtime isn't part of the hash if --ignore-mtime is specified.
1170118759 M * Bertl something completely different: what does your nfs test setup look like (right now) to trigger the oops?
1170118920 M * daniel_hozac after boot, just service portmap start; ./test-nfs mount -t nfs stagioni:/pub /mnt
1170118948 M * daniel_hozac (where test-nfs just does unshare(CLONE_NEWNS); execvp(argv[1], argv+1);)
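The source of test-nfs is not shown in the log. A minimal sketch of a helper matching the description above (unshare the mount namespace, then exec the remaining arguments) might look like the following; the function name exec_in_new_mount_ns and its error handling are assumptions made for illustration.

```c
#define _GNU_SOURCE
#include <assert.h>
#include <errno.h>
#include <sched.h>
#include <stdio.h>
#include <unistd.h>

/*
 * Hypothetical helper: detach from the parent's mount namespace, then
 * replace the current process with cmd[0], passing cmd as its argv.
 * Returns only on failure (-1, with errno set).
 */
int exec_in_new_mount_ns(char *const cmd[])
{
    assert(cmd != NULL);           /* passing no vector at all is a caller bug */

    if (cmd[0] == NULL) {          /* empty command line: nothing to exec */
        errno = EINVAL;
        return -1;
    }
    if (unshare(CLONE_NEWNS) == -1) {
        perror("unshare");         /* typically EPERM without CAP_SYS_ADMIN */
        return -1;
    }
    execvp(cmd[0], cmd);
    perror("execvp");              /* reached only if the exec itself failed */
    return -1;
}
```

A main() that simply returns exec_in_new_mount_ns(argv + 1) != 0 would give the ./test-nfs mount -t nfs ... invocation quoted above. It has to run as root, since unshare(CLONE_NEWNS) requires CAP_SYS_ADMIN.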
1170118958 M * Bertl okay, excellent
1170118996 M * Bertl what if we add the following before exit_notify()
1170119062 M * PowerKe To be able to change a file with UI flags from within the guest, I need to enable the CoW link breaking kernel flag?
1170119091 M * Bertl yes, but you can unlink it (remove/delete)
1170119144 Q * Piet Remote host closed the connection
1170119144 M * Bertl daniel_hozac: sleep(tsk->pid % 3);
1170119177 M * Bertl if you now spawn a second process inside that namespace
1170119180 J * Piet hiddenserv@tor.noreply.org
1170119205 M * Bertl that should have a good chance to eventually hit case B, no?
1170119216 M * daniel_hozac there are two processes created already, mount and mount.nfs.
1170119226 M * Bertl even better :)
1170119264 M * Bertl couldn't hurt to add a printk() there too
1170119476 M * daniel_hozac so which sleep do we want?
1170119504 M * Bertl well, A should sleep longer than B
1170119524 M * Bertl and A should actually start dying earlier
1170119556 M * Bertl that's why I think the % 3 might be a good idea
1170119569 M * Bertl you will get 'pseudo random' 0,1,2 sec sleeps
1170120240 M * daniel_hozac Hollow, phreak``, Gentoo users: if some of you could test http://people.linux-vserver.org/~dhozac/p/uv/experimental/util-vserver-0.30.213-rc1.tar.bz2, especially the initscripts and their installation, it would be very much appreciated.
1170120484 M * daniel_hozac Bertl: did you see Hollow's bug, btw? http://paste.linux-vserver.org/986
1170120499 M * Bertl nope, not yet
1170120555 M * Bertl a context gets disposed with claims?
1170120562 M * daniel_hozac looks like it.
1170120685 M * Bertl is it reproducible?
1170120790 M * daniel_hozac i think so.
1170120817 M * Bertl should be easy to fix then, we just need the proper debug run
1170120836 M * daniel_hozac yeah.
1170120849 M * daniel_hozac (damn, booting the modified kernel is taking forever...)
1170120873 M * Bertl heh, quite some processes on startup, eh?
1170120881 M * daniel_hozac yeah, apparently.
1170120892 M * Bertl btw, doing usleep() should be enough
1170120901 M * Bertl I know, too late :)
1170120907 M * daniel_hozac hehe.
1170120917 M * daniel_hozac might be worth it...
1170120924 M * daniel_hozac it still hasn't brought up my interfaces.
1170121001 M * daniel_hozac hmm, though, the processes need to die at the same time, right?
1170121031 M * Bertl Hollow: please recreate with debugging enabled, xid=1,2,3,4,6 switch=0,1 misc=4,5
1170121066 M * daniel_hozac for that i guess we need another test-case, as i believe mount is waiting on mount.nfs to die.
1170121105 M * Bertl hmm, could work nevertheless, as the 'wait' happens in the notify, no?
1170121125 M * Bertl so basically adding the sleep before and after that, would help
1170121140 M * Bertl nah, probably now :)
1170121144 M * Bertl *not
1170121159 M * Bertl anyway, adding another process should not be too hard
1170121210 M * daniel_hozac indeed, unshare(CLONE_NEWNS); fork(); exit(0); should work, no?
1170121222 M * Bertl yes
1170121253 M * daniel_hozac i have to admit that i don't see where in exit_notify we need the pid namespace though...
1170121267 M * daniel_hozac at least, not the very small part of it that has been merged into mainline now.
1170121585 M * daniel_hozac i just get the lockd_down oops now.
1170121598 M * Bertl with the patch?
1170121605 M * daniel_hozac no, i removed the patch.
1170121612 M * daniel_hozac ... that wasn't very clever, was it.
1170121624 M * Bertl hmm, weeeell ...
1170121656 M * Bertl we have now verified that it still oopses :)
1170121694 M * daniel_hozac sorry, i don't know what i was thinking... i'm starting to get a bit tired, so i'll blame it on that :)
1170121714 M * Bertl hey, np, just happens ...
1170121745 M * Bertl just think how many other things could have gone wrong, and you got them all right :)
1170121807 M * daniel_hozac lol
1170121840 M * daniel_hozac seems you're not the only one with issues with the patch.
1170121893 M * Bertl ah?
1170122080 M * daniel_hozac see Oleg Nesterov's email.
1170122126 M * daniel_hozac seems like a legit concern as well.
1170122159 M * Bertl indeed
1170123001 M * daniel_hozac i can't seem to break it.
1170123014 M * Bertl that's what I assumed ...
1170123039 M * Bertl current will always have a sighand with that patch
1170123053 M * daniel_hozac yeah
1170123054 M * Bertl so we cannot argue that it actually breaks in certain cases
1170123091 M * daniel_hozac well, the second case could certainly break once the full pid namespace is in.
1170123117 M * daniel_hozac (i assume that's when it'll be required for exit_notify)
1170123133 M * Bertl yes, I will document it on lkml and state my 'issues' with the patch itself
1170123145 M * Bertl but I guess it should be fine for now
1170123177 M * Bertl in addition, I'm going to add some accounting for the spaces to 2.2+
1170123191 M * Bertl i.e. mnt, uts, ipc and proxy
1170123193 M * daniel_hozac what kind of accounting?
1170123204 M * Bertl so that we get an idea if we lose one or the other :)
1170123213 M * daniel_hozac ah, hehe.
1170123429 M * daniel_hozac well, i think i'm gonna get some sleep now... good night!
1170123442 M * Bertl have a good one, and thanks for the help!
1170125110 J * ensc ~irc-ensc@p54B4EC6D.dip.t-dialin.net
1170126833 Q * anonc Ping timeout: 480 seconds
1170127021 Q * Adrinael Read error: Connection reset by peer
1170127023 J * Adrinael adrinael@st12-127.tky.hut.fi
1170127351 Q * Piet Quit: Piet
1170127736 Q * Roey Ping timeout: 480 seconds
1170127758 J * Roey ~katz@h-69-3-4-130.mclnva23.covad.net
1170131353 Q * lylix Read error: Connection reset by peer
1170133380 J * stefani ~stefani@c-24-19-46-211.hsd1.wa.comcast.net
1170134670 M * Bertl wb stefani!
1170134728 M * stefani hola
1170134887 M * Bertl how's it going?
1170134967 M * stefani slooowwww
1170135009 M * Bertl you must be using something else than Linux-VServer :)
1170135154 M * stefani at the moment, i'm not focussing on vserver much .   the ones i have just work... 
1170135189 M * Bertl that's the way it should be ... anyway, not going to bother you any longer ...
1170135284 Q * nou Ping timeout: 480 seconds
1170135285 M * stefani np.
1170137076 M * Bertl daniel_hozac: just stumbled over the util-vserver clone implementation; my manpage for clone says that since Linux 2.5.49, the syscall has five instead of two arguments, which I can confirm for 2.6.19 :)
1170138793 Q * Hsilamot Ping timeout: 480 seconds
1170139200 P * stefani parting (is such sweet sorrow)
1170139541 Q * tanteauguri Quit: using sirc version 2.211+KSIRC/1.3.12
1170140588 J * gab ~gab@158.36.45.236
1170141637 J * greedy ~jesper@221.217.18.62
1170141850 Q * dlezcano Quit: Leaving
1170144467 Q * gab Remote host closed the connection
1170144815 M * Hollow morning
1170144845 M * Bertl good morning ...
1170144875 M * Bertl Hollow: do you use sys_clone()?
1170144887 M * Hollow in libvserver?
1170144892 M * Bertl yes, and vcd
1170144896 M * Hollow sure i do
1170144906 M * Bertl how many arguments do you pass?
1170144932 M * Hollow flags and child_stack
1170144935 M * Hollow i.e. 2
1170144947 M * Bertl interesting, because it takes 5 :)
1170144971 M * Bertl note: I'm talking about sys_clone() not clone()
1170144982 M * Hollow it always worked with syscall3(), also the clone manpage lists it
1170144999 M * Hollow syscall2 of course
1170145002 M * Bertl see my note above ...
1170145039 M * Hollow ?
1170145046 M * Hollow don't have a backlog here
1170145057 M * Bertl my manpage for clone says that since Linux 2.5.49, the syscall has five instead of two arguments, which I can confirm for 2.6.19 :)
1170145091 M * Bertl so the 2 argument version is only true for 2.4 kernels
1170145092 M * Hollow well.. i can change it if you want, but i never had problems with the older interface
1170145102 M * Hollow and i don't need the other arguments
1170145111 M * Bertl well, the other arguments are random values in a syscall2
1170145137 M * Bertl so they might as well give funny results, by overwriting your program
1170145150 M * Bertl (or segfaulting in the syscall, etc ...)
1170145194 M * Hollow ok.. can i just pass NULL for the other 3?
1170145204 M * Bertl haven't verified that yet
1170145219 M * Bertl but I would assume so, at least for two of them
1170145281 M * Hollow btw, util-vserver uses the syscall2 version too
1170145323 J * zhou_rock ~zhou@202.170.218.253
1170145370 M * Hollow daniel_hozac: you around?
1170145613 M * Bertl one possible explanation (unverified) would be that the 5 arg syscall is somehow backward compatible
1170145619 M * Bertl welcome zhou_rock!
1170145635 M * zhou_rock thanks
1170145714 M * zhou_rock glad to meet you
1170145778 M * Bertl the pleasure is mine, what's up?
1170145854 M * Hollow Bertl: well, i use your shiny syscall, should be easy to verify, no?
1170145911 M * Bertl passing -1 for all three arguments could give a clue
1170145959 J * gab ~gab@158.36.45.236
1170145979 M * Bertl welcome gab!
1170145988 M * Hollow i.e. using syscall5 with -1 for the 3 new arguments?
1170146023 M * gab good morning Bertl
1170146031 M * Bertl Hollow: yep, that should not be handled specially, but OTOH, it should be the worst case scenario
1170146052 M * Hollow ok, will try that when back at home
1170146077 M * Bertl okay, will take a look at the implementation in the evening ...
1170146099 M * Hollow Bertl: btw, did you see http://paste.linux-vserver.org/986 ?
1170146120 M * Bertl yes, I wrote an answer too, sec, let me copy that one too
1170146129 M * Hollow happened during startup when the helper segfaulted... so it is "fixed" now
1170146178 M * Bertl 02:37 < Bertl> Hollow: please recreate with debugging enabled, xid=1,2,3,4,6  switch=0,1 misc=4,5
1170146200 M * Hollow ok, i have to make the helper segfault again then, but will take a look ;)
1170146224 M * Bertl if that is really recreateable, it will be interesting
1170146246 M * Bertl it seems that a context is disposed while still claimed
1170146308 M * Hollow yeah.. it wasn't even the helper that segfaulted actually... the helper called vcd to do the startup work, vcd segfaulted during mount, but the vshelper still existed in some weird blocked select()
1170146349 M * Bertl so I'm definitely interested if you can trigger that on recent kernels
1170146354 M * Hollow so the khelper thread was still active which is why the context was claimed probably
1170146361 M * Hollow it was on 2.2.0-rc8.7
1170146368 M * Hollow but i will try to recreate that
1170146385 M * Bertl okay, tx
1170146497 J * ntrs ~ntrs@68-188-55-120.dhcp.stls.mo.charter.com
1170146602 P * zhou_rock 
1170146689 M * Bertl okay, off to bed now ... have a good one everyone! cya!
1170146694 N * Bertl Bertl_zZ
1170146695 M * Hollow night!
1170146876 M * Hollow off again too
1170146878 Q * ntrs_ Ping timeout: 480 seconds
1170146998 J * dna ~naucki@24-193-dsl.kielnet.net
1170147038 M * matti :)
1170147990 P * click [IRSSI]
1170149778 Q * m`m`h Ping timeout: 480 seconds
1170150357 J * click click@ti511110a080-5475.bb.online.no
1170151489 J * m`m`h ~simba@deb30.mgts.by
1170151928 Q * marcfiu Ping timeout: 480 seconds
1170151988 J * DavidS ~david@vpn.uni-ak.ac.at
1170152268 J * meandtheshel1 ~markus@85-124-206-1.dynamic.xdsl-line.inode.at
1170153540 J * marcfi1 ~mef@c-68-39-177-97.hsd1.nj.comcast.net
1170153587 P * marcfi1 
1170154040 Q * greedy Quit: using sirc version 2.211+KSIRC/1.3.12
1170156757 Q * shedi Quit: Leaving
1170159269 Q * Aiken Quit: Leaving
1170159528 Q * trippeh Quit: fikse litt charset-tull
1170159873 Q * cdrx Ping timeout: 480 seconds
1170160627 J * me ~me@p548AA222.dip0.t-ipconnect.de
1170160643 M * me good morning, I've got a kernel oops on restarting a vserver: http://nopaste.linux-dev.org/?1089
1170160665 M * me Could anyone maybe tell me something about it? The root and the vserver are working (without a restart)
1170160681 M * daniel_hozac you're using the broken Debian vserver kernel.
1170160695 M * daniel_hozac you should upgrade to 2.6.18-4.
1170160974 M * me okay, the one from sid, thanks
1170161300 J * Wajat MasteR@77-105-56-172.adsl-1.sezampro.yu
1170161961 Q * me Remote host closed the connection
1170161991 Q * transacid Ping timeout: 480 seconds
1170161999 M * daniel_hozac Bertl_zZ, Hollow: AFAICT the other three arguments are dependent on CLONE_PARENT_SETTID, CLONE_CHILD_SETTID, and CLONE_SETTLS, respectively.
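To illustrate the five-argument form discussed above, here is a minimal sketch of a fork()-like raw clone call that passes all five arguments explicitly. Because none of CLONE_PARENT_SETTID, CLONE_CHILD_SETTID, or CLONE_SETTLS is set, the last three are given as NULL. The function names are made up for this example, and the order of the raw arguments is architecture-dependent (see clone(2)); with all three extra arguments NULL that ordering does not matter on x86.

```c
#define _GNU_SOURCE
#include <assert.h>
#include <signal.h>
#include <stddef.h>
#include <sys/syscall.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Hypothetical wrapper: raw clone behaving like fork(). A NULL stack
 * means the child continues on a copy of the parent's stack. */
pid_t raw_clone_fork(void)
{
    return (pid_t)syscall(SYS_clone,
                          (unsigned long)SIGCHLD, /* flags: exit signal only */
                          NULL,                   /* child stack */
                          NULL, NULL, NULL);      /* unused without the SETTID/SETTLS flags */
}

/* Spawn a child that exits with `code`; return the status the parent sees,
 * or -1 if the clone or the wait failed. */
int clone_child_exit_status(int code)
{
    assert(code >= 0 && code <= 255);

    pid_t pid = raw_clone_fork();
    if (pid == -1)
        return -1;
    if (pid == 0)
        _exit(code);               /* child: leave without flushing stdio */

    int status;
    if (waitpid(pid, &status, 0) != pid)
        return -1;
    return WIFEXITED(status) ? WEXITSTATUS(status) : -1;
}
```

Filling in the trailing arguments this way avoids handing the kernel whatever happens to be in the unused registers, which is the concern raised about the two-argument call.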
1170162571 J * transacid ~transacid@transacid.de
1170162614 J * cdrx ~legoater@blueice4n2.uk.ibm.com
1170163884 Q * Wajat 
1170166325 J * Piet hiddenserv@tor.noreply.org
1170166679 M * Guy- if I used hashify, and then deleted some hashified files from a vserver so that they are now only in .hash, how do I get rid of them?
1170166686 M * Guy- find -type f -links 1 | xargs rm?
1170166698 M * Guy- or is there a Better Way (tm)?
1170166852 J * lilalinux ~plasma@dslb-084-058-203-076.pools.arcor-ip.net
1170166916 M * Guy- also, I have some hashified files with more links than vservers, which leads me to believe that hashify does in fact not skip files that have link counts greater than one, which is bad
1170168712 M * daniel_hozac find is pretty much the only way to clean up.
1170169765 P * DavidS 
1170169766 Q * Piet Remote host closed the connection
1170169860 J * Piet hiddenserv@tor.noreply.org
1170170134 Q * gab Quit: Leaving
1170170268 M * Hollow daniel_hozac: the execve error was so dumb .. i have allocated argv using malloc(argc + 1) not malloc((argc + 1) * sizeof(char *))
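The allocation bug described above is easy to reproduce in isolation. A minimal sketch of the corrected form, with copy_argv as a hypothetical name (the real code in question is not shown in the log):

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/*
 * Hypothetical helper: build a NULL-terminated copy of an argument vector.
 * Only the pointers are copied, not the strings they point to.
 */
char **copy_argv(int argc, char *const argv[])
{
    assert(argc >= 0);

    /* malloc(argc + 1) would reserve argc + 1 BYTES; the vector needs
     * argc + 1 POINTERS, so scale by the element size. */
    char **copy = malloc(((size_t)argc + 1) * sizeof *copy);
    if (copy == NULL)
        return NULL;

    for (int i = 0; i < argc; i++)
        copy[i] = argv[i];
    copy[argc] = NULL;             /* execve() expects the terminator */
    return copy;
}
```

Writing the size as sizeof *copy rather than sizeof(char *) keeps the expression correct even if the element type changes later.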
1170170284 M * daniel_hozac ah, hehehe.
1170170353 M * Hollow daniel_hozac: it was clearly not the same bug as we experienced before where gcc4.1.1 seemed to f*ck up the envp in execve
1170170395 M * daniel_hozac apparently not :)
1170170422 J * shuri ~shuri@hq01.electronicbox.net
1170170460 M * Hollow daniel_hozac: btw, do you know if there is any arch where pagesize < 1024?
1170170603 M * daniel_hozac it seems the smallest page size on Linux is 4096.
1170170612 J * ybanafa ybanafa@82.114.188.218
1170170734 M * daniel_hozac (judging by a quick grep 'define[[:space:]]*PAGE_SIZE' include/asm-*/page.h, anyway)
1170170746 M * ybanafa Hi All .. my eth0 connect to my isp dhcp .. what is the best way to build vps (I did not see any doc about dhcp-host & static-guest)
1170170819 M * daniel_hozac ybanafa: assign the guests private IP addresses, and use NAT?
1170170919 M * ybanafa daniel_hozac : Thanx .. is there doc/link about that ?
1170170960 J * Sagitrio ~S@213.87.72.70
1170170969 M * daniel_hozac http://iptables.org/documentation/HOWTO//NAT-HOWTO.html maybe.
1170170991 M * ybanafa daniel_hozac : Thanx ++
1170171004 M * Sagitrio [unreadable: text lost to character-encoding damage] +)
1170171010 M * Sagitrio [unreadable: text lost to character-encoding damage] ?
1170171162 Q * Sagitrio 
1170171186 Q * ybanafa Quit: using sirc version 2.211+KSIRC/1.3.11
1170171297 J * hallyn ~xa@cpe-72-179-43-119.austin.res.rr.com
1170171389 M * daniel_hozac hey hallyn.
1170172837 M * daniel_hozac hallyn: what tree is your latest patch against?
1170173018 J * FireEgl Proteus@2001:5c0:84dc:1:211:9ff:feca:b042
1170173433 N * Bertl_zZ Bertl
1170173436 M * Bertl morning folks!
1170173438 M * daniel_hozac morning Bertl!
1170174151 T * * http://linux-vserver.org/ | latest stable 2.0.2.1, 2.0.3-rc1, 2.2.0-rc8.7, devel 2.1.1.7.1, 2.3.0.8, stable+grsec 2.0.2.1, 2.2.0-rc8, devel+grsec 2.1.1 | util-vserver-0.30.212 | libvserver-1.0.2 & vserver-utils-1.0.3 | He who asks a question is a fool for a minute; he who doesn't ask is a fool for a lifetime -- share the gained knowledge on the Wiki, and we'll forget about the minute ;)
1170174151 T * Bertl -
1170174160 M * ciphergoth at shutdown I get this, why? "cat: /proc/cmdline: No such file or directory"
1170174166 M * ciphergoth and "mount: permission denied"
1170174391 Q * lilalinux Remote host closed the connection
1170174446 Q * tokkee cation.oftc.net galapagos.oftc.net
1170174464 J * yarihm ~yarihm@vpn-global-dhcp2-26.ethz.ch
1170174580 J * tokkee tokkee@casella.verplant.org
1170174599 Q * ciphergoth Quit: Client exiting
1170174603 J * lilalinux ~plasma@dslb-084-058-203-076.pools.arcor-ip.net
1170174771 J * Greek0_ ~greek0@85.255.145.201
1170174772 Q * Greek0 Read error: Connection reset by peer
1170175191 Q * Johnnie Remote host closed the connection
1170175224 J * Johnnie ~jdlewis@jdlewis.org
1170175266 M * Bertl daniel_hozac: so syscall2 should be fine even on 2.6 (without the special flags)?
1170175565 M * daniel_hozac i think so.
1170176101 M * Bertl okay, good then ...
1170176113 M * Bertl off for now .. will be back in the evening ... have fun!
1170176119 N * Bertl Bertl_oO
1170176168 J * stefani ~stefani@tsipoor.banerian.org
1170176791 J * bonbons ~bonbons@83.222.37.103
1170177333 Q * FaUl Ping timeout: 480 seconds
1170179578 J * Pazzo ~ugelt@dialin-225136.rol.raiffeisen.net
1170179688 M * Pazzo hi all! running nfs-user-server within a vserver is no problem, but right now I'm trying to let a vserver mount a nfs share. could someone give me a hint on how to accomplish this?
1170179706 M * daniel_hozac is it absolutely necessary to have the guest mount it?
1170179720 M * daniel_hozac however, with the secure_mount and binary_mount ccapabilities, that should work fine.
1170179724 M * Pazzo hi daniel_hozac!
1170179731 M * daniel_hozac hey
1170179753 M * Pazzo I added secure_mount, binary_mount and secure_remount to ccapabilities
1170179763 M * Pazzo -> stale nfs file handle
1170179802 M * Pazzo unfs3 is running within the same vserver, shares there can successfully be mounted
1170179963 M * daniel_hozac so you can mount things from the local server, just not a remote one?
1170179981 M * Pazzo hmm... let me explain:
1170180012 M * Pazzo vhost1 and vhost2 are the "real" servers
1170180019 M * Pazzo running vserver1 and vserver2
1170180077 M * Pazzo atm vserver1 is still a single server, needs to be moved
1170180099 M * Pazzo vserver2 has been a "real" server, but is right now already a vserver
1170180121 M * Pazzo both servers have been using nfs shares (somehow "cross-mounting" things)
1170180152 M * Pazzo now I need to accomplish the same job within vservers
1170180190 M * daniel_hozac is there any particular reason you can't use the fstab file?
1170180218 M * Pazzo so I installed unfs3 on vserver2, works fine and can be mounted from vserver1
1170180276 M * Pazzo daniel_hozac: hmmm... no.
1170180286 M * Pazzo I have never been doing so
1170180314 M * Pazzo I probably need portmapper on host-side to do so, is this correct?
1170180337 M * daniel_hozac i guess.
1170180350 M * daniel_hozac but IME, you need portmapper on the host even for guest mounts.
1170180356 M * Pazzo aaaah!
1170180364 M * Pazzo ? hmmm
1170180405 M * Pazzo guest mounts are using the host's portmapper?
1170180432 M * daniel_hozac lockd is, at least.
1170180453 M * Pazzo ok
1170180465 M * daniel_hozac (hmm, that's another problem with doing the mount in a guest. it'll want to spawn a kernel thread, and that's blocked...)
1170180488 M * Pazzo are guest mounts to be considered "harmful"?
1170180493 Q * cdrx Quit: Leaving
1170180517 J * cdrx ~legoater@blueice3n1.uk.ibm.com
1170180530 M * Pazzo btw: my "co-worker" killed the host 5 minutes ago...
1170180549 M * daniel_hozac depends on what you're referring to, i suppose.
1170180567 M * Pazzo ...is this a "to-be-expected-feature" when trying to do nfs-mounts from inside a vserver?
1170180599 M * daniel_hozac well, right now, being able to do the mount in the first place needs some luck :)
1170180609 J * duckx ~Duck@tox.dyndns.org
1170180646 M * Pazzo daniel_hozac: ok, thanks a lot for your assistance!
1170180665 M * Pazzo so I'll NEVER again use guest-mounts
1170180672 M * daniel_hozac using the guest's fstab.remote file should avoid all of the problems.
1170180676 M * Pazzo install portmapper on the host
1170180732 M * Pazzo and use... fstab.remote :-)
1170180765 M * Pazzo thnx a lot daniel, I have to leave now as it really seems that we shoot this host in the head :o)
1170180773 N * Pazzo PazzoOoo
1170180795 M * daniel_hozac heh.
1170180803 M * daniel_hozac you're welcome!
1170180877 J * FaUl immo@shell.chaostreff-dortmund.de
1170180993 Q * yarihm Ping timeout: 480 seconds
1170181467 Q * cdrx Read error: Connection reset by peer
1170181952 Q * bronson Read error: Connection reset by peer
1170184013 J * cdrx ~legoater@cap31-3-82-227-199-249.fbx.proxad.net
1170184029 J * abackos ~irc@ip70-190-32-70.ph.ph.cox.net
1170184354 Q * lilalinux Remote host closed the connection
1170184754 Q * abackos 
1170184755 Q * ntrs Remote host closed the connection
1170185042 J * gerrit ~gerrit@bi01p1.co.us.ibm.com
1170186205 Q * FireEgl Remote host closed the connection
1170186226 N * PazzoOoo Pazzo
1170186246 M * Pazzo re
1170186304 M * Pazzo daniel_hozac: works like a charm! I'll never touch guest-mounts again :-p
1170186498 J * marcfiu ~mef@aegis.CS.Princeton.EDU
1170186846 M * Hollow Bertl_oO, daniel_hozac: guess i've found another bug.. empty (persistent) contexts have a task count of 1
1170186863 M * daniel_hozac vx_tasks? that's expected.
1170186875 M * daniel_hozac it's so we don't free the context.
1170186886 M * Hollow ok ..
1170186888 M * daniel_hozac you should check the rlimit_stat of NPROC.
1170186904 M * Hollow ah, good idea
1170186948 Q * Pazzo Quit: ...
1170187051 M * sid3windr is there any chance vserver messes with usb storage?
1170187073 M * daniel_hozac highly unlikely. why?
1170187078 M * sid3windr that's what I thought
1170187091 M * sid3windr i've got a problem with a usb storage device and googling stuff there was one guy on fedora who had the same and it started to work when he disabled selinux
1170187102 M * sid3windr (or so he says)
1170187110 M * daniel_hozac heh.
1170187117 M * daniel_hozac what kind of problem?
1170187642 M * sid3windr usb 1-3: device not accepting address 4, error -62
1170187653 M * sid3windr it's a very long shot to think vserver has anything to do with it though :)
1170187698 M * daniel_hozac hmm -ETIME (Timer expired)? sounds more like a problem with the device, IMHO.
1170187953 M * Hollow daniel_hozac: works :) http://paste.linux-vserver.org/994
1170187973 M * Hollow without /proc parsing :)
1170187997 M * Hollow (except to get the running xids)
1170188014 M * daniel_hozac looks like the new vserver-stat ;)
1170188024 M * Hollow yep :)
1170188042 M * Hollow it's just the first time i see the new syscalls in action ;)
1170188077 M * daniel_hozac hehe.
1170188223 M * daniel_hozac how do you find out the number of CPUs?
1170188659 M * Hollow daniel_hozac: sysconf(_SC_NPROCESSORS_ONLN);
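A minimal sketch of the sysconf() call mentioned above, wrapped so that callers always get a usable count. The wrapper name online_cpu_count and the fallback to 1 are assumptions for this example; _SC_NPROCESSORS_ONLN itself is a widely available extension rather than part of POSIX.

```c
#include <assert.h>
#include <unistd.h>

/*
 * Hypothetical wrapper: number of CPUs currently online, never less than 1.
 * The value is a count, not the highest CPU index.
 */
long online_cpu_count(void)
{
    long n = sysconf(_SC_NPROCESSORS_ONLN);
    if (n < 1)
        n = 1;                     /* -1 (unsupported) or nonsense: assume one CPU */
    assert(n >= 1);
    return n;
}
```

Since the result is only a count, code that iterates per-CPU data by index should not assume the online CPUs are numbered contiguously from zero.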
1170188712 M * daniel_hozac ah, nice. i didn't even think to consult sysconf.
1170188789 M * daniel_hozac then i can get rid of the   // XXX: arbitrary CPU limit. for (cpu = 0; cpu < 1024; cpu++) {
1170188812 M * Hollow heh :) i stumbled over the sysconf value some time ago, but i had some hackish workaround as well ;)
1170188862 M * daniel_hozac is it possible to, with CPU hotplug, get CPU 0 and CPU 2 online, with _SC_NPROCESSORS_ONLN then returning 2?
1170188925 M * Hollow no idea ..
1170189149 M * daniel_hozac is sysconf a syscall?
1170189265 M * daniel_hozac hmm, it seems to parse /proc/stat.
1170189293 M * Hollow bah
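(Editor's sketch, not from the log.) The hotplug question above matters because a count of online CPUs says nothing about which ids are online: with CPU 0 and CPU 2 online, iterating 0..n-1 would skip CPU 2. A minimal sketch, assuming a kernel that exposes the online list in /sys/devices/system/cpu/online (newer than the kernels discussed here); the helper names are hypothetical.

```python
import os

def parse_cpu_list(text):
    """Expand a kernel CPU list such as "0,2-3" into [0, 2, 3]."""
    cpus = []
    for part in text.strip().split(","):
        if not part:
            continue
        if "-" in part:
            first, last = part.split("-")
            cpus.extend(range(int(first), int(last) + 1))
        else:
            cpus.append(int(part))
    return cpus

def online_cpus(path="/sys/devices/system/cpu/online"):
    """Return the ids of online CPUs (path is an assumption about the kernel)."""
    try:
        with open(path) as f:
            return parse_cpu_list(f.read())
    except OSError:
        # Fallback: only the count is known, so assume contiguous ids.
        # This is exactly the case CPU hotplug can break.
        return list(range(os.sysconf("SC_NPROCESSORS_ONLN")))
```

The fallback keeps the sysconf approach from the log but makes its contiguity assumption explicit.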
1170190605 Q * dna Ping timeout: 480 seconds
1170191017 Q * marcfiu Remote host closed the connection
1170191443 J * dna ~naucki@24-193-dsl.kielnet.net
1170191551 J * DreamerC_ ~dreamerc@125-225-98-45.dynamic.hinet.net
1170191650 J * Aiken ~james@ppp218-108.lns2.bne1.internode.on.net
1170191754 J * toom ~toom@coco.docisland.org
1170191760 M * toom hi
1170191777 M * daniel_hozac hello
1170191883 M * toom is there a known problem in sendfile syscall with 2.6.19.2-grsec2.1.10-vs2.2.0-rc8 ?
1170191914 M * daniel_hozac no.
1170191951 Q * DreamerC Ping timeout: 480 seconds
1170191989 M * toom my apache-2.2.3 doesn't work and fails in sendfile syscall (EINVAL)
1170192228 Q * bonbons Quit: Leaving
1170192388 M * daniel_hozac works fine here.
1170192405 M * daniel_hozac (although, i'm not using grsec)
1170192468 M * toom I use apache2 in the root server and it works fine
1170192507 M * toom when I chroot (without chcontext), it doesn't work
1170192557 M * toom I'll try to remove grsec
1170192614 M * daniel_hozac that makes it sound more like your apache is broken.
1170192640 M * daniel_hozac are your host and guest running the same distribution?
1170192651 M * toom no
1170192669 M * toom debian unstable (root) and debian etch (in vserver)
1170192711 M * daniel_hozac different versions of apache installed?
1170192743 M * Radiance using grsec here and apache 2.x just fine btw
1170192769 M * toom same versions
1170192780 M * Radiance but a .7.14 kernel and fitting patches
1170192886 M * daniel_hozac could you paste the strace surrounding the error?
1170192905 M * toom k
1170193119 M * toom http://code.bulix.org/x8dtls-31936
1170193224 M * daniel_hozac does your filesystem/block device/whatever is needed support mmap?
1170193247 M * toom ah !
1170193252 M * toom I use unionfs
1170193289 M * daniel_hozac well, that's external so i can't check the source.
1170193291 Q * gerrit Ping timeout: 480 seconds
1170193298 M * daniel_hozac (well i could, but i'm lazy :))
1170193298 J * yarihm ~yarihm@84-75-123-221.dclient.hispeed.ch
1170193305 M * toom ok :)
1170193306 M * daniel_hozac but i imagine that's why.
1170193314 M * toom I think so
1170193320 M * toom thank you
1170193322 M * daniel_hozac try getting your host's apache to serve from the same directory.
1170193356 M * toom yes, I'll do that, thx
1170193364 M * daniel_hozac np.
1170193438 J * gerrit ~gerrit@bi01p1.co.us.ibm.com
1170193647 P * toom 
1170193748 J * Piet_ hiddenserv@tor.noreply.org
1170194040 Q * DreamerC_ Quit: leaving
1170194060 J * DreamerC ~dreamerc@125-225-98-45.dynamic.hinet.net
1170194113 Q * Piet Ping timeout: 480 seconds
1170194162 J * chris_ ~chris@e179192098.adsl.alicedsl.de
1170194615 Q * chris_ Quit: leaving
1170195178 N * Bertl_oO Bertl
1170195183 M * Bertl evening folks!
1170195195 M * shuri hi Bertl 
1170195265 M * shuri hey Bertl i did not find any way to fix the segmentation faults with util-vserver on ubuntu
1170195297 M * shuri i removed ubuntu's dietlibc and compiled it from source, and i got the same
1170195482 M * Bertl okay, we should definitely look into that, got some time?
1170195537 M * shuri got 40 minutes
1170195582 M * Hollow is there a way to find out if a directory is a mount point without parsing /proc/mounts?
1170195602 M * shuri df ?
1170195611 M * Hollow without parsing df too
1170195632 M * Hollow some syscall would be appreciated :)
1170195753 M * Bertl yes, check the directory inodes
1170195773 M * Bertl if the st_dev values of the directory and its parent do not match, you have a mountpoint
1170195791 M * Bertl but careful, that does not address --bind or --move mounts
1170195801 M * Bertl which are basically undetectable atm
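(Editor's sketch, not from the log.) The inode check described above can be done with stat alone, without parsing /proc/mounts: compare the device id of the directory with that of its parent. A minimal sketch; the function name is hypothetical, and as noted in the discussion it cannot see a bind mount of a directory from the same filesystem, since both sides then report the same device.

```python
import os

def is_mount_point(path):
    """Return True if path appears to be a mount point.

    Compares st_dev of the directory with st_dev of its parent.
    Bind mounts within one filesystem are NOT detected.
    """
    path = os.path.realpath(path)
    parent = os.path.join(path, os.pardir)
    st = os.lstat(path)
    parent_st = os.lstat(parent)
    if st.st_dev != parent_st.st_dev:
        # Different device: something is mounted on this directory.
        return True
    # Same device and same inode: path and its parent are identical,
    # which only happens at the filesystem root ("/").
    return st.st_ino == parent_st.st_ino
```

Python's standard library offers os.path.ismount, which applies the same comparison; the sketch only spells out the logic.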
1170195819 M * Bertl shuri: okay, you get the segfault unconditionally?
1170195834 M * shuri vserver-stats
1170195839 M * Bertl shuri: what exactly does segfault?
1170195850 M * Hollow ah, good idea.. guess i can live with knowing normal mount points for now
1170195866 M * shuri vserver-stat 
1170195866 M * shuri Erreur de segmentation
1170195873 P * stefani I'm Parting (the water)
1170195873 M * shuri segmentation fault
1170195884 M * Bertl okay, first lets do export LC_ALL=C LANG=POSIX
1170195889 M * shuri same with vserver-info
1170195899 M * shuri lol
1170195900 M * shuri ok
1170195902 M * Hollow Bertl: btw, the new sched_info syscall works great: http://paste.linux-vserver.org/995 (used for sys/user time)
1170195904 M * Bertl do you get a coredump?
1170195912 M * shuri vserver-info 
1170195912 M * shuri Segmentation fault
1170195918 M * Bertl Hollow: cool!
1170195936 M * shuri i only get this output
1170195948 M * Bertl okay, so no core.xxxx file?
1170195967 Q * meandtheshel1 Quit: Leaving.
1170195981 M * shuri no
1170196024 M * Bertl 'ulimit -Hc' says?
1170196049 M * shuri ulimit -H
1170196049 M * shuri unlimited
1170196050 M * Bertl and 'ulimit -Sc'?
1170196052 M * shuri but right now
1170196066 M * shuri i am not on a kernel with vserver patch
1170196097 M * Bertl hmm, okay?
1170196116 M * Bertl you don't expect it to work there, do you? :)
1170196125 M * shuri nop
1170196138 M * shuri but the segmentation fault is not related with the kernel
1170196148 M * Bertl hum
1170196163 M * Bertl so that would mean it is ubuntu related, no?
1170196167 M * shuri yes
1170196185 M * Bertl okay, so a problem with the ubuntu toolchain then
1170196205 M * Bertl did you try to disable the alternative syscalls?
1170196222 M * Bertl btw, what arch is that? sparc or x86?
1170196244 M * shuri let me try
1170196245 M * shuri x86
1170196260 M * shuri ./configure --disable-alternative-syscalls
1170196275 M * Bertl sounds about right
1170196422 M * shuri got compile error
1170196429 M * shuri ps/lib_libvserver_a-syscall.Tpo"; exit 1; fi
1170196429 M * shuri In file included from lib/syscall.c:33:
1170196429 M * shuri lib/vserver-internal.h: In function 'vserver':
1170196436 M * shuri lib/vserver-internal.h:257: error: conflicting types for 'syscall'
1170196436 M * shuri /usr/lib/diet/include/sys/syscall.h:6: error: previous declaration of 'syscall' was here
1170196436 M * shuri make[2]: *** [lib/lib_libvserver_a-syscall.o] Error 1
1170196442 M * shuri make[2]: Leaving directory `/home/shuri/Desktop/util-vserver-0.30.212'
1170196442 M * shuri make[1]: *** [all-recursive] Error 1
1170196442 M * shuri make[1]: Leaving directory `/home/shuri/Desktop/util-vserver-0.30.212'
1170196442 M * shuri make: *** [all] Error 2
1170196489 M * Bertl okay, try to disable diet too
1170196763 M * shuri vserver-stat 
1170196764 M * shuri can not change context: migrate kernel feature missing and 'compat' API disabled: Function not implemente
1170196776 M * shuri sthe error is normal
1170196783 M * shuri this error is normal
1170196799 M * shuri but no more Segmentation fault
1170196805 M * Bertl okay, so that works for you
1170196825 M * Bertl let me guess, ubuntu uses a 4.0.2 gcc toolchain?
1170196891 J * pflanze ~chris@84-73-56-44.dclient.hispeed.ch
1170196892 M * shuri                        CC: gcc, gcc (GCC) 4.1.2 20070106 (prerelease) (Ubuntu 4.1.1-21ubuntu7)
1170196892 M * shuri                       CXX: g++, g++ (GCC) 4.1.2 20070106 (prerelease) (Ubuntu 4.1.1-21ubuntu7
1170196907 M * Bertl yeah, sorry, I meant 4.1.2 ...
1170196928 M * pflanze Hello
1170196936 M * Bertl okay, np, I'm going to rewrite the shiny stuff to cope with those newer compiler bugs ...
1170196957 M * Bertl shuri: but I will need some testing, can you do that for me in the next few days?
1170196963 M * shuri yes
1170197010 M * Bertl okay, will prepare something tonight or tomorrow, and do some testing on my own first, then I'll hand that to you ...
1170197020 M * shuri perfect
1170197023 M * Bertl hey pflanze!
1170197045 M * shuri pm me when you are ready
1170197058 Q * gerrit Ping timeout: 480 seconds
1170197068 M * pflanze Semi-ontopic question: does anyone know how to dump core files of processes to safe places? (especially ones with changing uids). The on-topic part of the question is, how to do it inside vservers (/proc/sys/kernel/core_* seem not virtualized).
1170197074 M * Bertl shuri: perfect, tx
1170197092 M * shuri thx to you!
1170197108 M * Bertl pflanze: hmm, yes, core dumps are always an issue
1170197135 M * Bertl pflanze: by default, the binary dumps right into the current directory
1170197153 M * Bertl pflanze: the kernel honors the ulimits for core dump sizes
1170197202 M * pflanze I've got an intermittent problem with qmail I'm trying to track down and would like to make it dump core,
1170197216 M * pflanze but a) I don't like it writing files into the queue, and b) maybe it even can't.
1170197245 M * Bertl well, the can't part is not very likely, as the kernel actually does the core dumping
1170197246 A * pflanze checks the proc files on the host
1170197264 M * pflanze hm it doesn't honor permissions?
1170197272 M * pflanze e.g. acts as root?
1170197293 M * Bertl well, it probably does a few checks, but it's not a 'can not' issue :)
1170197313 M * Bertl i.e. it might be a 'should not' and 'does not' issue :)
1170197395 A * pflanze tries
1170197446 M * Bertl but I have been wondering for some time now whether there isn't a mechanism to control the coredump in a more precise way
1170197464 M * Bertl I have to admit, I didn't check the kernel side interface yet
1170197491 M * Bertl Hollow: you got daniel_hozac's explanation regarding clone?
1170197504 M * Hollow no?
1170197519 M * Bertl basically the 3 additional arguments are tied to 3 flags
1170197534 M * Bertl when the according flag is not set, the argument is ignored
1170197542 M * Hollow ah, i see ..
1170197553 M * Bertl that explains why it works so well :)
1170197574 M * pflanze ok at least the kernel doesn't overwrite existing files, that's good news. I think I can live with that, I've set core_pattern to /tmp/cure now.
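(Editor's sketch, not from the log.) Two things decide whether and where a core file appears: the RLIMIT_CORE limits (what 'ulimit -Sc' and 'ulimit -Hc' show) and the host-wide /proc/sys/kernel/core_pattern. A minimal sketch that raises the soft limit to the hard limit for the current process and reads the pattern; the function names are hypothetical, and writing the pattern (which needs root on the host) is deliberately left out.

```python
import resource

def enable_core_dumps():
    """Raise the soft core-size limit to the hard limit; return (soft, hard)."""
    _, hard = resource.getrlimit(resource.RLIMIT_CORE)
    # Raising soft up to hard never needs extra privileges.
    resource.setrlimit(resource.RLIMIT_CORE, (hard, hard))
    return resource.getrlimit(resource.RLIMIT_CORE)

def read_core_pattern(path="/proc/sys/kernel/core_pattern"):
    """Return the current core_pattern string, or None if it is unreadable."""
    try:
        with open(path) as f:
            return f.read().strip()
    except OSError:
        return None
```

If the hard limit is 0, the soft limit stays 0 and no core is written regardless of the pattern, so checking both values first saves guesswork.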
1170197582 M * Hollow hm.. my ext3 partition won't mount with tagxid .. :/
1170197604 M * Bertl Hollow: mount or _remount_?
1170197617 M * Hollow mount .. remount said not allowed in dmesg
1170197707 M * Bertl and what does mount tell you?
1170197740 M * Hollow nothing... it mounts the filesystem, but without tagxid .. 
1170197742 M * Hollow MiddleEarth linux # mount -o tagxid /dev/sda9 /vservers/
1170197742 M * Hollow MiddleEarth linux # grep /dev/sda9 /proc/mounts
1170197742 M * Hollow /dev/sda9 /vservers ext3 rw,data=ordered 0 0
1170197770 M * Bertl 2.2.x?
1170197781 M * Hollow yep.. rc8.7
1170197854 N * shuri shu_school
1170197857 M * Hollow hm.. works on my local box with xfs though .. rc8.7 too
1170197919 M * Bertl checking ext3 options as we speak ...
1170197976 M * Bertl #ifndef CONFIG_TAGGING_NONE
1170198117 Q * dna Quit: Verlassend
1170198145 M * daniel_hozac Hollow: what tagging do you use?
1170198187 M * Bertl Hollow: could you do a testme.sh -D run?
1170198211 M * Bertl ahem, -d I mean
1170198222 M * Hollow sec ..
1170198237 J * gerrit ~gerrit@bi01p1.co.us.ibm.com
1170198290 M * Hollow hehe.. no util-vserver on that machine 
1170198310 M * Bertl -qd then?
1170198324 M * Hollow utility 'vserver' could not be found.
1170198327 M * Hollow :)
1170198347 Q * yarihm Quit: Leaving
1170198561 M * Bertl hmm, okay, will fix that in the next release :)
1170198641 M * Bertl cat /proc/virtual/info maybe?
1170198715 M * Hollow VCIVersion:     0002:0200
1170198715 M * Hollow VCISyscall:     236
1170198715 M * Hollow VCIKernel:      030107f1
1170198827 M * daniel_hozac that'd be... ID24, right?
1170198868 Q * attila_ Ping timeout: 480 seconds
1170199004 M * Hollow yep
1170199132 M * pflanze (OT: Is it possible to dump a core without exiting the process? And do you know a better place discussing such questions?)
1170199197 M * pflanze (probably ##c, but it's not really C specific..)
1170199228 M * ard Huh?
1170199241 M * ard I didn't mean to paste that here... wrong channel
1170199249 M * ard ow
1170199262 M * ard something wrong with my xterm
1170199414 J * FireEgl ~FireEgl@2001:5c0:84dc:1:211:9ff:feca:b042
1170199589 Q * ensc Ping timeout: 480 seconds
1170199798 M * PowerKe I'm now on kernel version 2.6.19-vs2.2.0-rc8 with COW enabled in the kernel and that solved the touch file problems in the shell.
1170199816 M * PowerKe Now, for some reason I can append to the file from PHP, but using the PHP touch() function on the file fails
1170199973 M * daniel_hozac what does strace say PHP is doing?
1170200109 M * PowerKe I think this is the relevant part (blah is the filename):
1170200111 M * PowerKe access("blah", F_OK)                    = 0
1170200111 M * PowerKe utime("blah", [2007/01/31-00:34:00, 2007/01/31-00:34:00]) = -1 EPERM (Operation not permitted)
1170200144 M * daniel_hozac ah, utime...
1170200153 M * daniel_hozac i think we only fixed utimes.
1170200218 M * PowerKe I can work around it by using a shell command
1170200310 M * PowerKe Or first open/close the file and then use touch() to set the new mtime.
1170201037 J * [Lars_Ulrich] bbpqr@bl4-191-115.dsl.telepac.pt
1170201058 Q * [Lars_Ulrich] 
1170201279 J * shuri ~shuri@hq01.electronicbox.net