[Bigjob-users] Cannot access Redis from compute nodes on alamo.

Ole Weidner oweidner at cct.lsu.edu
Tue Jan 31 07:39:47 CST 2012


Hmm, I thought BigJob bootstraps it's own agent via 'python -c ...'? 

If this is *not* the default case and only a fall-back solution, I strongly suggest that we make it the default case, otherwise we will keep on running into trouble like this! Change BigJob in that regard should be sufficiently easier than trying to keep versions consistent across n number of machines?

My $0.02

- Ole

On Jan 30, 2012, at 4:38 PM, pradeep kumar Mantha wrote:

> Hi!
> 
> I still get the same problem.
> 
> 
> But its interesting to see the old version  on compute node c079
> 
> [pmantha at c079 ~]$ cd /N/soft/SAGA/saga/1.6/gcc-4.1.2/lib/python2.7/site-packages/
> [pmantha at c079 site-packages]$ ls -ltr
> total 96
> drwxr-xr-x 9 merzky users  4096 Nov 24 06:50 saga
> -rw-r--r-- 1 merzky users  1779 Nov 24 06:51 site.pyc
> -rw-r--r-- 1 merzky users  2362 Nov 24 06:51 site.py
> drwxr-xr-x 9 merzky users  4096 Nov 24 06:51 BigJob-0.3.2-py2.7.egg
> -rw-r--r-- 1 merzky users 37188 Nov 24 06:51 redis-2.2.4-py2.7.egg
> drwxr-xr-x 4 merzky users  4096 Nov 24 06:51 virtualenv-1.6.4-py2.7.egg
> -rw-r--r-- 1 merzky users 12888 Nov 24 06:51 threadpool-1.2.7-py2.7.egg
> -rw-r--r-- 1 merzky users 13846 Nov 24 06:51 uuid-1.30-py2.7.egg
> -rw-r--r-- 1 merzky users   314 Nov 24 06:51 easy-install.pth
> 
> 
> New version on head node.
> 
> [pmantha at login1 site-packages]$ ls -ltr
> total 960
> -rw-r--r-- 1 merzky users   1779 Jan  9 06:45 site.pyc
> -rw-r--r-- 1 merzky users   2362 Jan  9 06:45 site.py
> -rw-r--r-- 1 merzky users   3026 Jan  9 06:45 setuptools_git-0.4.2-py2.7.egg
> drwxr-xr-x 9 merzky users   4096 Jan  9 06:45 saga
> drwxr-xr-x 9 merzky users   4096 Jan  9 06:45 BigJob-0.4.23-py2.7.egg
> drwxr-xr-x 4 merzky users   4096 Jan  9 06:45 bliss-0.1.17-py2.7.egg
> -rw-r--r-- 1 merzky users  37188 Jan  9 06:45 redis-2.2.4-py2.7.egg
> drwxr-xr-x 4 merzky users   4096 Jan  9 06:45 virtualenv-1.7-py2.7.egg
> -rw-r--r-- 1 merzky users  12888 Jan  9 06:45 threadpool-1.2.7-py2.7.egg
> -rw-r--r-- 1 merzky users  13846 Jan  9 06:45 uuid-1.30-py2.7.egg
> -rw-r--r-- 1 merzky users 298430 Jan  9 06:45 paramiko_on_pypi-1.7.6-py2.7.egg
> -rw-r--r-- 1 merzky users   6406 Jan  9 06:45 openssh_wrapper-0.2-py2.7.egg
> -rw-r--r-- 1 merzky users 560496 Jan  9 06:46 pycrypto_on_pypi-2.3-py2.7-linux-x86_64.egg
> -rw-r--r-- 1 merzky users    459 Jan  9 06:46 easy-install.pth
> [pmantha at login1 site-packages]$ 
> 
> thanks
> pradeep
> 
> 
> On Mon, Jan 30, 2012 at 4:29 PM, Andre Merzky <andremerzky at gmail.com> wrote:
> Done!
> 
> Cheers, Andre.
> 
> 
> On Mon, Jan 30, 2012 at 10:58 PM, Andre Luckow <aluckow at cct.lsu.edu> wrote:
> > Hi Pradeep,
> > the agent is picking up an old CSA BJ version (which does not support
> > password based authentication for Redis):
> >
> > /N/soft/SAGA/saga/1.6/gcc-4.1.2/lib/python2.7/site-packages/BigJob-0.3.2-py2.7.egg/coordination/bigjob_coordination_redis.py
> >
> > @AndreM: Could you please remove/update this old BJ version from Alamo?
> >
> > Thanks,
> > Andre
> >
> > On Mon, Jan 30, 2012 at 10:11 PM, pradeep kumar Mantha
> > <pradeepm66 at gmail.com> wrote:
> >> Hi!
> >>
> >> I am trying to run bigjob example on alamo using pbs-ssh. But it fails due
> >> to the following error, where as it works for fork( since it runs on head
> >> node).
> >>
> >>
> >> [pmantha at login1 bj-ce3420b4-4b86-11e1-be3d-a4badb0c3696]$ vi
> >> stderr-bigjob_agent.txt
> >>
> >> DEBUG:root:Utilizing Redis Backend:
> >> redis://ILikeBigJob_wITH-REdIS@gw68.quarry.iu.teragrid.org:6379. Please make
> >> sure Redis server is configured in bigjob_coordination_redis.py
> >> DEBUG:root:Connect to Redis:
> >> ILikeBigJob_wITH-REdIS at gw68.quarry.iu.teragrid.org Port: 6379
> >> ERROR:root:Please start Redis server!
> >> Traceback (most recent call last):
> >>   File "<string>", line 35, in <module>
> >>   File
> >> "/N/soft/SAGA/saga/1.6/gcc-4.1.2/lib/python2.7/site-packages/BigJob-0.3.2-py2.7.egg/bigjob/bigjob_agent.py",
> >> line 110, in __init__
> >>     self.coordination =
> >> bigjob_coordination(server_connect_url=self.coordination_url)
> >>   File
> >> "/N/soft/SAGA/saga/1.6/gcc-4.1.2/lib/python2.7/site-packages/BigJob-0.3.2-py2.7.egg/coordination/bigjob_coordination_redis.py",
> >> line 62, in __init__
> >>     raise Exception("Please start Redis server!")
> >> Exception: Please start Redis server!
> >> Connection closed by 10.141.0.79^M
> >> Connection closed by 10.141.0.79^M
> >> Connection closed by 10.141.0.79^M
> >> Connection closed by 10.141.0.79^M
> >> Connection closed by 10.141.0.79^M
> >> Connection closed by 10.141.0.79^M
> >> Connection closed by 10.141.0.79^M
> >> Connection closed by 10.141.0.79^M
> >>
> >> thanks
> >> pradeep
> >>
> >>
> >> _______________________________________________
> >> Bigjob-users mailing list
> >> Bigjob-users at mail.cct.lsu.edu
> >> https://mail.cct.lsu.edu/mailman/listinfo/bigjob-users
> >>
> 
> 
> 
> --
> Nothing is ever easy...
> 
> _______________________________________________
> Bigjob-users mailing list
> Bigjob-users at mail.cct.lsu.edu
> https://mail.cct.lsu.edu/mailman/listinfo/bigjob-users

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.cct.lsu.edu/pipermail/bigjob-users/attachments/20120131/616e5ad7/attachment-0001.html 


More information about the Bigjob-users mailing list