You are here

cryptic error "Got some signal... It is:15" -- an issue with 'fixbb', or something else?

6 posts / 0 new
Last post
cryptic error "Got some signal... It is:15" -- an issue with 'fixbb', or something else?
#1

Hello Forum

I'm trying to run fixbb on my cluster here, and everything seems to have been going well for a while, but it suddenly stopped and spit out the following in the log file:

^[[0mcore.pack.interaction_graph.interaction_graph_factory: (0) ^[[0mInstantiating PDInteractionGraph
Got some signal... It is:15
Process was terminated!
Got some signal... It is:15
Process was terminated!
Got some signal... It is:15
Process was terminated!
Got some signal... It is:15
Process was terminated!
Got some signal... It is:15
Process was terminated!
Got some signal... It is:15
Process was terminated!
Got some signal... It is:15
Process was terminated!

--------------------------------------------------------------------------
mpiexec noticed that process rank 7 with PID 20123 on node baldr-3 exited on signal 9 (Killed).
--------------------------------------------------------------------------

 

now i'm not sure what this signal means or what the source of it is (e.g., something wrong with how I called fixbb, or mpiexec, or missing libraries).  Any insights would be most appreciated.

Thanks! 

Category: 
Post Situation: 
Wed, 2019-10-16 14:59
dantimatter

Signal 15 is SIGTERM.  This is a Linux/unix thing not a Rosetta thing.  It means something outside of Rosetta issued a command to Rosetta to stop running. 

One of your processed got signal 9, which is SIGKILL, which again means something external to rosetta killed it.  9 is a "harder kill" than 15.

If you are on a cluster I'd guess the most likely thing is you ran out your allocation of computer time, but there's not enough data here to know.  

Wed, 2019-10-16 15:03
smlewis

fantastic thanks!  that was easy enough to solve :) 

I was worried it was something more complicated 

 

Wed, 2019-10-16 15:05
dantimatter

Neverthless, that error message is pretty awful and confusing for users.  I'm going to open a pull request to add a clearer error message.

Wed, 2019-10-16 16:36
vmulligan

OK.  So the bad news is, I don't know why your crash occurs.  The good news is, the next time it happens, if you use the next pu blic release of Rosetta, the error emssage will be less cryptic!

Fri, 2019-10-18 12:49
vmulligan

thanks! 

Wed, 2019-10-30 09:29
dantimatter