I have a 2 part question that is somewhat related to each other.
1. We have a 2012 SQL cluster set up on decent amouth of servers. They are all connected to the same switch. They are networked teamed, cluster heartbeat. etc..
We had our core switch cause a quick blimp. Core is few segments down from sql servers so technically they shouldn't sensed that there was no connection unless they were trying to get to another VLAN. So the odd thing was that 3 clusters decided to fail over while 10+ other clusters didn't. I quess i'm trying to understand why failover was triggered on this 3 servers.
2. Once cluster was failing over we had DTC produce this error "
The Distributed Transaction Coordinator (358c020b-dab6-42fb-871c-705578ddb1f0) service terminated unexpectedly. It has done this 1 time(s)."
30 seconds later the instance crashed "
A timeout (30000 milliseconds) was reached while waiting for a transaction response from the MSSQLSERVER service."
and
"The SQL Server (MSSQLSERVER) service terminated unexpectedly. It has done this 1 time(s)."
Any insight into this?