Page 1 of 2 12 LastLast
Results 1 to 15 of 18

Thread: Physical disk hangs at "offline pending"

  1. #1
    Henry Guest

    Physical disk hangs at "offline pending"

    Hi,

    I have a 2 node cluster that works correctly when the active server goes down.
    All resources are taken over by the passive member.

    When I try to move resources from node 1 to node 2 everything works fine as
    well.
    The problem is that when I try to move the the resources back to the
    original node all resources move except for one physical disk. This physical
    disk status remains as "offline pending". The cluster log contains many
    entries similar to what follows:
    "FmpofflineResource: offline resource <drivex>returned pending"
    until finally
    "RmpTimerThread: Resource drivex pending timed out, CP 3 - seting state to
    failed."

    The only way for us to get the offline resource available for the other
    cluster member is to reboot the server that failed to put the physical drive
    offline.

    Any ideas would be appreciated.
    --
    Thanks in Advance,

    Henry

  2. #2
    Chuck Timon [MSFT] Guest

    Re: Physical disk hangs at "offline pending"

    Sounds like something has a handle to the drive that is preventing cluster
    from completing the Offline process. What kind of group is this disk
    resource in?

    Chuck Timon, Jr.
    Microsoft Corporation
    Longhorn Readiness Team
    This posting is provided "AS IS" with no warranties, and confers no rights.

    "Henry" <Henry@discussions.microsoft.com> wrote in message
    news:4429C77B-C125-4677-8F00-C2D96D014716@microsoft.com...
    > Hi,
    >
    > I have a 2 node cluster that works correctly when the active server goes
    > down.
    > All resources are taken over by the passive member.
    >
    > When I try to move resources from node 1 to node 2 everything works fine
    > as
    > well.
    > The problem is that when I try to move the the resources back to the
    > original node all resources move except for one physical disk. This
    > physical
    > disk status remains as "offline pending". The cluster log contains many
    > entries similar to what follows:
    > "FmpofflineResource: offline resource <drivex>returned pending"
    > until finally
    > "RmpTimerThread: Resource drivex pending timed out, CP 3 - seting state to
    > failed."
    >
    > The only way for us to get the offline resource available for the other
    > cluster member is to reboot the server that failed to put the physical
    > drive
    > offline.
    >
    > Any ideas would be appreciated.
    > --
    > Thanks in Advance,
    >
    > Henry



  3. #3
    Henry Guest

    Re: Physical disk hangs at "offline pending"

    Hi,

    We have installed Oracle failsafe on this cluster and the drive in question
    is part of the "Cluster Group" set of resources. The oracle database resides
    on this SAN drive. I have stopped all oracle services on the server giving me
    the problems and the disk still does not go offline to enable a failover
    unless the server is shut down.
    I suppose there must be something else preventing the failover and am trying
    to determine what could be preventing this disk from being released. The
    server in question does have exclusive rights to this physical disk when it
    is the active member.
    If anyone has any idea as to how I might determine if some process is
    refusing to release it's resources please make a suggestion.
    Is there a way to increase the logging level of the cluster and should that
    give me a better indication of what may be the problem? (the logs are fairly
    hard to decipher even at the default logging level).

    Thanks in Advance,
    --
    Henry


    "Chuck Timon [MSFT]" wrote:

    > Sounds like something has a handle to the drive that is preventing cluster
    > from completing the Offline process. What kind of group is this disk
    > resource in?
    >
    > Chuck Timon, Jr.
    > Microsoft Corporation
    > Longhorn Readiness Team
    > This posting is provided "AS IS" with no warranties, and confers no rights.
    >
    > "Henry" <Henry@discussions.microsoft.com> wrote in message
    > news:4429C77B-C125-4677-8F00-C2D96D014716@microsoft.com...
    > > Hi,
    > >
    > > I have a 2 node cluster that works correctly when the active server goes
    > > down.
    > > All resources are taken over by the passive member.
    > >
    > > When I try to move resources from node 1 to node 2 everything works fine
    > > as
    > > well.
    > > The problem is that when I try to move the the resources back to the
    > > original node all resources move except for one physical disk. This
    > > physical
    > > disk status remains as "offline pending". The cluster log contains many
    > > entries similar to what follows:
    > > "FmpofflineResource: offline resource <drivex>returned pending"
    > > until finally
    > > "RmpTimerThread: Resource drivex pending timed out, CP 3 - seting state to
    > > failed."
    > >
    > > The only way for us to get the offline resource available for the other
    > > cluster member is to reboot the server that failed to put the physical
    > > drive
    > > offline.
    > >
    > > Any ideas would be appreciated.
    > > --
    > > Thanks in Advance,
    > >
    > > Henry

    >
    >


  4. #4
    Edwin vMierlo Guest

    Re: Physical disk hangs at "offline pending"

    Henry,

    just a few questions:
    - is this Oracle FileSystem (ocfs.sys) ?
    - what errors do you see in the system event log (please post) ?
    - what errors do you see in the cluster.log (please post) ?
    - once the disk is in off-line pending state... what other cluster resources
    are off-line pending ?

    thnx,
    edwin.




    "Henry" <Henry@discussions.microsoft.com> wrote in message
    news:CE4E6FBE-DAE2-425C-B2F5-526812E37245@microsoft.com...
    > Hi,
    >
    > We have installed Oracle failsafe on this cluster and the drive in

    question
    > is part of the "Cluster Group" set of resources. The oracle database

    resides
    > on this SAN drive. I have stopped all oracle services on the server giving

    me
    > the problems and the disk still does not go offline to enable a failover
    > unless the server is shut down.
    > I suppose there must be something else preventing the failover and am

    trying
    > to determine what could be preventing this disk from being released. The
    > server in question does have exclusive rights to this physical disk when

    it
    > is the active member.
    > If anyone has any idea as to how I might determine if some process is
    > refusing to release it's resources please make a suggestion.
    > Is there a way to increase the logging level of the cluster and should

    that
    > give me a better indication of what may be the problem? (the logs are

    fairly
    > hard to decipher even at the default logging level).
    >
    > Thanks in Advance,
    > --
    > Henry
    >
    >
    > "Chuck Timon [MSFT]" wrote:
    >
    > > Sounds like something has a handle to the drive that is preventing

    cluster
    > > from completing the Offline process. What kind of group is this disk
    > > resource in?
    > >
    > > Chuck Timon, Jr.
    > > Microsoft Corporation
    > > Longhorn Readiness Team
    > > This posting is provided "AS IS" with no warranties, and confers no

    rights.
    > >
    > > "Henry" <Henry@discussions.microsoft.com> wrote in message
    > > news:4429C77B-C125-4677-8F00-C2D96D014716@microsoft.com...
    > > > Hi,
    > > >
    > > > I have a 2 node cluster that works correctly when the active server

    goes
    > > > down.
    > > > All resources are taken over by the passive member.
    > > >
    > > > When I try to move resources from node 1 to node 2 everything works

    fine
    > > > as
    > > > well.
    > > > The problem is that when I try to move the the resources back to the
    > > > original node all resources move except for one physical disk. This
    > > > physical
    > > > disk status remains as "offline pending". The cluster log contains

    many
    > > > entries similar to what follows:
    > > > "FmpofflineResource: offline resource <drivex>returned pending"
    > > > until finally
    > > > "RmpTimerThread: Resource drivex pending timed out, CP 3 - seting

    state to
    > > > failed."
    > > >
    > > > The only way for us to get the offline resource available for the

    other
    > > > cluster member is to reboot the server that failed to put the physical
    > > > drive
    > > > offline.
    > > >
    > > > Any ideas would be appreciated.
    > > > --
    > > > Thanks in Advance,
    > > >
    > > > Henry

    > >
    > >




  5. #5
    Henry Guest

    Re: Physical disk hangs at "offline pending"

    Hi,

    1) No. The file system is NTFS.
    2) Event ID 1145 - Cluster resource OracleDB timed out. (Physical disk name)
    Event ID 1205 - The cluster service failed to bring the resource group
    "OracleDB" completely online or offline.
    3)00000f7c.00000810::2007/03/01-15:51:35.805 INFO [FM] FmpRmOfflineResource:
    RmOffline() for 5fa5cc41-66f4-4b14-9d9c-32c7f67347a5 returned error 997. ---
    0000c20.00000f18::2007/03/01-17:56:22.860 INFO [FM] FmpRmOfflineResource:
    RmOffline() for 667e7691-4049-44fe-9380-c620cd79971d returned error 997

    The following entry is repeated:
    00000c20.00000a60::2007/03/01-17:58:25.379 INFO [FM] FmpCompleteMoveGroup:
    Exit, status = 997
    00000c20.00000a60::2007/03/01-17:58:25.875 INFO [FM] FmpCompleteMoveGroup:
    Completing the move for group BANCTEC to node 1 (1)
    00000c20.00000a60::2007/03/01-17:58:25.875 INFO [FM] FmpOfflineResource:
    Offline resource <OracleDB> returned pending

    until finally:
    00000c20.00000a60::2007/03/01-17:59:40.276 INFO [FM] FmpCompleteMoveGroup:
    Exit, status = 997
    000002f4.00000388::2007/03/01-17:59:40.757 WARN [RM] RmpTimerThread:
    Resource OracleDB pending timed out, CP 3 - setting state to failed.

    This last messege may be the result of us getting fed up and shutting down
    the server that will not release the physical drive.
    4) The other resources are offline. On the odd occasion another physical
    disk displays the pending offline symtoms as well.

    Thanks in Advance

    --
    Henry


    "Edwin vMierlo" wrote:

    > Henry,
    >
    > just a few questions:
    > - is this Oracle FileSystem (ocfs.sys) ?
    > - what errors do you see in the system event log (please post) ?
    > - what errors do you see in the cluster.log (please post) ?
    > - once the disk is in off-line pending state... what other cluster resources
    > are off-line pending ?
    >
    > thnx,
    > edwin.
    >
    >
    >
    >
    > "Henry" <Henry@discussions.microsoft.com> wrote in message
    > news:CE4E6FBE-DAE2-425C-B2F5-526812E37245@microsoft.com...
    > > Hi,
    > >
    > > We have installed Oracle failsafe on this cluster and the drive in

    > question
    > > is part of the "Cluster Group" set of resources. The oracle database

    > resides
    > > on this SAN drive. I have stopped all oracle services on the server giving

    > me
    > > the problems and the disk still does not go offline to enable a failover
    > > unless the server is shut down.
    > > I suppose there must be something else preventing the failover and am

    > trying
    > > to determine what could be preventing this disk from being released. The
    > > server in question does have exclusive rights to this physical disk when

    > it
    > > is the active member.
    > > If anyone has any idea as to how I might determine if some process is
    > > refusing to release it's resources please make a suggestion.
    > > Is there a way to increase the logging level of the cluster and should

    > that
    > > give me a better indication of what may be the problem? (the logs are

    > fairly
    > > hard to decipher even at the default logging level).
    > >
    > > Thanks in Advance,
    > > --
    > > Henry
    > >
    > >
    > > "Chuck Timon [MSFT]" wrote:
    > >
    > > > Sounds like something has a handle to the drive that is preventing

    > cluster
    > > > from completing the Offline process. What kind of group is this disk
    > > > resource in?
    > > >
    > > > Chuck Timon, Jr.
    > > > Microsoft Corporation
    > > > Longhorn Readiness Team
    > > > This posting is provided "AS IS" with no warranties, and confers no

    > rights.
    > > >
    > > > "Henry" <Henry@discussions.microsoft.com> wrote in message
    > > > news:4429C77B-C125-4677-8F00-C2D96D014716@microsoft.com...
    > > > > Hi,
    > > > >
    > > > > I have a 2 node cluster that works correctly when the active server

    > goes
    > > > > down.
    > > > > All resources are taken over by the passive member.
    > > > >
    > > > > When I try to move resources from node 1 to node 2 everything works

    > fine
    > > > > as
    > > > > well.
    > > > > The problem is that when I try to move the the resources back to the
    > > > > original node all resources move except for one physical disk. This
    > > > > physical
    > > > > disk status remains as "offline pending". The cluster log contains

    > many
    > > > > entries similar to what follows:
    > > > > "FmpofflineResource: offline resource <drivex>returned pending"
    > > > > until finally
    > > > > "RmpTimerThread: Resource drivex pending timed out, CP 3 - seting

    > state to
    > > > > failed."
    > > > >
    > > > > The only way for us to get the offline resource available for the

    > other
    > > > > cluster member is to reboot the server that failed to put the physical
    > > > > drive
    > > > > offline.
    > > > >
    > > > > Any ideas would be appreciated.
    > > > > --
    > > > > Thanks in Advance,
    > > > >
    > > > > Henry
    > > >
    > > >

    >
    >
    >


  6. #6
    Chuck Timon [MSFT] Guest

    Re: Physical disk hangs at "offline pending"

    Answer me this, when you look at the properties for this disk resource and
    inspect the 'dependencies' tab...is there anything listed there?

    Chuck Timon, Jr.
    Microsoft Corporation
    Longhorn Readiness Team
    This posting is provided "AS IS" with no warranties, and confers no rights.

    "Henry" <Henry@discussions.microsoft.com> wrote in message
    news:056FAD68-9EF5-4877-93B1-841B20EED633@microsoft.com...
    > Hi,
    >
    > 1) No. The file system is NTFS.
    > 2) Event ID 1145 - Cluster resource OracleDB timed out. (Physical disk
    > name)
    > Event ID 1205 - The cluster service failed to bring the resource group
    > "OracleDB" completely online or offline.
    > 3)00000f7c.00000810::2007/03/01-15:51:35.805 INFO [FM]
    > FmpRmOfflineResource:
    > RmOffline() for 5fa5cc41-66f4-4b14-9d9c-32c7f67347a5 returned error
    > 997. ---
    > 0000c20.00000f18::2007/03/01-17:56:22.860 INFO [FM] FmpRmOfflineResource:
    > RmOffline() for 667e7691-4049-44fe-9380-c620cd79971d returned error 997
    >
    > The following entry is repeated:
    > 00000c20.00000a60::2007/03/01-17:58:25.379 INFO [FM] FmpCompleteMoveGroup:
    > Exit, status = 997
    > 00000c20.00000a60::2007/03/01-17:58:25.875 INFO [FM] FmpCompleteMoveGroup:
    > Completing the move for group BANCTEC to node 1 (1)
    > 00000c20.00000a60::2007/03/01-17:58:25.875 INFO [FM] FmpOfflineResource:
    > Offline resource <OracleDB> returned pending
    >
    > until finally:
    > 00000c20.00000a60::2007/03/01-17:59:40.276 INFO [FM] FmpCompleteMoveGroup:
    > Exit, status = 997
    > 000002f4.00000388::2007/03/01-17:59:40.757 WARN [RM] RmpTimerThread:
    > Resource OracleDB pending timed out, CP 3 - setting state to failed.
    >
    > This last messege may be the result of us getting fed up and shutting down
    > the server that will not release the physical drive.
    > 4) The other resources are offline. On the odd occasion another physical
    > disk displays the pending offline symtoms as well.
    >
    > Thanks in Advance
    >
    > --
    > Henry
    >
    >
    > "Edwin vMierlo" wrote:
    >
    >> Henry,
    >>
    >> just a few questions:
    >> - is this Oracle FileSystem (ocfs.sys) ?
    >> - what errors do you see in the system event log (please post) ?
    >> - what errors do you see in the cluster.log (please post) ?
    >> - once the disk is in off-line pending state... what other cluster
    >> resources
    >> are off-line pending ?
    >>
    >> thnx,
    >> edwin.
    >>
    >>
    >>
    >>
    >> "Henry" <Henry@discussions.microsoft.com> wrote in message
    >> news:CE4E6FBE-DAE2-425C-B2F5-526812E37245@microsoft.com...
    >> > Hi,
    >> >
    >> > We have installed Oracle failsafe on this cluster and the drive in

    >> question
    >> > is part of the "Cluster Group" set of resources. The oracle database

    >> resides
    >> > on this SAN drive. I have stopped all oracle services on the server
    >> > giving

    >> me
    >> > the problems and the disk still does not go offline to enable a
    >> > failover
    >> > unless the server is shut down.
    >> > I suppose there must be something else preventing the failover and am

    >> trying
    >> > to determine what could be preventing this disk from being released.
    >> > The
    >> > server in question does have exclusive rights to this physical disk
    >> > when

    >> it
    >> > is the active member.
    >> > If anyone has any idea as to how I might determine if some process is
    >> > refusing to release it's resources please make a suggestion.
    >> > Is there a way to increase the logging level of the cluster and should

    >> that
    >> > give me a better indication of what may be the problem? (the logs are

    >> fairly
    >> > hard to decipher even at the default logging level).
    >> >
    >> > Thanks in Advance,
    >> > --
    >> > Henry
    >> >
    >> >
    >> > "Chuck Timon [MSFT]" wrote:
    >> >
    >> > > Sounds like something has a handle to the drive that is preventing

    >> cluster
    >> > > from completing the Offline process. What kind of group is this disk
    >> > > resource in?
    >> > >
    >> > > Chuck Timon, Jr.
    >> > > Microsoft Corporation
    >> > > Longhorn Readiness Team
    >> > > This posting is provided "AS IS" with no warranties, and confers no

    >> rights.
    >> > >
    >> > > "Henry" <Henry@discussions.microsoft.com> wrote in message
    >> > > news:4429C77B-C125-4677-8F00-C2D96D014716@microsoft.com...
    >> > > > Hi,
    >> > > >
    >> > > > I have a 2 node cluster that works correctly when the active server

    >> goes
    >> > > > down.
    >> > > > All resources are taken over by the passive member.
    >> > > >
    >> > > > When I try to move resources from node 1 to node 2 everything works

    >> fine
    >> > > > as
    >> > > > well.
    >> > > > The problem is that when I try to move the the resources back to
    >> > > > the
    >> > > > original node all resources move except for one physical disk. This
    >> > > > physical
    >> > > > disk status remains as "offline pending". The cluster log contains

    >> many
    >> > > > entries similar to what follows:
    >> > > > "FmpofflineResource: offline resource <drivex>returned pending"
    >> > > > until finally
    >> > > > "RmpTimerThread: Resource drivex pending timed out, CP 3 - seting

    >> state to
    >> > > > failed."
    >> > > >
    >> > > > The only way for us to get the offline resource available for the

    >> other
    >> > > > cluster member is to reboot the server that failed to put the
    >> > > > physical
    >> > > > drive
    >> > > > offline.
    >> > > >
    >> > > > Any ideas would be appreciated.
    >> > > > --
    >> > > > Thanks in Advance,
    >> > > >
    >> > > > Henry
    >> > >
    >> > >

    >>
    >>
    >>



  7. #7
    Henry Guest

    Re: Physical disk hangs at "offline pending"

    Hi,

    No, there are no dependencies for this resource.
    That is what I expected for a physical disk resource.

    Thanks in Advance.
    --
    Henry


    "Chuck Timon [MSFT]" wrote:

    > Answer me this, when you look at the properties for this disk resource and
    > inspect the 'dependencies' tab...is there anything listed there?
    >
    > Chuck Timon, Jr.
    > Microsoft Corporation
    > Longhorn Readiness Team
    > This posting is provided "AS IS" with no warranties, and confers no rights.
    >
    > "Henry" <Henry@discussions.microsoft.com> wrote in message
    > news:056FAD68-9EF5-4877-93B1-841B20EED633@microsoft.com...
    > > Hi,
    > >
    > > 1) No. The file system is NTFS.
    > > 2) Event ID 1145 - Cluster resource OracleDB timed out. (Physical disk
    > > name)
    > > Event ID 1205 - The cluster service failed to bring the resource group
    > > "OracleDB" completely online or offline.
    > > 3)00000f7c.00000810::2007/03/01-15:51:35.805 INFO [FM]
    > > FmpRmOfflineResource:
    > > RmOffline() for 5fa5cc41-66f4-4b14-9d9c-32c7f67347a5 returned error
    > > 997. ---
    > > 0000c20.00000f18::2007/03/01-17:56:22.860 INFO [FM] FmpRmOfflineResource:
    > > RmOffline() for 667e7691-4049-44fe-9380-c620cd79971d returned error 997
    > >
    > > The following entry is repeated:
    > > 00000c20.00000a60::2007/03/01-17:58:25.379 INFO [FM] FmpCompleteMoveGroup:
    > > Exit, status = 997
    > > 00000c20.00000a60::2007/03/01-17:58:25.875 INFO [FM] FmpCompleteMoveGroup:
    > > Completing the move for group BANCTEC to node 1 (1)
    > > 00000c20.00000a60::2007/03/01-17:58:25.875 INFO [FM] FmpOfflineResource:
    > > Offline resource <OracleDB> returned pending
    > >
    > > until finally:
    > > 00000c20.00000a60::2007/03/01-17:59:40.276 INFO [FM] FmpCompleteMoveGroup:
    > > Exit, status = 997
    > > 000002f4.00000388::2007/03/01-17:59:40.757 WARN [RM] RmpTimerThread:
    > > Resource OracleDB pending timed out, CP 3 - setting state to failed.
    > >
    > > This last messege may be the result of us getting fed up and shutting down
    > > the server that will not release the physical drive.
    > > 4) The other resources are offline. On the odd occasion another physical
    > > disk displays the pending offline symtoms as well.
    > >
    > > Thanks in Advance
    > >
    > > --
    > > Henry
    > >
    > >
    > > "Edwin vMierlo" wrote:
    > >
    > >> Henry,
    > >>
    > >> just a few questions:
    > >> - is this Oracle FileSystem (ocfs.sys) ?
    > >> - what errors do you see in the system event log (please post) ?
    > >> - what errors do you see in the cluster.log (please post) ?
    > >> - once the disk is in off-line pending state... what other cluster
    > >> resources
    > >> are off-line pending ?
    > >>
    > >> thnx,
    > >> edwin.
    > >>
    > >>
    > >>
    > >>
    > >> "Henry" <Henry@discussions.microsoft.com> wrote in message
    > >> news:CE4E6FBE-DAE2-425C-B2F5-526812E37245@microsoft.com...
    > >> > Hi,
    > >> >
    > >> > We have installed Oracle failsafe on this cluster and the drive in
    > >> question
    > >> > is part of the "Cluster Group" set of resources. The oracle database
    > >> resides
    > >> > on this SAN drive. I have stopped all oracle services on the server
    > >> > giving
    > >> me
    > >> > the problems and the disk still does not go offline to enable a
    > >> > failover
    > >> > unless the server is shut down.
    > >> > I suppose there must be something else preventing the failover and am
    > >> trying
    > >> > to determine what could be preventing this disk from being released.
    > >> > The
    > >> > server in question does have exclusive rights to this physical disk
    > >> > when
    > >> it
    > >> > is the active member.
    > >> > If anyone has any idea as to how I might determine if some process is
    > >> > refusing to release it's resources please make a suggestion.
    > >> > Is there a way to increase the logging level of the cluster and should
    > >> that
    > >> > give me a better indication of what may be the problem? (the logs are
    > >> fairly
    > >> > hard to decipher even at the default logging level).
    > >> >
    > >> > Thanks in Advance,
    > >> > --
    > >> > Henry
    > >> >
    > >> >
    > >> > "Chuck Timon [MSFT]" wrote:
    > >> >
    > >> > > Sounds like something has a handle to the drive that is preventing
    > >> cluster
    > >> > > from completing the Offline process. What kind of group is this disk
    > >> > > resource in?
    > >> > >
    > >> > > Chuck Timon, Jr.
    > >> > > Microsoft Corporation
    > >> > > Longhorn Readiness Team
    > >> > > This posting is provided "AS IS" with no warranties, and confers no
    > >> rights.
    > >> > >
    > >> > > "Henry" <Henry@discussions.microsoft.com> wrote in message
    > >> > > news:4429C77B-C125-4677-8F00-C2D96D014716@microsoft.com...
    > >> > > > Hi,
    > >> > > >
    > >> > > > I have a 2 node cluster that works correctly when the active server
    > >> goes
    > >> > > > down.
    > >> > > > All resources are taken over by the passive member.
    > >> > > >
    > >> > > > When I try to move resources from node 1 to node 2 everything works
    > >> fine
    > >> > > > as
    > >> > > > well.
    > >> > > > The problem is that when I try to move the the resources back to
    > >> > > > the
    > >> > > > original node all resources move except for one physical disk. This
    > >> > > > physical
    > >> > > > disk status remains as "offline pending". The cluster log contains
    > >> many
    > >> > > > entries similar to what follows:
    > >> > > > "FmpofflineResource: offline resource <drivex>returned pending"
    > >> > > > until finally
    > >> > > > "RmpTimerThread: Resource drivex pending timed out, CP 3 - seting
    > >> state to
    > >> > > > failed."
    > >> > > >
    > >> > > > The only way for us to get the offline resource available for the
    > >> other
    > >> > > > cluster member is to reboot the server that failed to put the
    > >> > > > physical
    > >> > > > drive
    > >> > > > offline.
    > >> > > >
    > >> > > > Any ideas would be appreciated.
    > >> > > > --
    > >> > > > Thanks in Advance,
    > >> > > >
    > >> > > > Henry
    > >> > >
    > >> > >
    > >>
    > >>
    > >>

    >
    >


  8. #8
    kkantchev@hotmail.com Guest

    Re: Physical disk hangs at "offline pending"

    Hi,
    In order to determine who is holding the HD resource, do the following.
    1. Restart and bring everything up and running on node 1.
    2. Take offline everything except the cluster.
    3. Do failover to Node 2
    4. Bring online only the HD in questin
    5. Do failover back to Node1

    If step 5 is not succeseful, then there is process external to the cluster
    that i causing the problem. Antivirus scan or something like that.
    If step 5 is succeseful, then the problem is in your Oracle setup. In this
    case you have to send more details about your configuration.

    Hope that helps

    Krass

  9. #9
    Henry Guest

    Re: Physical disk hangs at "offline pending"

    Hi,

    Pardon my ignorance but I'm not sure what is meant by "take offline
    everything but the cluster". If that means the cluster group, it was modified
    and contains:
    Cluster IP
    Cluster Name
    Quorum disk physical disk
    Oracle log physical disk
    Oracle DB physical disk
    Images physical disk
    Oracle Services for MSCS
    Several file shares dependant on the physical disks.

    The other groups (2 for different instances) contain:
    Oracle Database
    Oracle failover IP address
    Oracle Listener
    Oracle network name

    This is not how I originally configured it but it was changed by the DBA
    that installed Oracle failsafe.

    The two servers are identical (HW and OS).

    Thanks in Advance,
    --
    Henry


    "kkantchev@hotmail.com" wrote:

    > Hi,
    > In order to determine who is holding the HD resource, do the following.
    > 1. Restart and bring everything up and running on node 1.
    > 2. Take offline everything except the cluster.
    > 3. Do failover to Node 2
    > 4. Bring online only the HD in questin
    > 5. Do failover back to Node1
    >
    > If step 5 is not succeseful, then there is process external to the cluster
    > that i causing the problem. Antivirus scan or something like that.
    > If step 5 is succeseful, then the problem is in your Oracle setup. In this
    > case you have to send more details about your configuration.
    >
    > Hope that helps
    >
    > Krass


  10. #10
    kkantchev@hotmail.com Guest

    Re: Physical disk hangs at "offline pending"

    Henry,
    By "cluster" I ment
    Cluster IP
    Cluster Name
    Quorum physical disk
    That's it. Take all other resources offline.

    BTW, as best MSCS practice you should not use the cluster goroup for
    anything else but the above, + MSDTC maybe. In fact, your problem may be
    related to that.

    Hope you manage to figure out what's holding that drive.


  11. #11
    Henry Guest

    Re: Physical disk hangs at "offline pending"

    Hi,

    OK I had everything offline except the cluster IP, Name and Quorum.
    Failed over to second node with no issues (This is always the case)
    Then I brought the physical disk(s) in question (2 actually) on line.
    Tried a failover back to the first member of the cluster and the same thing
    happened. The physical disk(s) in question stayed in the "offline pending"
    state.

    Thanks in Advance,
    --
    Henry


    "kkantchev@hotmail.com" wrote:

    > Henry,
    > By "cluster" I ment
    > Cluster IP
    > Cluster Name
    > Quorum physical disk
    > That's it. Take all other resources offline.
    >
    > BTW, as best MSCS practice you should not use the cluster goroup for
    > anything else but the above, + MSDTC maybe. In fact, your problem may be
    > related to that.
    >
    > Hope you manage to figure out what's holding that drive.
    >


  12. #12
    Edwin vMierlo Guest

    Re: Physical disk hangs at "offline pending"

    Henry,

    I think here we go wrong.

    The disks your Oracle instances are using are not in the same group of your
    Oracle instances.

    so, this is what I think here is happening.
    Cluster group online on node 1
    Oracle group online on node 1

    Oracle is happy, as its disks are accessible.... as they are all on node 1.

    Cluster group wants to move over, tries to offline the disks, however your
    Oracle instances still have a handle open on the disks, and therefore the
    disk remains off-line pending until time out.

    I seriously believe you need to change your configuration.

    The cluster group should only contain the following :
    1) Quorum disk
    2) Cluster IP address
    3) Cluster Network Name

    Each of your Oracle groups need the following
    1) IP address
    2) Network Name
    3) as many disks as this Oracle instance need, these disks can only be used
    by this instance exclusively
    4) Oracle services / applications for this instance.

    Other than this, follow Oracle instructions for building your cluster,
    invoke Oracle support for getting more information on how to configure your
    clustered applications.

    Rgds,
    Edwin.


    "Henry" <Henry@discussions.microsoft.com> wrote in message
    news:DEE12401-3D9C-4879-8CEC-792EB43A6BBA@microsoft.com...
    > Hi,
    >
    > Pardon my ignorance but I'm not sure what is meant by "take offline
    > everything but the cluster". If that means the cluster group, it was

    modified
    > and contains:
    > Cluster IP
    > Cluster Name
    > Quorum disk physical disk
    > Oracle log physical disk
    > Oracle DB physical disk
    > Images physical disk
    > Oracle Services for MSCS
    > Several file shares dependant on the physical disks.
    >
    > The other groups (2 for different instances) contain:
    > Oracle Database
    > Oracle failover IP address
    > Oracle Listener
    > Oracle network name
    >
    > This is not how I originally configured it but it was changed by the DBA
    > that installed Oracle failsafe.
    >
    > The two servers are identical (HW and OS).
    >
    > Thanks in Advance,
    > --
    > Henry
    >
    >
    > "kkantchev@hotmail.com" wrote:
    >
    > > Hi,
    > > In order to determine who is holding the HD resource, do the following.
    > > 1. Restart and bring everything up and running on node 1.
    > > 2. Take offline everything except the cluster.
    > > 3. Do failover to Node 2
    > > 4. Bring online only the HD in questin
    > > 5. Do failover back to Node1
    > >
    > > If step 5 is not succeseful, then there is process external to the

    cluster
    > > that i causing the problem. Antivirus scan or something like that.
    > > If step 5 is succeseful, then the problem is in your Oracle setup. In

    this
    > > case you have to send more details about your configuration.
    > >
    > > Hope that helps
    > >
    > > Krass




  13. #13
    Henry Guest

    Re: Physical disk hangs at "offline pending"

    Hi,

    You are probably right about the group configuration and I will do some
    research into it today.

    Since the group move works fine from node 1 to node 2 but doesn't work going
    from node 2 to node 1 I'm not sure that group configuration is causing my
    problems.

    Even with all the oracle services offline and only the Cluster IP, name,
    quorum and the 1 physical disk online a move from node 2 to node 1 leaves the
    physical disk as "offline pending" while a move from node 1 to node 2 works
    fine.

    I am going to seeif I can find any differences that are not obvious between
    the nodes first. Then I will try re-arranging the groups.

    Any other suggestions would be appreciated.

    Thanks in Advance,
    --
    Henry


    "Edwin vMierlo" wrote:

    > Henry,
    >
    > I think here we go wrong.
    >
    > The disks your Oracle instances are using are not in the same group of your
    > Oracle instances.
    >
    > so, this is what I think here is happening.
    > Cluster group online on node 1
    > Oracle group online on node 1
    >
    > Oracle is happy, as its disks are accessible.... as they are all on node 1.
    >
    > Cluster group wants to move over, tries to offline the disks, however your
    > Oracle instances still have a handle open on the disks, and therefore the
    > disk remains off-line pending until time out.
    >
    > I seriously believe you need to change your configuration.
    >
    > The cluster group should only contain the following :
    > 1) Quorum disk
    > 2) Cluster IP address
    > 3) Cluster Network Name
    >
    > Each of your Oracle groups need the following
    > 1) IP address
    > 2) Network Name
    > 3) as many disks as this Oracle instance need, these disks can only be used
    > by this instance exclusively
    > 4) Oracle services / applications for this instance.
    >
    > Other than this, follow Oracle instructions for building your cluster,
    > invoke Oracle support for getting more information on how to configure your
    > clustered applications.
    >
    > Rgds,
    > Edwin.
    >
    >
    > "Henry" <Henry@discussions.microsoft.com> wrote in message
    > news:DEE12401-3D9C-4879-8CEC-792EB43A6BBA@microsoft.com...
    > > Hi,
    > >
    > > Pardon my ignorance but I'm not sure what is meant by "take offline
    > > everything but the cluster". If that means the cluster group, it was

    > modified
    > > and contains:
    > > Cluster IP
    > > Cluster Name
    > > Quorum disk physical disk
    > > Oracle log physical disk
    > > Oracle DB physical disk
    > > Images physical disk
    > > Oracle Services for MSCS
    > > Several file shares dependant on the physical disks.
    > >
    > > The other groups (2 for different instances) contain:
    > > Oracle Database
    > > Oracle failover IP address
    > > Oracle Listener
    > > Oracle network name
    > >
    > > This is not how I originally configured it but it was changed by the DBA
    > > that installed Oracle failsafe.
    > >
    > > The two servers are identical (HW and OS).
    > >
    > > Thanks in Advance,
    > > --
    > > Henry
    > >
    > >
    > > "kkantchev@hotmail.com" wrote:
    > >
    > > > Hi,
    > > > In order to determine who is holding the HD resource, do the following.
    > > > 1. Restart and bring everything up and running on node 1.
    > > > 2. Take offline everything except the cluster.
    > > > 3. Do failover to Node 2
    > > > 4. Bring online only the HD in questin
    > > > 5. Do failover back to Node1
    > > >
    > > > If step 5 is not succeseful, then there is process external to the

    > cluster
    > > > that i causing the problem. Antivirus scan or something like that.
    > > > If step 5 is succeseful, then the problem is in your Oracle setup. In

    > this
    > > > case you have to send more details about your configuration.
    > > >
    > > > Hope that helps
    > > >
    > > > Krass

    >
    >
    >


  14. #14
    Edwin vMierlo Guest

    Re: Physical disk hangs at "offline pending"



    >
    > Any other suggestions would be appreciated.
    >
    > Thanks in Advance,
    > --
    > Henry
    >


    Get Oracle support involved and let them tell you how to configure your
    groups/resources/disks.

    rgds,
    Edwin.



  15. #15
    John Toner [MVP] Guest

    Re: Physical disk hangs at "offline pending"

    What happens if you have only one disk online, or only attempt to take a
    single disk offline? Does this work?

    If it works with a single disk at a time, as a work-around you could make
    one of your disk resources depend upon the other so that only a single disk
    goes offline at a time.

    If the above works for you, I have seen this once at a customer's site but
    never really found a root cause. My customer wound up re-installing their
    O/S and this resolved the issue.

    Regards,
    John


    "Henry" <Henry@discussions.microsoft.com> wrote in message
    news:C82BB347-B2E4-419D-AC34-82737C8142CC@microsoft.com...
    > Hi,
    >
    > OK I had everything offline except the cluster IP, Name and Quorum.
    > Failed over to second node with no issues (This is always the case)
    > Then I brought the physical disk(s) in question (2 actually) on line.
    > Tried a failover back to the first member of the cluster and the same

    thing
    > happened. The physical disk(s) in question stayed in the "offline pending"
    > state.
    >
    > Thanks in Advance,
    > --




Page 1 of 2 12 LastLast

Similar Threads

  1. Replies: 4
    Last Post: 04-04-2012, 04:42 AM
  2. "Add request pending" issue in Yahoo Messenger
    By Galimberti in forum Technology & Internet
    Replies: 3
    Last Post: 17-06-2010, 11:42 AM
  3. Replies: 2
    Last Post: 20-04-2009, 09:57 PM
  4. Replies: 3
    Last Post: 18-03-2008, 10:02 AM
  5. Replies: 2
    Last Post: 07-03-2007, 08:16 AM

Tags for this Thread

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •  
Page generated in 1,710,849,238.62201 seconds with 16 queries