Eight Steps To Successful Troubleshooting: Step 1. Identify The Exact Issue
Eight Steps To Successful Troubleshooting: Step 1. Identify The Exact Issue
The successful technician will thoroughly study whatever data is available and
develop in-depth insight into the function of all components and how to operate
them. Finally, he or she will remember that conditions appearing to be serious
defects are often the result of improper usage or operator error.
Note: Has it ever worked? If the reported failure has never worked properly,
then treat the situation as a new installation and not a troubleshooting event.
The process and assumptions are completely different. www.flukenetworks.com 9
Step 2. Recreate the problem.
Ask yourself if you understand the symptoms, and verify the reported problem
yourself if possible. Problems are much easier to solve if they can be recreated
on demand. Seeing the problem will allow you to observe error messages and
various symptoms the user may not think important to relate, and may even
provide the opportunity for you to collect network statistics during the event.
If the problem is intermittent, instruct the user what sort of symptoms are likely
and provide a written list of what questions you are seeking answers to so the
user can gather some of the information if you are unable to respond quickly
enough to see it yourself. When possible, leave a diagnostic tool to gather
information continuously. A protocol analyzer may be left gathering all traffic
from the network and overwriting the buffer as it fills. Have the user halt its
operation and/or store the current test results from other testers immediately
upon rediscovering an intermittent problem.
Is there any normal function missing, or is there an abnormal response? Use the
data gathered by your network monitoring tools to aid you in this process.
Can the problem be duplicated from another station, or using other software
applications at the same station? Identify whether the problem is limited to one
station, or one network resource such as a printer. Move one segment closer to
the network resource and try again. If the problem goes away when you move
closer to the network resource, test or replace the intervening infrastructure
equipment.
If the problem affects an entire shared media segment, isolate the problem by
reducing the variables to the fewest possible number. Try shortening the cable
segment on a bus topology, or temporarily re-cabling a ring or star topology to
create the smallest possible network for troubleshooting purposes. Try a
different switch or hub. If the problem is on the same, shared media segment as
the network resource, try turning off or disconnect all but two stations. Once
those two are communicating, add more stations. If they are not
communicating, check the physical layer possibilities such as the termination of
the cable, the cable itself, or the specific ports used on the infrastructure
equipment (hubs and switches).www.flukenetworks.com 11
If the problem can be isolated to a single station, try a different network
adapter, a fresh copy of the network driver software (without using any of the
network software or configuration files presently found on that station delete
them if necessary). Try accessing the network using a diagnostic tool from the
existing network cable connection for that station. If the network connection
seems intact, determine if only one application exhibits the problem. Try other
applications from the same drive or file system. Compare configurations with a
nearby but operational workstation. Try a fresh copy of the application software
(again using none of the existing software or configuration files).
If only one user experiences the problem, check the network security and
permissions for that user. Find out if any changes have been made to the
network security that might affect this user. Has another user account been
deleted that this user was made security equivalent to? Has this user been
deleted from a security grouping within the network? Has an application been
moved to a new location on the network? Have there been any changes to the
system login script, or the users login script? Compare this users account with
another users account that is able to perform the desired task. Have the
affected user log in and attempt the same task from a nearby station that is not
experiencing the problem. Have the other user log in to the problem station and
try the same task.
For network hardware, it is most expedient to simply replace a part, and attempt
to repair the part later. Another option is to change the connection to a spare
port and cover or otherwise mark the suspect port. Remember than the goal is
to restore full operation of the network as soon as possible.
Two avenues exist for solving software problems. The first option is to reinstall
the problem software, eliminating possibly corrupted files and ensuring that all
required files are present. This is an excellent way to ensure that the second
option reconfiguring www.flukenetworks.com 13
the software works on the first try. Many applications allow for a software
switch that tells the installation program to disregard any existing configuration
files, which is a good way to avoid being misled by the error and duplicating it
yet again. If this option is not evident, then it is often better to remove the
application before reinstalling it.
If the problem is isolated to a single user account it is often faster to repeat the
steps necessary to grant the user access to the problem application or operation
as if the user had never been authorized before. By going through each of these
steps in a logical order, you will probably locate the missing or incorrect element
faster than by spot-checking. In some situations, it may be expedient to simply
delete the whole account and start over.
A good working relationship between network support staff and the user
community can significantly enhance your ability to keep the network running
smoothly. Failure to take users seriously, or making unprofessional and
condescending remarks can cause adversarial relations to develop, and can
undermine your ability to do your job.
There is also a saying that 75% of fixing a problem is fixing the user. If the
user does not agree that the problem has been taken to its conclusion (whether
the problem has been corrected, or you have explained to the users satisfaction
that a fix is impossible for the following technical, financial, or political
reasons), then you have not ended this support issue.