Randstad NOC Support in Boston, Massachusetts
date posted: Monday, June 19, 2017
job type: Temp to Perm
About our NOC:
We are actively involved in maintenance and changes, including code deploys.
We are proactive and look to improve all of Production Operations and Engineering.
We have a wide range of roles and tasks within the team.
We coordinate incident management, recommend changes, and do technical projects.
We build our own alerts, find anomalies, fix things, and ask why something broke.
About you (Requirements):
You are self-directed, analytical, and do your best even if no one is watching.
You have prior technical experience in a NOC or a similar responsive role.
You can triage multiple issues simultaneously and work well under pressure.
You want to expand your skills and learn new things.
You are not afraid to ask questions or speak up.
You can communicate well with other technical people as well as business teams.
You have a sense of humor and are an accurate shot with a Nerf gun.
We have a wide variety of prior backgrounds in the NOC, so our Qualifications are not must-haves, but the more the merrier. Experience with:
Network troubleshooting and concepts, such as proxy servers and load balancing.
Network monitoring software, such as Zabbix, Nagios, or the ELK stack (Elasticsearch, Logstash, and Kibana).
Linux, Bash, and general comfort with the command line.
Windows Server, Microsoft SQL Server, Active Directory, and PowerShell.
Version control tasks such as merging, reverting, and rolling back trains, plus an understanding of Git, SVN, or other version control software.
Time series data platforms like Graphite, Grafana, OpenTSDB, and StatsD.
Other technologies are a plus: RabbitMQ, Jenkins, Puppet, Chef, or similar systems.
Ticketing systems like ServiceNow, Jira, or similar, and collaboration tools like Confluence.
The ability to translate end user language into your technical understanding.
The Second Shift team in the NOC covers from 3PM to 11:30PM or later as needed. The primary Responsibilities are:
Keep the lights on across our production systems, drawing on a wide variety of alerting sources and your own intuition.
Respond to issues of all sizes, from major outages to minor alerts, and resolve them or reach out as needed to keep all of our Engineering operations healthy.
Work with subject matter experts to learn new skills and bring them "home" to the NOC.
Create new alerts and tweak existing ones to improve our awareness and keep our responses consistent.
Coordinate incidents, including sending mobile alerts to other Engineering teams based on the size and scope of each incident.
Solve problems with an eye on preventing them from recurring.