10 years agoadd upload stub, expose query functions with plain text templates,
Stephen Soltesz [Fri, 28 Aug 2009 22:15:30 +0000]
add upload stub, expose query functions with plain text templates,
add extra fields to query selection

10 years agoadd plain-text versions of the query output and nodelist as a test
Stephen Soltesz [Fri, 28 Aug 2009 21:56:47 +0000]
add plain-text versions of the query output and nodelist as a test
removed inter-site link on actionlist
report all errors by moving bmlog link to 'message' column in detailview

10 years agonot sure how these got past. remove monitor module references from pcucontrol.
Stephen Soltesz [Thu, 27 Aug 2009 18:50:13 +0000]
not sure how these got past. remove monitor module references from pcucontrol.
convert unicode strings into standard strings.

10 years agodoh. used m5sum not sha1
Stephen Soltesz [Tue, 25 Aug 2009 15:15:18 +0000]
doh.  used m5sum not sha1

10 years agolatest 1.6.5 version
Stephen Soltesz [Tue, 25 Aug 2009 15:12:45 +0000]
latest 1.6.5 version

10 years agoadd pcu_name to pcufailed_notice
Stephen Soltesz [Thu, 20 Aug 2009 17:55:12 +0000]
add pcu_name to pcufailed_notice
added pcuerror_notice
added convenience functions to FindbadPCURecord
added pcuerror_notice to policy.py
create a different action type for each bootman_restore operation.
added better formatting to actionsummary, so all types of actions are displayed
fixed nodebad logic so that a node leaves the 'disabled' status when it's boot_state changes.

10 years agoSetting tag Monitor-3.0-19
Stephen Soltesz [Mon, 17 Aug 2009 22:51:21 +0000]
Setting tag Monitor-3.0-19
Major increment -
  adds multiple features and web changes
  adds new fields to db
  improved layout
  general improvements otherwise

10 years agochanged 'monitordebug' to failboot
Stephen Soltesz [Mon, 17 Aug 2009 22:03:47 +0000]
changed 'monitordebug' to failboot
added run_level to nodebad record
added site_id to database
added DRAC6 support
added GeorgeTown PCU to BayTech to catch their wacky PCU.
add extra sequences too bootman to catch fsck error
remove setting nodes to 'disabled', leave them in failboot
send exception and Unknown Error messages to config.exception_email
disable synccheck in automate-default.sh script

10 years agoadded advanced query
Stephen Soltesz [Mon, 17 Aug 2009 21:49:31 +0000]
added advanced query
added simple and detailed view rather than single, 'pcuview'
simplified nodelist
clarified some labels
added error page for quickjumps without a target
added site:* and node:* queries for quickjump:
now you can see all sites that match a pattern:
    site:mlab*  or node:*clara*

10 years agoadded firewall checks and notices
Stephen Soltesz [Tue, 28 Jul 2009 22:22:07 +0000]
added firewall checks and notices
added extra views for a simple and detailed view for sites
added firewall_notice to emailTxt
added extra fields for firewall, external_dns_status, as well as several other
fields reserved for future use, such as traceroute, uptime and rpms.
updated policy to act on firewalled nodes
add dependency on 'nc' netcat for a more reliable nmap, port probing utility.

reorganized scanapi.py, hopefully can generalize this further in the future.

10 years agoadd email notice for 'failed twice' nodes
Stephen Soltesz [Tue, 28 Jul 2009 22:16:17 +0000]
add email notice for 'failed twice' nodes
added extra bootmanager sequence string

10 years agoan upgrade script to extend convert existing databases
Stephen Soltesz [Mon, 27 Jul 2009 19:58:58 +0000]
an upgrade script to extend convert existing databases

10 years agosetup default values if they're not already set.
Stephen Soltesz [Mon, 27 Jul 2009 16:32:15 +0000]
setup default values if they're not already set.

10 years agoadd a 'delete_recent' function to ActionRecord
Stephen Soltesz [Mon, 27 Jul 2009 16:23:41 +0000]
add a 'delete_recent' function to ActionRecord
collect 'uptime'

10 years agomake zabbix import conditional
Stephen Soltesz [Mon, 27 Jul 2009 16:22:43 +0000]
make zabbix import conditional

10 years agoadd code to automatically handle manual FSCK errors.
Stephen Soltesz [Mon, 27 Jul 2009 16:22:22 +0000]
add code to automatically handle manual FSCK errors.

10 years agoadd better node configuration message.
Stephen Soltesz [Mon, 27 Jul 2009 16:21:57 +0000]
add better node configuration message.

10 years agougg. output hangs on large file contents. truncate data. need a better
Stephen Soltesz [Mon, 27 Jul 2009 16:21:26 +0000]
ugg.  output hangs on large file contents.  truncate data.  need a better

10 years agoenable zabbix option before importing zabbix db
Stephen Soltesz [Mon, 27 Jul 2009 16:20:13 +0000]
enable zabbix option before importing zabbix db

10 years agoadd links for length of query for nodescanhistory view
Stephen Soltesz [Mon, 27 Jul 2009 16:19:22 +0000]
add links for length of query for nodescanhistory view

10 years agodisable zabbix_enabled flag by default
Stephen Soltesz [Thu, 23 Jul 2009 00:34:31 +0000]
disable zabbix_enabled flag by default
copy monitor logs files for archives

10 years agodon't create/enable X_monitor user.
Barış Metin [Tue, 21 Jul 2009 10:47:59 +0000]
don't create/enable X_monitor user.

10 years agosleep could probably be reduced, but the extra newlines are required.
Stephen Soltesz [Mon, 20 Jul 2009 23:04:43 +0000]
sleep could probably be reduced, but the extra newlines are required.

10 years agoadd a timeout command to the client package
Stephen Soltesz [Thu, 9 Jul 2009 22:04:11 +0000]
add a timeout command to the client package
new boot manager sequence to bootman.py
use timeout with scanapi operations

10 years agocorrect expired() logic. was reversed.
Stephen Soltesz [Wed, 8 Jul 2009 16:41:59 +0000]
correct expired() logic. was reversed.

10 years agouse sortLastContact from plekit
Barış Metin [Thu, 2 Jul 2009 14:27:13 +0000]
use sortLastContact from plekit

10 years agogive noop myops xmlrpc call a extra parameter to allow it to work with the
Stephen Soltesz [Thu, 2 Jul 2009 01:22:40 +0000]
give noop myops xmlrpc call a extra parameter to allow it to work with the
PLC() object which passes an auth struct to every call.
clear penalties for blacklisted sites.
fixed string in grep in RunlevelAgent
added crazy fix for new BayTech sequence.

10 years agoplace more default values in /etc/planetlab/*.xml
Stephen Soltesz [Thu, 2 Jul 2009 01:20:16 +0000]
place more default values in /etc/planetlab/*.xml

10 years agoexport PATH for cronjob to work and redirect script's error output to the log file too
Barış Metin [Wed, 1 Jul 2009 16:15:15 +0000]
export PATH for cronjob to work and redirect script's error output to the log file too

10 years agoimproved post server setup code in Monitor.spec
Stephen Soltesz [Wed, 1 Jul 2009 00:24:24 +0000]
improved post server setup code in Monitor.spec
improved blacklist.py
added module path to exception in plc.py
included correct functions file for monitor-server.init

10 years agoTagging module Monitor - Monitor-3.0-18
Stephen Soltesz [Mon, 29 Jun 2009 00:12:44 +0000]
Tagging module Monitor - Monitor-3.0-18
bug fixes.
improved templates and views
cleaned controller code for web
added IPMI requirement to pcucontrol package.

10 years agobug fix in summary template
Stephen Soltesz [Mon, 29 Jun 2009 00:10:21 +0000]
bug fix in summary template
new access method for fbnode
removed fs_status check b/c it caused false negatives for down nodes that
weren't due to the check hanging.
added a new reboot check.

10 years agoremove rt3 from monitor
Stephen Soltesz [Mon, 29 Jun 2009 00:05:55 +0000]
remove rt3 from monitor

10 years agoremove rt from monitor package.
Stephen Soltesz [Mon, 29 Jun 2009 00:05:20 +0000]
remove rt from monitor package.

10 years agoadd an entry for IPMI in case db-config doesn't update.
Stephen Soltesz [Sun, 28 Jun 2009 23:58:30 +0000]
add an entry for IPMI in case db-config doesn't update.

10 years agorequire OpenIPMI-tools to provide ipmitool command.
Stephen Soltesz [Sun, 28 Jun 2009 23:44:32 +0000]
require OpenIPMI-tools to provide ipmitool command.

10 years agomake work with new libraries
Stephen Soltesz [Fri, 26 Jun 2009 23:42:15 +0000]
make work with new libraries

10 years agoadd a link to both types of node history, status and data
Stephen Soltesz [Fri, 26 Jun 2009 23:40:44 +0000]
add a link to both types of node history, status and data
add a quick search at the top of sitemenu for hostnames and loginbases

10 years agosimplify pcu bad logic
Stephen Soltesz [Thu, 25 Jun 2009 23:03:12 +0000]
simplify pcu bad logic
ignore additional fields in sitehistory record
just show days down rather than weeks.  weeks is confusing before 30 days

10 years agoadded references to google gadgets
Stephen Soltesz [Thu, 25 Jun 2009 23:01:59 +0000]
added references to google gadgets
added sitesummary and summary controls for gadgets
clarified sitehistory, and other history views
simplified controllers.py in many places, trying to make node() faster.

10 years agoconvert non-ascii character strings to just ascii
Stephen Soltesz [Thu, 25 Jun 2009 04:45:09 +0000]
convert non-ascii character strings to just ascii

10 years agoupdate LogoLinkURL in template.
Stephen Soltesz [Thu, 25 Jun 2009 04:44:33 +0000]
update LogoLinkURL in template.

10 years agocustom templates for RT
Stephen Soltesz [Thu, 25 Jun 2009 04:43:30 +0000]
custom templates for RT

10 years agouse https by default
Stephen Soltesz [Thu, 25 Jun 2009 04:43:00 +0000]
use https by default
add Custom Logo fields.

10 years agoremove unnecessary scrips and move queues to plc-template.
Stephen Soltesz [Thu, 25 Jun 2009 04:40:27 +0000]
remove unnecessary scrips and move queues to plc-template.

10 years agomake custom names make sense.
Stephen Soltesz [Thu, 25 Jun 2009 04:39:45 +0000]
make custom names make sense.

10 years agoadd template translation on all custom templates in rtconf.d/
Stephen Soltesz [Thu, 25 Jun 2009 04:38:41 +0000]
add template translation on all custom templates in rtconf.d/
copy custom templates from local dir into rt3 configuration dirs

10 years agocollect CCs from original email, with extra settings in RT_SiteConfig
Stephen Soltesz [Wed, 24 Jun 2009 23:00:54 +0000]
collect CCs from original email, with extra settings in RT_SiteConfig
no output from adduserstort.pl
move some scrips and templates from initialdata to planetlab.pl

10 years agoonly send unknownsquence messages to admin list
Stephen Soltesz [Wed, 24 Jun 2009 22:21:49 +0000]
only send unknownsquence messages to admin list
just use the PCU IP address if the hostname doesn't match the IP.

10 years agoadd filters to catch google gadgets
Stephen Soltesz [Wed, 24 Jun 2009 22:20:09 +0000]
add filters to catch google gadgets
add googlegadget templates.

simplify controllers.py by removing old code
add action* functions to controllers.py

10 years agofill-in action* pages
Stephen Soltesz [Wed, 24 Jun 2009 22:17:23 +0000]
fill-in action* pages
title pages correctly
remove nodeview and siteview since these are replaced by pcuview
add information to home page

10 years agoensure that runlevelagent starts when a new package is installed.
Stephen Soltesz [Wed, 24 Jun 2009 20:11:00 +0000]
ensure that runlevelagent starts when a new package is installed.

10 years agoRepeatedly try to AuthCheck() to avoid transient DNS errors and temporary
Stephen Soltesz [Wed, 24 Jun 2009 19:18:36 +0000]
Repeatedly try to AuthCheck() to avoid transient DNS errors and temporary
session variables out-of-sync with plc.

10 years agoprovide sitename to person query
Stephen Soltesz [Mon, 22 Jun 2009 19:53:38 +0000]
provide sitename to person query
install updates.

10 years agorun certain scripts periodically
Stephen Soltesz [Mon, 22 Jun 2009 19:51:09 +0000]
run certain scripts periodically
run certain scripts at startup
use plcquery instead of getusersfromplc
give 'priv' argument to adduserstort.pl

10 years agosimplify
Stephen Soltesz [Mon, 22 Jun 2009 18:51:28 +0000]

10 years agorename init to conf for inital configuration data
Stephen Soltesz [Mon, 22 Jun 2009 18:33:05 +0000]
rename init to conf for inital configuration data

10 years agoallow deployment-specific settings in rtinit.d/*.pl
Stephen Soltesz [Mon, 22 Jun 2009 18:18:06 +0000]
allow deployment-specific settings in rtinit.d/*.pl

10 years agotypo city.
Stephen Soltesz [Mon, 22 Jun 2009 15:52:46 +0000]
typo city.

10 years agoinclude PlanetLab-like scrips and templates by default for all queues.
Stephen Soltesz [Mon, 22 Jun 2009 15:44:09 +0000]
include PlanetLab-like scrips and templates by default for all queues.

10 years agobetter formatted auto-reply, with custom-field default value .
Stephen Soltesz [Sun, 21 Jun 2009 20:48:19 +0000]
better formatted auto-reply, with custom-field default value .

10 years agoadd cron script to sync plc db and rt every hour
Stephen Soltesz [Sun, 21 Jun 2009 20:12:25 +0000]
add cron script to sync plc db and rt every hour
install cron script.

10 years agosync users from PLC to RT db.
Stephen Soltesz [Sun, 21 Jun 2009 19:49:27 +0000]
sync users from PLC to RT db.
add sendmail configuration to monitor-rt3.init

10 years agowrong pattern.
Stephen Soltesz [Sun, 21 Jun 2009 00:15:26 +0000]
wrong pattern.

10 years agocleaned and improved monitor-rt3.init
Stephen Soltesz [Sun, 21 Jun 2009 00:04:18 +0000]
cleaned and improved monitor-rt3.init
removed unnecessary info in initaldata
added ENABLED checks to init scripts
added extra xml to Monitor.spec for default setup.
this should be fine until it's ready to go into myplc

10 years agoown new files
Stephen Soltesz [Sat, 20 Jun 2009 22:03:42 +0000]
own new files

10 years agotypo
Stephen Soltesz [Sat, 20 Jun 2009 21:57:06 +0000]

10 years agobreak apart init scripts for monitor, zabbix, and rt3
Stephen Soltesz [Sat, 20 Jun 2009 21:54:47 +0000]
break apart init scripts for monitor, zabbix, and rt3
add a monitor-rt rpm package
add templates for rt configs

10 years agodirectory for rt3 setup and configuration files
Stephen Soltesz [Sat, 20 Jun 2009 19:07:42 +0000]
directory for rt3 setup and configuration files

10 years agoadded templates for emailTxt
Stephen Soltesz [Sat, 20 Jun 2009 00:47:14 +0000]
added templates for emailTxt
added links in welcome page to site/node/pcus
added stub for actionsummary ; need to add links
fixed policy logic for disabled nodes
add an option to save an action record for messages that don't go through rt

10 years agoTagging module Monitor - Monitor-3.0-17
Stephen Soltesz [Fri, 19 Jun 2009 03:42:29 +0000]
Tagging module Monitor - Monitor-3.0-17
added bootmanager log links
addressed root cause of IntegrityErrors ; big deal
adjusted templates to accomodate fix for IntegrityErrors
added session.flush() to bootman.py to write out ActionsRecords
fixed policy to either pause penalties or apply them ; not both.

10 years agono idea how I clobbered this.
Stephen Soltesz [Fri, 19 Jun 2009 03:39:59 +0000]
no idea how I clobbered this.

10 years agopause penalties or apply them, not both...
Stephen Soltesz [Fri, 19 Jun 2009 03:36:49 +0000]
pause penalties or apply them, not both...
plcquery tool frame.

10 years agoadded bootmanager log for monitor-collected log files to be referenced by web
Stephen Soltesz [Fri, 19 Jun 2009 01:44:06 +0000]
added bootmanager log for monitor-collected log files to be referenced by web
added keyboard interrupt exception to nodequery

addressed root cause of IntegrityErrors from lousy code in controllers.py
no elements of the Objects are assigned.
this patch addresses functionality, need to clean up for clarity.

adjusted templates to use new aggregate object of nodes, pcus, and
'pre-processed' values

updated emailTxt to use just 'pattern' in www node urls

added session.flush() to bootman.py to write out ActionsRecords

10 years agoadd a directory
Stephen Soltesz [Thu, 18 Jun 2009 17:41:02 +0000]
add a directory

10 years agoTagging module Monitor - Monitor-3.0-16
Stephen Soltesz [Wed, 17 Jun 2009 21:06:11 +0000]
Tagging module Monitor - Monitor-3.0-16
Added Rpyc from 1.0 branch.
add pcuhistory
add setup-agent for password protected keys.
other minor improvements.

10 years agocopy Rpyc from 1.0 branch into trunk
Stephen Soltesz [Wed, 17 Jun 2009 20:38:04 +0000]
copy Rpyc from 1.0 branch into trunk

10 years agoadd a directory to run automated scripts by default
Stephen Soltesz [Wed, 17 Jun 2009 20:32:41 +0000]
add a directory to run automated scripts by default
add a script to setup the ssh-agent to allow for password protected keys

10 years agoadd a view for pcu history.
Stephen Soltesz [Wed, 17 Jun 2009 20:31:36 +0000]
add a view for pcu history.

10 years agoupdating with tweaks to live deployment.
Stephen Soltesz [Wed, 17 Jun 2009 20:29:40 +0000]
updating with tweaks to live deployment.

10 years agoTagging module Monitor - Monitor-3.0-15
Stephen Soltesz [Wed, 17 Jun 2009 17:10:36 +0000]
Tagging module Monitor - Monitor-3.0-15
automate install
auto-close tickets

10 years agoenable monitor in plc-config
Stephen Soltesz [Tue, 16 Jun 2009 20:35:34 +0000]
enable monitor in plc-config
add easy_install packages to update critical packages for monitor
auto-close a ticket whose status was changed in RT

10 years agoTagging module Monitor - Monitor-3.0-14
Stephen Soltesz [Fri, 12 Jun 2009 20:21:01 +0000]
Tagging module Monitor - Monitor-3.0-14
update web
update policy
added statistics dir

10 years agoconvenience, maintenance scripts
Stephen Soltesz [Fri, 12 Jun 2009 19:17:01 +0000]
convenience, maintenance scripts

10 years agoapply penalties
Stephen Soltesz [Fri, 12 Jun 2009 19:15:21 +0000]
apply penalties
send notices to users.

10 years agoadd scrappy statistics gathering scripts
Stephen Soltesz [Fri, 12 Jun 2009 16:27:07 +0000]
add scrappy statistics gathering scripts

10 years agouse 'pattern' argument to site and node pages.
Stephen Soltesz [Fri, 12 Jun 2009 16:22:49 +0000]
use 'pattern' argument to site and node pages.
add a link to ssl to allow the racadm binary to work.
add c++ compat library for racadm binary

10 years agoadd plc_nodeid and plc_siteid to history records so that the web templates can
Stephen Soltesz [Wed, 10 Jun 2009 21:54:45 +0000]
add plc_nodeid and plc_siteid to history records so that the web templates can
reference sites and nodes by id rather than name, since the site_pattern and
nodepattern references do not exist any more in the new 4.3 wwwgui.

10 years agoremove unnecessary informatiton on pcuview page
Stephen Soltesz [Wed, 10 Jun 2009 18:45:09 +0000]
remove unnecessary informatiton on pcuview page
add history for pcu
change 'increase penalty' to 'reduce privilege'

10 years agofixed what looked like a typo in pculist.kid, restores 'view all'
Stephen Soltesz [Tue, 9 Jun 2009 22:34:42 +0000]
fixed what looked like a typo in pculist.kid, restores 'view all'
use explicit model name, rather than translate names in pcucontrol/reboot.py
use TurboGears from f8 as base for install.
disable crazy pam module to let crond work correctly.
there is no pcu_id for messages.
run synciwithplc in automate-default

10 years agoTagging module Monitor - Monitor-3.0-13
Stephen Soltesz [Mon, 8 Jun 2009 18:37:34 +0000]
Tagging module Monitor - Monitor-3.0-13
remove plccache from controllers, all lookups from db.
reformat emailTxt messags
updated bootstates in bootman.py

10 years agoproperties and emailTxt
Stephen Soltesz [Sat, 6 Jun 2009 00:57:38 +0000]
properties and emailTxt

10 years agoactually delete site from db
Stephen Soltesz [Sat, 6 Jun 2009 00:56:12 +0000]
actually delete site from db

10 years agoupdates to emailTxt
Stephen Soltesz [Fri, 5 Jun 2009 20:42:35 +0000]
updates to emailTxt
typos in bootman and scanapi
fixed headers in nodehistory
removed missing pcu notices from policy

10 years agocontrollers should allow refreshes while findall is running.
Stephen Soltesz [Wed, 3 Jun 2009 23:00:03 +0000]
controllers should allow refreshes while findall is running.
add description to RunlevelAgent
something is failing in the monitor-server-deps post install process for
easy_setup dependencies.

10 years agostop using the cached values since they mess things up and keep the server out
Stephen Soltesz [Wed, 3 Jun 2009 21:36:43 +0000]
stop using the cached values since they mess things up and keep the server out
of sync with the db.  not sure what I as thinking ...

10 years agoTagging module Monitor - Monitor-3.0-12
Stephen Soltesz [Tue, 2 Jun 2009 21:33:54 +0000]
Tagging module Monitor - Monitor-3.0-12
tag of latest changes.
need to test end to end.

10 years agoadded comonquery command-line tool.
Stephen Soltesz [Tue, 2 Jun 2009 21:30:28 +0000]
added comonquery command-line tool.
added flush and clear commands to the beginning of each web entry point in
controllers.py; I think this will help address the IntegrityErrors seen here
and at PLE.
moved plccache to local functions to speed invocation of some calls (nodequery)
added several tags to bootman.py to help with new 3.0 BootManager issue.
moved bootman import in monitor/database/info/interface.py due to import
error.  still need to investigate this
added extra RPM checks to node environment checks.  This isn't put in the db
but the log files can be queried over time.

10 years agoSomething bizarre occurs with the dc7800 display after this runs. adding the
Stephen Soltesz [Tue, 2 Jun 2009 17:57:30 +0000]
Something bizarre occurs with the dc7800 display after this runs.  adding the
delay helps, but only slightly.  Apparently, disconnecting the display cable
and reattaching it clears up the problem.  No idea why.

10 years agoin nodes page: print the status computed by monitor instead of the status retrieved...
Barış Metin [Tue, 2 Jun 2009 11:48:35 +0000]
in nodes page: print the status computed by monitor instead of the status retrieved from PLC

--debug lines