Inventory

Timestamp:: Jun 5, 2008, 5:59:44 PM (18 years ago)
Author:: Joseph F. Miklojcik III
Comment:: —

Internal/Infrastructure/OMF/GridServices/Inventory

-              v8
+              v9
  * It should have an (optional) query qualifier (with interface to CMC) to return only the functional node set.
  * It should have ...
-It is important to consider the Inventory database schema part of the experimental interface.  The advantage of building a web service for Inventory is that it can provide commonly used queries that incorporate data from other OMF web services (namely the CMC, as in "available" in "give me all available intel nodes").  At the same time, it will be easier for experiment scripts to build custom SQL queries that implement complex criteria than to utilize a more abstract web service.  Furthermore, arguably, SQL is RESTful in the first place.
 == Service Configuration File ==
 …
 }}}
-== Inventory Gathering ==
-Inventory, is the "experiment" that uses a special much larger image, which can have a relatively large number of drivers for a wide variety of devices.  Inventory is used to do the slower more difficult job of generating informative scans of the pci and usb busses.
-As of 10/16/2007, we run the inventory every Wednesday during maintenance.  Plan is to, in addition, run Inventory whenever there is a change in hardware configuration.  There exists a full [source:/Inventory/inventory inventory package] that inserts "all" information into the database.
-  The gathering procedure uses:
- * lspci
- * lsusb
- * dmidecode
 === Known Device IDs ===
 Devices that are discovered by gathering procedure are uniquely identified by either [http://www.linux-usb.org/usb.ids USB ids] or [http://www.pcidatabase.com/vendors.php?sort=id PCI ids]. The ids for most important devices are listed in the ID table:
+Devices discovered by the gathering procedure are uniquely identified by either [http://www.linux-usb.org/usb.ids USB ids] or [http://www.pcidatabase.com/vendors.php?sort=id PCI ids]. The IDs of the most important devices are listed below:
 || Bus type || Vendor ID || Vendor || Device ID || Description ||
 …
 || -"- || 0x8086 (32902) || [http://www.pcidatabase.com/vendor_details.php?id=1302 Intel Corp.] || 0x4223 (16931) || Intel PRO/Wireless 2915ABG Network Connection ||
 || -"- || 0x168c (5772) || [http://www.pcidatabase.com/vendor_details.php?id=174 Atheros Communications] || 0x0013 (19) || Atheros AR5212, AR5213 802.11a/b/g Wireless Adapter ||
-|| -"- || 0x1106 (4358) || [http://www.pcidatabase.com/vendor_details.php?id=648 VIA Technology] || 0x3122 (12578) || VT8623 !CastleRock AGP 8X Controller (VGA) ||
-|| -"- || -"- || -"- || 0x0571 (1393) || Bus Master IDE Controller ||
-|| -"- || -"- || -"- || 0x3177 (12663) || VT8235 PCI to ISA Bridge ||
-|| -"- || -"- || -"- || 0x3104 (12548) || VT6202 USB 2.0 Enhanced Host Controller ||
-|| -"- || -"- || -"- || 0x3038 (12344) || USB&UHCI Controller ||
-|| -"- || -"- || -"- || 0xb091 (45201) || VT8633 PCI-to-PCI Bridge (AGP) ||
-|| -"- || -"- || -"- || 0x3123 (12579) || VT8623 CPU to PCI Bridge ||
-|| USB || 0x3f0 (1008) || Hewlett-Packard  || 0x0024 (36) || KU-0316 Keyboard ||
-We should look into turning inventory image into PXE image and avoid imaging phase completely!
 == Inventory Database ==
+The Inventory database lives on {{{internal1}}} and consists of the following tables:
+The Inventory database is a MySQL database on {{{internal1}}}.  The schema was designed to be general and scalable, so that it can support many different kinds of nodes and attached devices, and can hold inventory data for multiple testbeds.  In general, table names are plural nouns.  When the rows of a table represent physical objects or concrete ideas, as opposed to relationships or metadata, each row has a unique id.  When an id is referenced from another table, the name of the referencing column ends in {{{_id}}}.  Although it is not set in stone, the schema for the inventory database should not change significantly without advance notice on {{{orbit-dev}}}.  No data consistency constraints other than automatic ID number generation are implemented in the DBMS; all others live in the application code.  This keeps things flexible enough that there are lots of options when you find 400 nodes can't all successfully lock tables and rows in the database at once.
+ * device_kinds
+ * device_tags
+ * inventories
+ * locations
+ * motherboards
+ * nodes
+ * peripherals
+ * testbeds
+It is important to treat the Inventory database schema as part of the experimental interface.  The advantage of building a web service for Inventory is that it can provide commonly used queries that incorporate data from other OMF web services (namely the CMC, as in "available" in "give me all available intel nodes").  At the same time, experiment scripts that need complex selection criteria will find it easier to build custom SQL queries than to go through a more abstract web service.  Furthermore, arguably, SQL is RESTful in the first place.
+=== device_kinds table ===
+The database is used both as the authoritative source of data for the OMF inventory service and for maintenance purposes.  Most of the data is maintained automatically, but some data (such as the chassis ID number) cannot be collected automatically and must be entered by hand.
+|| Field        || Type        || Null || Key || Default || Extra          ||
+|| id           || int(11)     || NO   || PRI || NULL    || auto_increment ||
+|| inventory_id || int(11)     || NO   ||     || NULL    ||                ||
+|| oui          || varchar(8)  || YES  ||     || NULL    ||                ||
+|| bus          || varchar(16) || YES  ||     || NULL    ||                ||
+|| vendor       || int(11)     || NO   ||     || NULL    ||                ||
+|| device       || int(11)     || NO   ||     || NULL    ||                ||
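The {{{vendor}}} and {{{device}}} columns are integers, while the tools used for gathering print IDs in hexadecimal.  The following is a minimal sketch, not the code from the inventory package, of how one line of {{{lspci -n}}} output could be turned into the decimal values stored in these columns.  The function name, the {{{"pci"}}} bus label and the sample lines are assumptions made for illustration only.

```python
import re

# One line of `lspci -n` output looks like "00:0c.0 0280: 8086:4223 (rev 05)":
# slot, device class, then vendor:device, all in hexadecimal.
LSPCI_N_LINE = re.compile(
    r"^(?P<slot>\S+)\s+(?P<cls>[0-9a-fA-F]{4}):\s+"
    r"(?P<vendor>[0-9a-fA-F]{4}):(?P<device>[0-9a-fA-F]{4})"
)


def parse_lspci_n(text):
    """Return (bus, vendor, device) tuples with the IDs converted to integers.

    The "pci" bus label is a hypothetical value chosen for this sketch; the
    real inventory code may store something different in the bus column.
    """
    rows = []
    for line in text.splitlines():
        match = LSPCI_N_LINE.match(line.strip())
        if match is None:
            continue  # skip blank or unrecognised lines
        vendor = int(match.group("vendor"), 16)
        device = int(match.group("device"), 16)
        rows.append(("pci", vendor, device))
    return rows


# Invented sample input using two IDs from the Known Device IDs table.
SAMPLE = """\
00:0c.0 0280: 8086:4223 (rev 05)
00:0d.0 0200: 168c:0013 (rev 01)
"""

if __name__ == "__main__":
    for bus, vendor, device in parse_lspci_n(SAMPLE):
        print(bus, vendor, device)
```

The decimal values produced this way match the numbers given in parentheses in the Known Device IDs table, for example 0x8086 becomes 32902.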
+'''These schemas are still changing, so assume the actual database is more authoritative than this document.'''
+== Inventory Gathering ==
+=== device_tags table ===
+Inventory gathering runs like any other ORBIT experiment.  It uses a relatively large image that includes a thorough set of drivers for a wide variety of devices.  A new node undergoes an Enrollment process when it is first deployed; Enrollment is implemented as a small PXE image with just enough of a payload to associate the node location with an IP address and report it.  In contrast, the Inventory process does the slower, more complex job of generating informative scans of the PCI and USB buses.
+|| Field         || Type        || Null || Key || Default || Extra ||
+|| tag           || varchar(64) || NO   ||     || NULL    ||       ||
+|| peripheral_id || int(11)     || NO   ||     || NULL    ||       ||
+As of 10/16/2007, we run the inventory every Wednesday during maintenance.  We may also run it when there are significant hardware changes.
+=== motherboards table ===
+|| Field       || Type        || Null || Key || Default           || Extra          || Description                                   ||
+|| id          || varchar(64) || NO   || PRI ||                   ||                || UUID of the motherboard                       ||
+|| node_id     || varchar(64) || YES  || UNI || NULL              ||                || Link to 'id' in nodes table                   ||
+|| sn          || varchar(16) || NO   || UNI ||                   ||                || Manufacturer serial number of the motherboard ||
+|| hd_sn       || varchar(16) || NO   || UNI ||                   ||                || Hard drive serial number                      ||
+|| cpu_type    || varchar(X)  || YES  ||     || NULL              ||                || CPU type                                      ||
+|| cpu_speed   || int(11)     || YES  ||     || 0                 ||                || CPU speed in MHz                              ||
+|| memory      || int(11)     || YES  ||     || 0                 ||                || Memory size in MB                             ||
+|| hd_size     || int(11)     || YES  ||     || 0                 ||                || Hard disk size in bytes                       ||
+|| updated_on  || timestamp   || NO   ||     || CURRENT_TIMESTAMP ||                ||                                               ||
+|| updated_by  || varchar(64) || NO   ||     || ||                ||                                               ||
+The gathering procedure uses {{{lspci}}}, {{{lsusb}}}, {{{dmidecode}}} and {{{sysfs}}}.  The most frustrating and difficult part of maintaining the code is keeping up with changes in {{{/sys}}} over even minor kernel revisions.
+(NOTE: 'node_id' is NULL when this motherboard is not installed in any node, e.g. newly delivered parts or stored spares.)
+We could also move the hard-drive info into a separate table if we allow hard drives to be swapped between motherboards.
+=== nodes table ===
+|| Field       || Type        || Null || Key || Default || Extra          || Description ||
+|| id          || varchar(64) || NO   || PRI ||         ||                || UUID of the node (i.e. the chassis). ||
+|| chassis_sn  || varchar(16) || NO   || UNI ||         ||                || Manufacturer serial number of the node's chassis ||
+|| location_id || varchar(64) || YES  || UNI || NULL    ||                || Link to 'id' in 'locations' table ||
+|| updated_on  || timestamp   || NO   ||     || CURRENT_TIMESTAMP ||                ||                                               ||
+|| updated_by  || varchar(64) || NO   ||     ||  ||                ||                                               ||
+(NOTE: 'location_id' is NULL when this chassis is not installed at any location, e.g. newly delivered parts or stored spares.)
+=== locations table ===
+|| Field       || Type        || Null || Key || Default           || Extra          || Description ||
+|| id          || varchar(64) || NO   || PRI ||         ||                || UUID of the location ||
+|| x           || int(11)     || NO   ||     || 0                 ||                || ||
+|| y           || int(11)     || NO   ||     || 0                 ||                || ||
+|| z           || int(11)     || NO   ||     || 0                 ||                || ||
+|| unit        || int(11)     || NO   ||     || 0                 ||                || ||
+|| testbed_id  || varchar(64) || NO   ||     || 0                 ||                || Link to 'id' in 'testbeds' table ||
+|| updated_on  || timestamp   || NO   ||     || CURRENT_TIMESTAMP ||                ||                                               ||
+|| updated_by  || varchar(64) || NO   ||     ||  ||                ||                                               ||
+=== testbeds (resources) table ===
+|| Field      || Type        || Null || Key || Default || Extra  || Description ||
+|| id         || varchar(64)  || NO  || PRI ||         ||        || UUID of the testbed ||
+|| domain     || varchar(4)  || NO   || UNI ||         ||        || ||
+|| control_ip || varchar(12) || NO   || UNI ||         ||        || ||
+|| data_ip    || varchar(12) || NO   || UNI ||         ||        || ||
+|| cm_ip      || varchar(12) || NO   ||     ||         ||        || ||
+|| latitude   || int(11)     || NO   ||     || 0       ||        || ||
+|| longitude  || int(11)     || NO   ||     || 0       ||        || ||
+|| elevation  || int(11)     || NO   ||     || 0       ||        || ||
+|| updated_on  || timestamp   || NO   ||     || CURRENT_TIMESTAMP ||                ||                                               ||
+|| updated_by  || varchar(64) || NO   ||     ||  ||                ||                                               ||
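The tables above are linked through their {{{_id}}} columns, so a custom query of the kind an experiment script might build is a chain of joins from ''motherboards'' to ''nodes'' to ''locations'' to ''testbeds''.  The sketch below illustrates this under stated assumptions: it uses an in-memory SQLite database as a stand-in for the real MySQL database on {{{internal1}}}, it creates only the columns needed for the query, and all row values (ids, domains, serial numbers) are invented.  Since the schemas are still changing, check column names against the live database before reusing the query.

```python
import sqlite3


def build_demo_db():
    """Create a small in-memory stand-in for part of the Inventory schema."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE testbeds (id TEXT PRIMARY KEY, domain TEXT);
        CREATE TABLE locations (id TEXT PRIMARY KEY, x INTEGER, y INTEGER,
                                z INTEGER, testbed_id TEXT);
        CREATE TABLE nodes (id TEXT PRIMARY KEY, chassis_sn TEXT,
                            location_id TEXT);
        CREATE TABLE motherboards (id TEXT PRIMARY KEY, node_id TEXT,
                                   memory INTEGER);

        -- Invented sample rows; real ids are UUIDs.
        INSERT INTO testbeds VALUES ('tb-1', 'grid'), ('tb-2', 'sb1');
        INSERT INTO locations VALUES ('loc-1', 1, 1, 0, 'tb-1'),
                                     ('loc-2', 1, 2, 0, 'tb-1'),
                                     ('loc-3', 1, 1, 0, 'tb-2');
        INSERT INTO nodes VALUES ('node-1', 'SN001', 'loc-1'),
                                 ('node-2', 'SN002', 'loc-2'),
                                 ('node-3', 'SN003', 'loc-3'),
                                 ('node-4', 'SN004', NULL);
        INSERT INTO motherboards VALUES ('mb-1', 'node-1', 1024),
                                        ('mb-2', 'node-2', 512),
                                        ('mb-3', 'node-3', 1024),
                                        ('mb-4', NULL, 2048);
        """
    )
    return conn


# Inner joins drop spare parts: a motherboard with a NULL node_id or a
# chassis with a NULL location_id never matches.
QUERY = """
SELECT l.x, l.y, m.memory
FROM motherboards AS m
JOIN nodes AS n ON m.node_id = n.id
JOIN locations AS l ON n.location_id = l.id
JOIN testbeds AS t ON l.testbed_id = t.id
WHERE t.domain = ? AND m.memory >= ?
ORDER BY l.x, l.y
"""


def nodes_with_memory(conn, domain, min_memory_mb):
    """Return (x, y, memory) for installed nodes in one testbed domain."""
    return conn.execute(QUERY, (domain, min_memory_mb)).fetchall()


if __name__ == "__main__":
    demo = build_demo_db()
    print(nodes_with_memory(demo, "grid", 1024))
```

With MySQL the same SELECT statement should work through any DB-API driver, though the parameter placeholder is typically {{{%s}}} rather than {{{?}}}.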
+== Notes ==
+The design goal of this schema is to allow the Inventory database to serve a dual purpose:
+  * a source of information for user experiment scripts
+  * a 'real' hardware inventory giving operators information on which piece of hardware (chassis, motherboard) is used (or not) in which testbed/location.
+The entries in the ''testbeds'', ''locations'', ''nodes'' tables are manually created and updated by operators, when:
+  * a new testbed is being deployed
+  * a new location is added to the testbed (e.g. physical place-holder creation on a sandbox testbed for future addition of a third node)
+  * a newly purchased chassis (i.e. an empty node box) is delivered, mounted at a new location, or moved from one location to another
+We do not expect these events to happen very often, so it should be acceptable to make the operator responsible for creating and updating the related entries.  (The operator could also use scripts to do this job.)
+The entries in the ''motherboards'' table are also created manually, upon delivery of a newly purchased motherboard. The only field that the operator needs to fill in by hand is ''node_id'', which is set when the operator installs the motherboard inside a node/chassis. All the other fields are automatically populated by the Inventory process (i.e. the scripts in the inventory package).
+The ''interfaces'' and ''devices'' tables are created and updated as in the previous schema.
+These schemas have a tendency to change, so assume the actual database is more authoritative than this document.
+The current inventory code can all be found in the image {{{inventory.ndz}}}.  Source is controlled by {{{git}}}.  At the time of this writing there is no functioning {{{git}}} server at WINLAB, so just mail patches to {{{jfm3}}} at {{{winlab.rutgers.edu}}}.