(file) Return to narrative CVS log (file) Jump to this file's LXR Page (dir) Up to [CENS] / tos-contrib / sympathy

File: [CENS] / tos-contrib / sympathy / narrative (download)
Revision: 1.11, Mon Mar 21 06:46:52 2005 UTC (4 years, 8 months ago) by nithya
Branch: MAIN
CVS Tags: rdd_alpha_version_1, pregeonet, acoustic-05-18-06, PRE_TOSNIC_FIX, PRE_64BIT, MOTENIC_PRE_BUGFIX_20050415, LAURA_CALIBRATION_EXPERIMENTS, HEAD, ESS_RELEASE_3_5, ESS_RELEASE_3_4, ESS_RELEASE_3_3, ESS_RELEASE_3_2, ESS_RELEASE_3_1, ESS_RELEASE_3_0, ESS_RELEASE_2_0, ESS_CONNECTIVITY, ESS_CENTROUTE_TESTING, ESS2-CMS-V1_5_pretest, ESS2-CMS-V1_4cMergeSympathy_2, ESS2-CMS-V1_4c, ESS2-CMS-V1_4b, ESS2-CMS-V1_4a, ESS2-CMS-V1_3, ESS2-CMS-V1_2, ESS2-CMS-V1_1, EMSTAR_RELEASE_2_5, CYCLOPS_RELEASE_CANDIDATE_2_0, CYCLOPS_PRERELEASE_STABLE, CENTROUTE_EMSTAR_SOCKETS, BG_1_0, BANGLADESH_ARSENIC_1_2, BANGLADESH_ARSENIC_1_1, AMARSS_JR_DEPLOYMENT_6_05_07
Changes since 1.10: +2 -0 lines
Works with sympathy_devel now!

Ahas:
1) Knowing number requests vs how many it tried to send vs... helpful!
2) Knowing how many of each type of packet very helpful - often just certain
	type of packet is not making it through!
3) Having events is not very helpful so far - only metrics
4) Giving the positives is helpful as well as the negatives - maybe have
   a file where i summarize positives? but TOO MUCH DATA - how do iake
   it a useful summary - what goes in a useful summary?
5) Setting check period to 2xrequest period.
6) BIST - even for sympathy - by running troubled_node checks!
	- just looking at metrics reported about each app

Solutions:
1) Start/stop topo-data

Problem:
1) Not getting sympathy responses, even from one-hop neighbors, but getting 
beacons from them. 
Measured: 
- number of requests they are getting (they were getting requests),
	they just weren't sending responses.
- then added, number of times tried to send but failed, also very few.

Things that help:
- nice to have space to put some external variables - during pre-dep
  debugging
- should have sympathy periodically send metric info with these
vars - can use this to see if sympathy just not getting tasked or
somethign else going wrong.
- use sympathy request/response as indicator of whether motes can
be reached and be tasked - cuz sympathy uses same routing overlay

2) Not getting sympathy responses
  - see reported number of sympathy requests received - this needs to
  	be periodically just broadcast without being requested.

3) Not getting sypathy respnoses
  - periodically have metrics sent without request

4) Not getting sympathy requests:
  - answer questions:
     - are we getting regular automated metric returns from node?
        - if no: goto check_routing
	- if yes: is node transmitting requested metrics?
	   - if yes: is the node transmitting sympathy events?
	      - if no: is the node receiving sympathy requests?
	         - if no:
	         - if yes: are they continuing to increase even though number
	      		events transmitted are not?
	      - if yes:
           - if no:

      - check_routing:

5) Fault detection: identify nodes that are not receiving sympathy
	requests, or not sending as many event updates as they have 
	received - is this because their last event hours is not
	same as others?
** Separate number requests msgs rcvd into: number requested
    events rcvd vs number requested metrics rcvd, and
    specify number requests sent out!
** for nodes that havent sent many events:
     - check how many events were transmitted compared to
           how many were received!
     - for these nodes, find nodes with same next-hop
     - else check how many requests they received
     - else see if last-event time is just < time-awake (so maybe
         just dont have events to send)

6) Too many errors (caught that with num errors) on nodes with node-2 as a next-hop.
so added jitter to response to flooded response

7) Wasn't getting metrics or data from a node, BUT i was getting
other data packets (beacons/neighbor-lists etc) - so added this to
the test.

8) observe performance degrade (i.e. stop getting data), and then
  node would reboot

Design:
start/stop symp-requests - command-line/status-device
check for reboot! (ensure its not just a roll-over, but a reboot!)

Why is wsn different:
- energy debugging, performance/network usage assertions, 
functionality debugging all collide -
	how do we express those needs/rules -even dynamically, how do you assess

X graph time awake
X graph its fault-category
Classify nodes:
- we are not getting data from node
   - nodes have no data to send
   - data is not getting through
- performance: nodes sending data, but we arent getting a lot of it
- nodes that are fine


CENS CVS Mailing List
Powered by
ViewCVS 0.9.2