home
 
 
 
Forecasting
© Charles Chandler
 
Page: 1  2   3 
'10-10-08, 06:22
 
Charles Chandler
Baltimore, MD
 
 
This topic is for discussion of the graphical, online Tornado Outlook Contest at charles-chandler.org. This is a continuation of the discussion on another one of my bulletin boards:
 
 
Doesn't really seem like anybody is participating in this contest anymore, but still, it was a cool idea, and I'm still open to suggestions.
'11-04-06, 04:37
 
nje310

I'm going to keep "My Great Title" as the subject line on my first post here, because I like it.

I'm still participating in the contest, and I enjoy it. It would be cool if we could get more people to participate somehow.

I just had one question right now. Is there a slight discrepancy in the map and the actual coordinates of a box you draw and posted tornado reports? Each time I draw an outlook, the actual point that is plotted is slightly north just slightly west of where I clicked to plot a point. Over time, I have learned to click a bit southeast of where I want my points, or manually alter the coordinates for a point such that it is in the correct location on the map.

That said, even if my outlook is where I want it to be when I see it on the map, I do not know, unless I plot the exact coordinates somewhere else, if my outlook is where it should be, and thus whether tornado reports I may expect to be inside my box are actually outside. Tornado reports do not seem to plot correctly on the map either (again, plotting a bit southeast of where they should be).

For example, on my most recent outlook, valid Monday, April 4, my outlook when viewed on the map clearly extends south of the Mississippi coastline, as I intended. A tornado was reported very near the Mississippi coast (see SPC's reports on their website) near Ocean Springs, MS (see report at end of this message). This is clearly on land, but the Tornado Outlook Contest maps show the report to be in the Gulf of Mexico, off the coast and outside my outlook box. I'm not sure if the map is incorrect, or the report is plotted in the incorrect location, but this is a report that should be in my outlook (on land) but is in the water instead. How is this happening?

0145
5 E OCEAN SPRINGS JACKSON MS 3040 8872

ROOFING PEELED OFF OF A COUPLE METAL COMMERCIAL BUILDINGS IN FOUNTAINBLEAU AREA.WINDOW BLOWN OUT OF TWO HOUSE. LARGE SECTION OF TWO FENCES BLOWN DOWN. SEVERAL MEDIUM TR (LIX) (http://www.crh.noaa.gov/arx/Stormready/WFOlist.php)

'11-04-06, 15:08
 
Charles Chandler
Baltimore, MD
 
 
Hi nje310,
 
Glad to see somebody using the site. Originally, I thought that it would catch on faster, as it's a useful exercise for people interested in severe weather, and it's fun. So I figured that if I built it, the people would come. But what I've learned, from this and other sites, and from other people that I've talked to, is that somebody has to evangelize a site in order for it to catch on. Unfortunately, I'm just not good at that. I'm a programmer, not a salesperson — what can I say? :) Anyway, you have to tell 2 friends. Then, if it actually has intrinsic merit, it will catch on. But if you just rely on people finding it with Google searches, and word spreading that way, it won't. Most people are going to look at how many other people are on a site, and judge the value that way. So who is going to get on board, when nobody else is already on board? :)
 
Yes, there are some minor inaccuracies. I'll see what I can do... :) The problem is that the map didn't come with the exact parameters for the Lambert Conical Projection that was used to generate it. So I just guessed at the parameters until I got it close. The inaccuracy varies east-west and north-south. After hours of guessing at the projection, scaling, and rotation, I figured I had it close enough. But I can generate a new map with known parameters, so I'll try that. Give me a few days.
 
Charles
'11-04-10, 05:49
 
Charles Chandler
Baltimore, MD
 
 
OK, I fixed the plotting inaccuracy, to within +/- 1 pixel. So now the tornadoes that are over land will be... over land, not water. :) The plotting on the Unisys maps still needs work. (I don't know if you use them or not, but in the outlook editor, you can select the map type, where you get options for 850 mb, 300 mb, and RH/Lift. These maps are from Unisys.) They use a different projection, and I haven't figured out the parameters for that yet.
 
The pick-points are still a bit rough. Basically, the map is divided up into an 8-pixel grid, and wherever you click, it finds which 8x8 square you clicked in. If I re-write the point-picker in JavaScript, I can get it accurate to the pixel, but that will take more time.
 
BTW, I was thinking about altering the scoring algorithm. There have been a number of very high-quality outlooks, but they got bad scores because of a few tornadoes on the other side of the country, so there was a big "dst missed" factor. So I'm thinking that a few stray tornadoes shouldn't hurt a good outlook, and that perhaps I should do something like disregard a certain number (or percentage) of the tornadoes that were the furthest away. So I'll look into that.
'11-04-11, 03:30
 
nje310

Thanks for the fix.  I'm not so sure that it didn't move past outlooks themselves, but that's a minor issue.  For example, my last outlook for April 4 I know was south of the Mississippi coast.  While the tornado report was moved north to the coast, my outlook box was also moved north to not include the entire land area of the state.  Again, minor issue, but if it's fixed for future outlooks, then I won't complain too much.

I don't use the Unisys maps.  I have seen them on there, and looked at them, but I don't use them at all.  I generate my outlook area based on different places I look at (I weigh heavily on the SREF I told you about earlier) and have that in mind before I go to this site to draw an outlook.  Also, if the pick-points aren't exactly correct, that's no big deal either as I have been manually altering the coordinates to correct for this when I do the outlook.  So don't worry about that, at least for me.

I've encountered at least one such outlook myself that you mention with a tornado on the other side of the country, notably one in which there was a weak tornado in Oregon (tornadoes here in the Pacific Northwest are rare and always weak!) and I got a high dst missed score.  Disregarding a certain number or percentage or tornadoes may fix that issue, but may cause other issues (?).  I've had some thoughts about scoring change proposals as I've gone through this, but I'll have to remember what they were.  Many of them though were "this doesn't seem right; I'll have to think of a way to make this make more sense" - so that I didn't have a specific change yet in mind.

'11-04-13, 18:48
 
Charles Chandler
Baltimore, MD
 
 
I fixed the sloppy pick-point thing, so now you'll get what you pick, to within a pixel or two. This will save the manual tweaking that you and I have been doing. :) (In the process I let a test outlook slip past the deadline, so I got a really bad score on that one — d'oh!) I still haven't fixed the Unisys maps, but you said that wasn't an issue for you. I like seeing the outlook poly on the various maps, but I know what you're saying about there being better maps elsewhere, so just seeing 300mb/850mb/Lift info isn't the whole story. Anyway, I use them, so eventually I'll get that straightened out.
 
I was thinking about overlooking tornadoes more than 1500 miles away, or something like that. The reasoning there would be that a tornado that far away was definitely part of a different system. Since you can only define one outlook poly, you should only be scored on the basis of how much activity occurred as a result of the one system that you studied. Anyway, I'll think about it some more.
'11-04-15, 03:44
 
nje310

Actually, limiting the distance is a good idea... maybe even less than 1500 miles away, especially if it's in an east-west direction, because of the different system thing and the fact that you can only draw one polygon.

If this was a contest really intent on predicting tornado locations, there would also be negative points for tornadoes that occur when no outlook was made at all, but I know this is no such contest, and many people don't dedicate their lives to it either!  haha.  Take this week for example - tornadoes are occurring across the southern Plains and southeast today and tomorrow, but I'm at a conference this week and have had no time to look at this recently.  I could do an outlook for tomorrow, but I have found it more beneficial point-wise to do the outlooks further in advance due to the lead-time factor.

One thing I did think about was single tornadoes vs. outbreaks.  If you draw a small outlook polygon and a single tornado falls in the area - for example, my February 1 polygon if it was moved one state east, or my May 7 outlook from last year, you get 100% of the tornadoes in the polygon and a very good score with a small area.  Fine, that makes sense.  But lets say you expect a large number of tornadoes, like an outbreak.  If it's something like the Super Outbreak of April 3-4, 1974, you would have had to draw a large polygon from the Gulf Coast to Michigan to capture every single tornado, and you would get a lot less points because your polygon was so big.  But it had to be big.  Even with not so large an outbreak, take my most recent polygon from April 4.  I expected a significant event, and I got the main area of tornadoes pretty accurately, but in order to have captured all of them, I would have had to draw a fairly large polygon from Mississippi/Louisiana north to central Ohio and east to North Carolina, a polygon that would have covered parts of at least 11 states and likely a few more unless I knew exactly where to cut the line to the west to avoid Indiana and Arkansas.  I have to wonder if the points lost for increase in area and perimeter would have been worth the points gained for percentage of events and dst missed.  Probably, but even so, it's really tough to draw such a large area knowing you are starting at at a lower value.

On a similar note, I'm guessing it's probably a lot of work to work tornado strength in the scoring.  Many of the Enhanced Fujita ratings aren't determined until NWS survey teams go out into the field to do the surveying, I know, and are often not added to the local storm reports themselves.  But there could be one isolated "weak" EF0 tornado within a small outlook polygon that could give a very high score, while in an outbreak, you could have drawn a polygon in an area that captures every single EF2, EF3, EF4, and EF5 tornado, with some weaker ones missing the polygon (for example), that would score lower even though you were able to pinpoint the strongest tornadoes during a large outbreak that may have been difficult to cover the entire extent.

That said, I worked for the National Weather Service briefly in the past, and from experience I know that though most are, not 100% of tornado reports that are sent out are 100% reliable.  There are also multiple reports sent out for the same tornadoes, especially the longer-track ones, near larger population centers, or where several chasers have converged on the same tornado.  There are tornadoes that go by with no reports.  There are funnel clouds that never touch the ground, or maybe even barely come out of a cloud, that are called in by the public to NWS offices as tornadoes, and thus are sent out as reports (those are not too common because of the training of storm spotters, however).  Of course, most of those are things that wouldn't affect the scores too much, and are things I try to take into account when I do outlooks (for example, on my most recent outlook, I intentionally cut out most of the entire southern Louisiana marshland/Mississippi delta south of the New Orleans area because of the scarcity of population there).

Those are some of the thoughts I've had.  When I remember more I'll include them.

'11-04-15, 06:28
 
Charles Chandler
Baltimore, MD
 
 
I agree about isolated vs. outbreak tornadoes. So I added a "density" factor (not as part of the scoring yet, but just another bit of information to look at). This is the number of events per 100 square miles in the outlook poly. You can also sort the outlooks by this factor, and you'll see that a small poly with a couple of events gets the same "density" score as a large poly with a lot of events, which is the way it ought to be. So I think I like this. I'm thinking that this should replace the percentage inside/nearby factors. The density factor still gives you partial credit for events within 100 miles of the outlook poly.
 
I'd just as soon rule out EF0s and EF1s, and base the whole thing on the major events (EF2+). But then you'd have to wait a couple of days to find out your score, and I don't even know where to get the data.
'11-04-17, 08:24
 
nje310

Most of the time you might have to visit the websites of the National Weather Service area where the tornado occurred, following their survey of its track, to find out the rating.  That can definitely get cumbersome, and not all offices do this for all weaker tornadoes.  Sometimes they will also go back and add it to the text of the storm report, but that's unfortunately not always the case.  At some point the tornado ratings do get to SPC, but I'm not sure if there is a place to find that data; I have not yet found one.  Here's an example (from April 4 this month) of EF-ratings that have been added to many, but not all, of the tornado reports if you look at the "Comments" column: http://www.spc.noaa.gov/climo/reports/110404_rpts.html.  Even some from last year haven't been added.  One comment about basing everything on major events (EF2+) though - even if you were to do that, I think past polygons would have to keep all tornado reports in the scoring because the polygons themselves might have been created differently if the forecaster knew that only the stronger tornadoes would be counted.

Question about the density factor - how do you think it might be used in the scoring?  And does it only count the events that fell in the polygon, or does it count those outside it as well (maybe you answered this that it counts those outside it within the buffer given partial credit down to 0 at the blue line)?  I've look at examples such as yours and my polygons for May 7 (and same time) last year and try to figure out which would get the higher score if the percentage factors are replaced by density, since your outlook polygon, which didn't include the one single report, has a higher density (albeit only slightly) than mine, which did include it.

I have another qualm about the density factor - well I guess even the way things are right now.  What if you think there are two areas of possible tornado maximum chances (caused by the same system) with not much of a chance between, say one in western Ohio and another in northern Alabama (things like this happen, such as in the big outbreak I was living in northwest Ohio for on November 10, 2002 - http://www.spc.noaa.gov/climo/reports/021110_rpts.html - but say this isn't an outbreak).  Because you can only draw one polygon, you draw one to encompass the entire area between northwest Ohio and Montgomery.  That's a fairly large polygon, but you might not be expecting a large number of tornadoes in that area, just a few in northwest or west central Ohio and a few in northern Alabama.  They aren't far enough apart to fall under your proposed distance rule (and in fact they could be caused by the same system as in that 11/10/02 case).  Do you draw a large polygon and risk a small density score?  Do you draw a smaller polygon around one of the two areas and risk a large dst missed score?  Or maybe you draw polygons around the two areas connected by a very narrow sliver between the two such that it is one polygon with not too large a density score (but that would increase the negative perimeter score)?  This kind of situation is not limited to the proposed density score, it's an issue right now as well with the area and perimeter scores.

I have another suggestion - how hard would it be to include the year in the dates for each outlook?

One more thing.  My browser remembered my password for this site for well over a year, and today it wasn't there anymore.  For the life of me I couldn't get it right, apparently, so I created a new user name, cumulonimbus, as I really wanted to get an outlook out for Tuesday (I would have liked to do it a day ago but was too busy unfortunately).  Is there any way to put this new outlook under my nje310 name and secretly get me my password?  haha!

'11-04-17, 13:07
 
Charles Chandler
Baltimore, MD
 
 
OK, so I'm definitely not going to make a habit of tracking down EF ratings for tornado reports, if I have to poke around for them. Sometimes I have the time to futz with this, and sometimes I don't, so I have to set up the site such that it will run itself in my (sometimes protracted) absence. Anything that requires ongoing consistent manual labor on my part is just not something that I can commit to. So we're not going to weight reports on the basis of EF ratings. :(
 
The outlooks for 2010-05-07, 18:00 CST are a good example of how the density factor would work. There was only 1 tornado, in Ohio. It was fully contained in your poly (100% credit), and only 11 miles outside of mine, so I got 89% credit (since the credit fares from 100% in or on the poly, to 0% at 100 miles from the poly). And my poly was slightly smaller than yours.
 
The formula is (tornadoes inside/nearby) / (poly area)...
 
yours: 1 / 142138 = .000703541629 (~ .00070)
 
mine: .89 / 126084 = .000705878621 (~ .00071)
 
So this looks like it is correct, even though it's a rare case where I got a better score than you. :)
 
As concerns multiple areas of interest, you're right — there's no good solution if you can only draw one poly. Eventually I'll provide support for multiple polys. I was even thinking about having multiple polys, where you could assign different probability factors. So you could do a 2% poly, and a 5% poly, etc. Then you would get weighted credit for each poly. That would give you the ability to express your intent a lot more specifically. So I'll see what I can do. In the meantime...
 
Let's think about doing away with all of the existing penalties (poly area, perimeter, and dst missed). Rather, I think it should be like this:
 
(lead time bonus) + (density bonus) + (percentage inside/nearby bonus) = (score)
 
Sort the maps by density, and look at how it works out. You can see visually that the outlooks that really had the right idea score the highest, with tight polys that enclosed a lot of tornadoes, while the huge polys that missed most of the tornadoes — like mine usually :) — got bad scores. So density should definitely be the key factor.
 
The reason for the lead time bonus is obvious — it makes you gamble on model maps that are further in advance.
 
The reason to also factor in the percentage inside/nearby is less obvious, but if you think about it, it makes sense. The density factor already takes this into account, because that's events / area. But if that's all you had, people would just select a point in the middle of the highest activity area. That would be cool, but there's already a contest like that (the Virtual Storm Chase, which I think you participate in). So the difference with the Outlook contest is that you have to define the region, not just the highest activity point. So you still have to give extra credit for the percentage of events inside/nearby, forcing people to select a region instead of a point.
 
So I think that these are the factors to look at. Then, we just get rid of the poly area penalty, because it's already taken into account in the density factor. We also get rid of the perimeter and dst missed penalties. So if there are two different areas of activity, you just pick one, and neglect the other, and it won't hurt your score if you missed a bunch of events, nor will you have to draw a big or weird poly to enclose both areas of interest.
 
I'll work on getting this all hooked up, so you can see.
Page: 1  2   3 


← PREV Powered by Quick Disclosure Lite
© 2010~2024 SCS-INC.US
NEXT →