meeting 2025 04 17 n57 - JacobPilawa/TriaxSchwarzschild_wiki_5 GitHub Wiki

Context

Following up on our quick meeting yesterday with some additional diagnostics and generally cleaning up the results so far. I've gotten the masked grid up to speed (after some Expanse scheduler issues), so I now have all the model points where yesterday I had been lacking a few.
Takeaways:
- I think the largest takeaway from these new diagnostics is related to the outermost bins in the masked case. We saw hints yesterday that masking these outer 4 bins seemed to push sigma quite high and wanted to look at this systemtically.
  - I've included 10 radial plots below from the 10 best models (in terms of NNLS chi2), and it does seem like masking these outer 4 bins results in the models inflating sigma/deflating h4 to a quite extreme degree. By masking these outer bins the chi2s are essentially 0 because of the large errors, but I do worry that the models are likely adding in a bunch of extra mass (either via increasing M/L or rho0) to fit the unmasked bins better, resulting in the outer most bins being so inflated in sigma (>300 km/s) and deflated in h4 (very negative near -0.1, well outside our fiducial error bar). This could potentially be why our new posteriors seemed to push the M/L and rho0 toward the boundary.
  - I think I'm more comfortable presenting results where this outer bin is misift rather than masked since I increasingly do believe the models are artificially adding a bunch of mass with the added freedom of not needing to fit this outermost bin. It's also encouraging that, when we include the outermost bins, the results from GPR + dynesty in which we include everything is in nice agremeent with the case where we don't count these outer 4 bins in the chi2 calculation.

Diagnostics

1d panels with missing models

Here's are the plots from yesterday, this time with the additional models (including the low point near ~1.15e10): images/250417/1d_panels.png

1d Panels	1-to-1	Residuals
[images/250417/1d_panels.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/1-to-1.png)	images/250417/residual.png

Radial Profiles + Orbital Composition + Anisotropy

I then grabbed the ten lowest chi2 (from the outer removed case) models and plotted their radial profiles, along with the corresponding model which has the outer 4 bins masked:
- I think this confirms broadly what we were seeing yesterday -- in the cases where we mask the outer 4 bins, the sigma profile in the outskirts gets inflated quite high to >300 km/s. This is quite different from the non-masked case, in which sigma is certianly misfit but not nearly as inflated.
- A similar trend is seen in h4, where the unmaksed cases seem to be within the errorbars of our data, whereas the masked cases push h4 to very negative values (near -0.1), well beyond our h4 errorbars.
Because the models seem to be pushing sigma/h4/h6 around so much, I'm less convinced now that we should be masking these outer bins. On one hand, it's a shame that the models are not able to fit this outer most point, but on the other hand, at least we know the models aren't putting an artificial amount of mass (either via M/L or rho0) in attempts to ignore this outermost bin while fitting the other bins much better.
- I think it's also encouraging that the case where we include all bins vs. exclude the outer 4 bins crudely are in good agremeent.

Params [BH/1e9, ML, Rho0/1e9, T, Tmaj, Tmin]	Unmasked	Masked
[9.72, 1.71, 3.39, 0.311, 0.916, 0.024]	[images/250417/best_unmasked_38.png]] ](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/best_masked_646.png)
[11.49, 1.46, 3.38, 0.387, 0.919, 0.012]	[images/250417/best_unmasked_802.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/best_masked_443.png)
[8.78, 1.73, 3.26, 0.381, 0.89, 0.014]	[images/250417/best_unmasked_611.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/best_masked_866.png)
[9.6, 1.67, 3.22, 0.308, 0.897, 0.016]	[images/250417/best_unmasked_484.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/best_masked_522.png)
[9.24, 1.65, 3.18, 0.304, 0.915, 0.018]	[images/250417/best_unmasked_752.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/best_masked_360.png)
[9.78, 1.7, 2.96, 0.342, 0.891, 0.002]	[images/250417/best_unmasked_665.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/best_masked_297.png)
[9.88, 1.77, 2.97, 0.311, 0.951, 0.02]	[images/250417/best_unmasked_401.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/best_masked_827.png)
[9.36, 1.65, 3.4, 0.352, 0.9, 0.004]	[images/250417/best_unmasked_917.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/best_masked_568.png)
[9.42, 1.7, 3.17, 0.344, 0.911, 0.012]	[images/250417/best_unmasked_800.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/best_masked_226.png)
[10.3, 1.65, 3.08, 0.294, 0.893, 0.015]	[images/250417/best_unmasked_858.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/best_masked_887.png)

I also made equivalent plots for the anisotropies and orbit fractions for these same models:
- Within the region where we have data (marked very roughly by vertical dashed lines), the anisotropies and orbit fractions are quite similar to one another. However outside this region, particularly at large radii, the two cases start to diverge with the "unmasked" models preferring the outermost anisotropy to be quite negative (in this outermost mass shell). We typically have not paid much attention to the inner/outermost anisotropy shells, so it's not entirely clear if we should be concerned about this, assuming we're moving forward with the "unmasked" case.

Params [BH/1e9, ML, Rho0/1e9, T, Tmaj, Tmin]	Unmasked	Masked
[9.72, 1.71, 3.39, 0.311, 0.916, 0.024]	[images/250417/orbits_38-1.png]] ](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/orbits_646-1.png)
[11.49, 1.46, 3.38, 0.387, 0.919, 0.012]	[images/250417/orbits_802-1.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/orbits_443-1.png)
[8.78, 1.73, 3.26, 0.381, 0.89, 0.014]	[images/250417/orbits_611-1.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/orbits_866-1.png)
[9.6, 1.67, 3.22, 0.308, 0.897, 0.016]	[images/250417/orbits_484-1.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/orbits_522-1.png)
[9.24, 1.65, 3.18, 0.304, 0.915, 0.018]	[images/250417/orbits_752-1.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/orbits_360-1.png)
[9.78, 1.7, 2.96, 0.342, 0.891, 0.002]	[images/250417/orbits_665-1.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/orbits_297-1.png)
[9.88, 1.77, 2.97, 0.311, 0.951, 0.02]	[images/250417/orbits_401-1.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/orbits_827-1.png)
[9.36, 1.65, 3.4, 0.352, 0.9, 0.004]	[images/250417/orbits_917-1.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/orbits_568-1.png)
[9.42, 1.7, 3.17, 0.344, 0.911, 0.012]	[images/250417/orbits_800-1.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/orbits_226-1.png)
[10.3, 1.65, 3.08, 0.294, 0.893, 0.015]	[images/250417/orbits_858-1.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/orbits_887-1.png)

A slightly different set of plots, but one thing we wanted to look at was how the Grid C scalings compare when we include all the bins vs. naively remove them from the chi2 calculation, so I recreated the plots showing the scallops for these models here:
- One thing that is sort of interesting -- it seems like removing the outer 4 bins results in a preference for the high scalings, whereas including all the bins prefers the lower end of the scalings. I think this makes sense given the "masked" case seem to prefer higher masses in general, and thus these are preferentially better fit.

All bins Included	Outer 4 Excluded
[images/250417/ALLBINS-1.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/NOOUTER4-1.png)

Potentially a path forward

Given the radial moment plots for the masked case above (with sigma being inflated > 300km/s and the weirdness with h4), I think it might make more sense to present the results with all bins included. Sort of restating what I said above, but I'm definitely more comfortable presenting the results where this outer sigma is simply misfit rather than extremely inflated.
The nice thing, too, is that there is a rather nice agreement between the "all bins" case and the "outer 4 removed" case when we use the full set of models (4200, with 3200 of those having come from selecting the best scalings).
Here are some reminders about what these two cases look like (from the 4/15 page):

All Bins	No Outer 4 Bins
[images/250415/all.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250415/no_outer_4.png)

And here's a quick comparison of the GPR and dynesty results for K=40,60,80, and all using nu=1.5. I've included a vertical plot first followed by all of the posteriors.
- In general there is really nice agreement between the "all bins" and "no outer 4" cases here. There's a bit of movement in rho0 and M/L which I think is to be expected since we're removing our outermost kinematic bin (likely most influenced by rho/the DM halo).
- With that said, all of the parameters are consistent within the 1 sigma confidence intervals of our fiducial case.

Vertical Plot
images/250415/vertical_plot_no_Outer_4.png

And the actual posteriors:

	K=40	K=60	K=80
All Bins	[images/250410/250410_grids124_best_scales_grids_3_base_grid_alpha_K40_nu1.5-1.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250410/250410_grids124_best_scales_grids_3_base_grid_alpha_K60_nu1.5-1.png)	images/250410/250410_grids124_best_scales_grids_3_base_grid_alpha_K80_nu1.5-1.png
No Outer 4	[images/250415/250414_grids1234_best_scales_no_outer_4_grid_alpha_K40_nu1.5-1.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250415/250414_grids1234_best_scales_no_outer_4_grid_alpha_K60_nu1.5-1.png)	images/250415/250414_grids1234_best_scales_no_outer_4_grid_alpha_K80_nu1.5-1.png

And lastly, here's a "production" version of our cornerplots. This version specifically uses the "All Models, Best Scales" K=80 set of posteriors. Note that I re-ran this case with nIter=8, whereas the top version of the plot is nIter=1. This looks nice!

Production Cornerplot
images/250410/250410_nIter8_all_models_best_scales_grid_alpha_K80_nu1.5_testing-1.png

Email Follow ups

Nu Investigation on 4200 All Bins Models

I quickly ran some tests on the "all bins" case with the 4200 models from the panels above. I ran GPR and dynesty for K = 40,60,80 and nu=0.5,1.5,2.5, and infinity which are the typical cases we've looked at in the past (though we've normally quoted the nu=1.5 case):
- Takeaways/Results: There appears to be essentially 0 dependence on nu and K for the ~4200 "all bins" models!
First, here's a vertical plot comparing the 1/2/3 sigma regions for the different K's and nu's:

Vertical Plot
images/250417/vertical_plot.png

And here are the full set of posteriors:

	K=40	K=60
nu=0.5	[images/250417/250417_all_bins_grid_alpha_K40_nu0.5-1.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/250417_all_bins_grid_alpha_K60_nu0.5-1.png)	images/250417/250417_all_bins_grid_alpha_K80_nu0.5-1.png
nu=1.5	[images/250417/250417_all_bins_grid_alpha_K40_nu1.5-1.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/250417_all_bins_grid_alpha_K60_nu1.5-1.png)	images/250417/250417_all_bins_grid_alpha_K80_nu1.5-1.png
nu=2.5	[images/250417/250417_all_bins_grid_alpha_K40_nu2.5-1.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/250417_all_bins_grid_alpha_K60_nu2.5-1.png)	images/250417/250417_all_bins_grid_alpha_K80_nu2.5-1.png
nu=inf	[images/250417/250417_all_bins_grid_alpha_K40_nuinf-1.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/250417_all_bins_grid_alpha_K60_nuinf-1.png)	images/250417/250417_all_bins_grid_alpha_K80_nuinf-1.png

Nu Investigation on 4200 "Outer Removed" Models

Mostly for internal use, I ran the 4200 "outer removed" models through GPR + dynesty for a variety of nu's and K's, mostly to see the impact on the parameters when keeping all bins vs. crudely removing the outer 4. Here's a quick veritcal plot showing the All Bins vs. Outer 4 Removed tests:

Vertical Plot
images/250417/vertical_plot_all_cases-1.png

And here's the full set of posteriors when removing the outer 4 bins:

	K=40	K=60
nu=0.5	[images/250417/250417_no_outer_4__grid_alpha_K40_nu0.5-1.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/250417_no_outer_4__grid_alpha_K60_nu0.5-1.png)	images/250417/250417_no_outer_4__grid_alpha_K80_nu0.5-1.png
nu=1.5	[images/250417/250417_no_outer_4__grid_alpha_K40_nu1.5-1.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/250417_no_outer_4__grid_alpha_K60_nu1.5-1.png)	images/250417/250417_no_outer_4__grid_alpha_K80_nu1.5-1.png
nu=2.5	[images/250417/250417_no_outer_4__grid_alpha_K40_nu2.5-1.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/250417_no_outer_4__grid_alpha_K60_nu2.5-1.png)	images/250417/250417_no_outer_4__grid_alpha_K80_nu2.5-1.png
nu=inf	[images/250417/250417_no_outer_4__grid_alpha_K40_nuinf-1.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/250417_no_outer_4__grid_alpha_K60_nuinf-1.png)	images/250417/250417_no_outer_4__grid_alpha_K80_nuinf-1.png

Clarification on Grid C s=1.0 Models

I also wanted to clarify that the models which were missing from yesterday's grid don't actually change the contours from GPR and dynesty, and I've got some diagnostics for that below. First, here's a quick comparison of the 1d panels for the Grid C s=1.0 models alone for the cases we're considering (plotted to the same y-axis scale).
- "All Bins" = All of the Grid C s=1.0 models while including all the bins
- "REMOVED Outer 4" = Same as "All Bins" but crudely removing the outer 4 bins
- "MASKED Outer 4" = Properly masking the outer 4 bins by inflating their errors

1d panels
images/250417/1d_panels_all_cases.png

Results/Discussion:
- Interestingly, the "All Bins" case has generally decent looking contours for all parameters, but it seems like doing anything to the outer 4 bins for this Grid C s=1.0 set of models causes the parameters to move toward the edge.
  - I suspect this doesn't affect the "All Bins" case above which uses all ~4200 models because we have substantially better sampling of the space when expanding beyond the Grid C s=1.0 case alone. As a quick reminder, ~3200 of the 4200 models actually had scalings associated with them, in addition to simply searching over a broad range of parameters due to the additional grids.
Note these posteriors use K=60 (effectively all the models are included), and I've varied nu since lower nu has often been more "flexible" in our past testing.
- Interestingly, it actually seems like the nu=2.5 and nu=inf have slightly better resolved contours for the outer bins removed/masked cases. Using nu=2.5 or nu=infinity have generally result in "less flexible" GPR fits in the past, so it was a bit surprising to see these perform a bit better than their nu=0.5 or nu=1.5 counterparts.

	nu=0.5	nu=1.5
Grid C s=1.0 ALL BINS	[images/250417/allbins_GridC_base__grid_alpha_K60_nu0.5-1.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/allbins_GridC_base__grid_alpha_K60_nu1.5-1.png)	[images/250417/allbins_GridC_base__grid_alpha_K60_nu2.5-1.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/allbins_GridC_base__grid_alpha_K60_nuinf-1.png)
Grid C s=1.0 REMOVED Outer 4	[images/250417/unmasked_GridC_base__grid_alpha_K60_nu0.5-1.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/unmasked_GridC_base__grid_alpha_K60_nu1.5-1.png)	[images/250417/unmasked_GridC_base__grid_alpha_K60_nu2.5-1.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/unmasked_GridC_base__grid_alpha_K60_nuinf-1.png)
Grid C s=1.0 MASKED Outer 4	[images/250417/masked_GridC_base__grid_alpha_K60_nu0.5-1.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/masked_GridC_base__grid_alpha_K60_nu1.5-1.png)	[images/250417/masked_GridC_base__grid_alpha_K60_nu2.5-1.png]]](/JacobPilawa/TriaxSchwarzschild_wiki_5/wiki/[[images/250417/masked_GridC_base__grid_alpha_K60_nuinf-1.png)