import warnings
warnings.filterwarnings("ignore")

from balance import Sample, load_data

target_df, sample_df = load_data()

sample = Sample.from_frame(sample_df, outcome_columns=["happiness"])
target = Sample.from_frame(target_df, outcome_columns=["happiness"])
sample_with_target = sample.set_target(target)
adjusted = sample_with_target.adjust()

INFO (2026-07-21 12:48:06,873) [__init__/<module> (line 77)]: Using balance version 0.22.0.x

INFO (2026-07-21 12:48:06,874) [__init__/<module> (line 82)]: 
balance (Version 0.22.0.x) loaded:
    📖 Documentation: https://import-balance.org/
    🛠️ Help / Issues: https://github.com/facebookresearch/balance/issues/
    📄 Citation:
        Sarig, T., Galili, T., & Eilat, R. (2023).
        balance - a Python package for balancing biased data samples.
        https://arxiv.org/abs/2307.06024

    Tip: You can view this message anytime with balance.help()

WARNING (2026-07-21 12:48:06,888) [input_validation/guess_id_column (line 336)]: Guessed id column name id for the data

WARNING (2026-07-21 12:48:06,899) [sample_frame/from_frame (line 377)]: No weights passed. Adding a 'weight' column and setting all values to 1

WARNING (2026-07-21 12:48:06,901) [input_validation/guess_id_column (line 336)]: Guessed id column name id for the data

WARNING (2026-07-21 12:48:06,915) [sample_frame/from_frame (line 377)]: No weights passed. Adding a 'weight' column and setting all values to 1

INFO (2026-07-21 12:48:06,924) [ipw/ipw (line 735)]: Starting ipw function

INFO (2026-07-21 12:48:06,928) [adjustment/apply_transformations (line 435)]: Adding the variables: []

INFO (2026-07-21 12:48:06,928) [adjustment/apply_transformations (line 436)]: Transforming the variables: ['gender', 'age_group', 'income']

INFO (2026-07-21 12:48:06,937) [adjustment/apply_transformations (line 472)]: Final variables in output: ['gender', 'age_group', 'income']

adjusted.covars().plot(library="balance");

=== gender (categorical) ===

Category | population  adjusted  sample
         |
Female   | ██████████████████████████████████████████ (50.0%)
         | ▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒ (44.5%)
         | ▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐ (29.4%)

Male     | ██████████████████████████████████████████ (50.0%)
         | ▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒ (55.5%)
         | ▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐ (70.6%)

Legend: █ population  ▒ adjusted  ▐ sample
Bar lengths are proportional to weighted frequency within each dataset.

=== age_group (categorical) ===

Category | population  adjusted  sample
         |
18-24    | ████████████████████████ (19.7%)
         | ▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒ (28.1%)
         | ▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐ (49.1%)

25-34    | ████████████████████████████████████ (29.7%)
         | ▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒ (30.7%)
         | ▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐ (30.0%)

35-44    | █████████████████████████████████████ (29.9%)
         | ▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒ (27.4%)
         | ▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐ (15.6%)

45+      | █████████████████████████ (20.6%)
         | ▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒ (13.8%)
         | ▐▐▐▐▐▐ (5.3%)

Legend: █ population  ▒ adjusted  ▐ sample
Bar lengths are proportional to weighted frequency within each dataset.

=== income (numeric, comparative) ===

Range            | population (%) | adjusted (%)   | sample (%)       
----------------------------------------------------------------------
[0.00, 8.57)     | ████████ 49.0  | ████████▒ 54.8 | ████████▒▒▒▒ 73.2
[8.57, 17.14)    | ████ 23.1      | ████ 26.3      | ███] 19.2        
[17.14, 25.71)   | ██ 13.2        | ██ 12.3        | █] 5.3           
[25.71, 34.28)   | █ 7.3          | █ 3.9          | ] 1.6            
[34.28, 42.85)   | █ 3.9          | ] 1.5          | ] 0.4            
[42.85, 51.41)   | 1.8            | 0.2            | 0.1              
[51.41, 59.98)   | 0.9            | 1.0            | 0.2              
[59.98, 68.55)   | 0.4            | 0.0            | 0.0              
[68.55, 77.12)   | 0.2            | 0.0            | 0.0              
[77.12, 85.69)   | 0.1            | 0.0            | 0.0              
[85.69, 94.26)   | 0.0            | 0.0            | 0.0              
[94.26, 102.83)  | 0.0            | 0.0            | 0.0              
[102.83, 111.40) | 0.0            | 0.0            | 0.0              
[111.40, 119.97) | 0.0            | 0.0            | 0.0              
[119.97, 128.54] | 0.0            | 0.0            | 0.0              
----------------------------------------------------------------------
Total            | 100.0          | 100.0          | 100.0            

Key: █ = shared with population, ▒ = excess,    ] = deficit

# Compact view (no blank lines between categories)
adjusted.covars().plot(
    library="balance", variables=["gender"],
    separate_categories=False,
);

=== gender (categorical) ===

Category | population  adjusted  sample
         |
Female   | ██████████████████████████████████████████ (50.0%)
         | ▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒ (44.5%)
         | ▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐ (29.4%)
Male     | ██████████████████████████████████████████ (50.0%)
         | ▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒ (55.5%)
         | ▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐ (70.6%)

Legend: █ population  ▒ adjusted  ▐ sample
Bar lengths are proportional to weighted frequency within each dataset.

# Default view (blank lines between categories)
adjusted.covars().plot(
    library="balance", variables=["gender"],
);

=== gender (categorical) ===

Category | population  adjusted  sample
         |
Female   | ██████████████████████████████████████████ (50.0%)
         | ▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒ (44.5%)
         | ▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐ (29.4%)

Male     | ██████████████████████████████████████████ (50.0%)
         | ▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒ (55.5%)
         | ▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐ (70.6%)

Legend: █ population  ▒ adjusted  ▐ sample
Bar lengths are proportional to weighted frequency within each dataset.

adjusted.covars().plot(
    library="balance", variables=["income"],
);

=== income (numeric, comparative) ===

Range            | population (%) | adjusted (%)   | sample (%)       
----------------------------------------------------------------------
[0.00, 8.57)     | ████████ 49.0  | ████████▒ 54.8 | ████████▒▒▒▒ 73.2
[8.57, 17.14)    | ████ 23.1      | ████ 26.3      | ███] 19.2        
[17.14, 25.71)   | ██ 13.2        | ██ 12.3        | █] 5.3           
[25.71, 34.28)   | █ 7.3          | █ 3.9          | ] 1.6            
[34.28, 42.85)   | █ 3.9          | ] 1.5          | ] 0.4            
[42.85, 51.41)   | 1.8            | 0.2            | 0.1              
[51.41, 59.98)   | 0.9            | 1.0            | 0.2              
[59.98, 68.55)   | 0.4            | 0.0            | 0.0              
[68.55, 77.12)   | 0.2            | 0.0            | 0.0              
[77.12, 85.69)   | 0.1            | 0.0            | 0.0              
[85.69, 94.26)   | 0.0            | 0.0            | 0.0              
[94.26, 102.83)  | 0.0            | 0.0            | 0.0              
[102.83, 111.40) | 0.0            | 0.0            | 0.0              
[111.40, 119.97) | 0.0            | 0.0            | 0.0              
[119.97, 128.54] | 0.0            | 0.0            | 0.0              
----------------------------------------------------------------------
Total            | 100.0          | 100.0          | 100.0            

Key: █ = shared with population, ▒ = excess,    ] = deficit

from balance.stats_and_plots.ascii_plots import ascii_plot_hist

dfs = [
    {"df": target_df, "weight": None},
    {"df": sample_df, "weight": None},
]
print(ascii_plot_hist(
    dfs, names=["Target", "Sample"], column="income",
))

=== income (numeric) ===

Bin              | Target  Sample
                 |
[0.00, 8.57)     | ███████████████████████████████████ (49.0%)
                 | ▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒ (73.2%)
[8.57, 17.14)    | ████████████████ (23.1%)
                 | ▒▒▒▒▒▒▒▒▒▒▒▒▒▒ (19.2%)
[17.14, 25.71)   | █████████ (13.2%)
                 | ▒▒▒▒ (5.3%)
[25.71, 34.28)   | █████ (7.3%)
                 | ▒ (1.6%)
[34.28, 42.85)   | ███ (3.9%)
                 | . (0.4%)
[42.85, 51.41)   | █ (1.8%)
                 | . (0.1%)
[51.41, 59.98)   | █ (0.9%)
                 | . (0.2%)
[59.98, 68.55)   | . (0.4%)
                 |  (0.0%)
[68.55, 77.12)   | . (0.2%)
                 |  (0.0%)
[77.12, 85.69)   | . (0.1%)
                 |  (0.0%)
[85.69, 94.26)   | . (0.0%)
                 |  (0.0%)
[94.26, 102.83)  | . (0.0%)
                 |  (0.0%)
[102.83, 111.40) |  (0.0%)
                 |  (0.0%)
[111.40, 119.97) |  (0.0%)
                 |  (0.0%)
[119.97, 128.54] | . (0.0%)
                 |  (0.0%)

Legend: █ Target  ▒ Sample
Bar lengths are proportional to weighted frequency within each dataset.

from balance.stats_and_plots.ascii_plots import ascii_comparative_hist

# Target (baseline) vs unadjusted sample
dfs = [
    {"df": target_df, "weight": None},
    {"df": sample_df, "weight": None},
]
print(ascii_comparative_hist(
    dfs, names=["Target", "Sample"], column="income",
))

=== income (numeric, comparative) ===

Range            | Target (%)           | Sample (%)                 
---------------------------------------------------------------------
[0.00, 8.57)     | ███████████████ 49.0 | ███████████████▒▒▒▒▒▒▒ 73.2
[8.57, 17.14)    | ███████ 23.1         | ██████] 19.2               
[17.14, 25.71)   | ████ 13.2            | ██ ] 5.3                   
[25.71, 34.28)   | ██ 7.3               |  ] 1.6                     
[34.28, 42.85)   | █ 3.9                | ] 0.4                      
[42.85, 51.41)   | █ 1.8                | ] 0.1                      
[51.41, 59.98)   | 0.9                  | 0.2                        
[59.98, 68.55)   | 0.4                  | 0.0                        
[68.55, 77.12)   | 0.2                  | 0.0                        
[77.12, 85.69)   | 0.1                  | 0.0                        
[85.69, 94.26)   | 0.0                  | 0.0                        
[94.26, 102.83)  | 0.0                  | 0.0                        
[102.83, 111.40) | 0.0                  | 0.0                        
[111.40, 119.97) | 0.0                  | 0.0                        
[119.97, 128.54] | 0.0                  | 0.0                        
---------------------------------------------------------------------
Total            | 100.0                | 100.0                      

Key: █ = shared with Target, ▒ = excess,    ] = deficit

dfs = [
    {"df": target.covars().df, "weight": target.weight_series},
    {"df": sample_with_target.covars().df, "weight": sample_with_target.weight_series},
    {"df": adjusted.covars().df, "weight": adjusted.weight_series},
]
print(ascii_comparative_hist(
    dfs, names=["Target", "Unadjusted", "Adjusted"],
    column="income",
))

=== income (numeric, comparative) ===

Range            | Target (%)    | Unadjusted (%)    | Adjusted (%)  
---------------------------------------------------------------------
[0.00, 8.57)     | ████████ 49.0 | ████████▒▒▒▒ 73.2 | ████████▒ 54.8
[8.57, 17.14)    | ████ 23.1     | ███] 19.2         | ████ 26.3     
[17.14, 25.71)   | ██ 13.2       | █] 5.3            | ██ 12.3       
[25.71, 34.28)   | █ 7.3         | ] 1.6             | █ 3.9         
[34.28, 42.85)   | █ 3.9         | ] 0.4             | ] 1.5         
[42.85, 51.41)   | 1.8           | 0.1               | 0.2           
[51.41, 59.98)   | 0.9           | 0.2               | 1.0           
[59.98, 68.55)   | 0.4           | 0.0               | 0.0           
[68.55, 77.12)   | 0.2           | 0.0               | 0.0           
[77.12, 85.69)   | 0.1           | 0.0               | 0.0           
[85.69, 94.26)   | 0.0           | 0.0               | 0.0           
[94.26, 102.83)  | 0.0           | 0.0               | 0.0           
[102.83, 111.40) | 0.0           | 0.0               | 0.0           
[111.40, 119.97) | 0.0           | 0.0               | 0.0           
[119.97, 128.54] | 0.0           | 0.0               | 0.0           
---------------------------------------------------------------------
Total            | 100.0         | 100.0             | 100.0         

Key: █ = shared with Target, ▒ = excess,    ] = deficit

# Comparative mode (default) — numeric variables show excess/deficit vs baseline
adjusted.covars().plot(
    library="balance", variables=["income"],
);

=== income (numeric, comparative) ===

Range            | population (%) | adjusted (%)   | sample (%)       
----------------------------------------------------------------------
[0.00, 8.57)     | ████████ 49.0  | ████████▒ 54.8 | ████████▒▒▒▒ 73.2
[8.57, 17.14)    | ████ 23.1      | ████ 26.3      | ███] 19.2        
[17.14, 25.71)   | ██ 13.2        | ██ 12.3        | █] 5.3           
[25.71, 34.28)   | █ 7.3          | █ 3.9          | ] 1.6            
[34.28, 42.85)   | █ 3.9          | ] 1.5          | ] 0.4            
[42.85, 51.41)   | 1.8            | 0.2            | 0.1              
[51.41, 59.98)   | 0.9            | 1.0            | 0.2              
[59.98, 68.55)   | 0.4            | 0.0            | 0.0              
[68.55, 77.12)   | 0.2            | 0.0            | 0.0              
[77.12, 85.69)   | 0.1            | 0.0            | 0.0              
[85.69, 94.26)   | 0.0            | 0.0            | 0.0              
[94.26, 102.83)  | 0.0            | 0.0            | 0.0              
[102.83, 111.40) | 0.0            | 0.0            | 0.0              
[111.40, 119.97) | 0.0            | 0.0            | 0.0              
[119.97, 128.54] | 0.0            | 0.0            | 0.0              
----------------------------------------------------------------------
Total            | 100.0          | 100.0          | 100.0            

Key: █ = shared with population, ▒ = excess,    ] = deficit

# Grouped-bar mode — numeric variables use the same bar style as categorical
adjusted.covars().plot(
    library="balance", variables=["income"],
    comparative=False,
);

=== income (numeric) ===

Bin              | population  adjusted  sample
                 |
[0.00, 8.57)     | ███████████████████████████████████ (49.0%)
                 | ▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒ (54.8%)
                 | ▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐▐ (73.2%)
[8.57, 17.14)    | ████████████████ (23.1%)
                 | ▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒ (26.3%)
                 | ▐▐▐▐▐▐▐▐▐▐▐▐▐▐ (19.2%)
[17.14, 25.71)   | █████████ (13.2%)
                 | ▒▒▒▒▒▒▒▒▒ (12.3%)
                 | ▐▐▐▐ (5.3%)
[25.71, 34.28)   | █████ (7.3%)
                 | ▒▒▒ (3.9%)
                 | ▐ (1.6%)
[34.28, 42.85)   | ███ (3.9%)
                 | ▒ (1.5%)
                 | . (0.4%)
[42.85, 51.41)   | █ (1.8%)
                 | . (0.2%)
                 | . (0.1%)
[51.41, 59.98)   | █ (0.9%)
                 | ▒ (1.0%)
                 | . (0.2%)
[59.98, 68.55)   | . (0.4%)
                 |  (0.0%)
                 |  (0.0%)
[68.55, 77.12)   | . (0.2%)
                 |  (0.0%)
                 |  (0.0%)
[77.12, 85.69)   | . (0.1%)
                 |  (0.0%)
                 |  (0.0%)
[85.69, 94.26)   | . (0.0%)
                 |  (0.0%)
                 |  (0.0%)
[94.26, 102.83)  | . (0.0%)
                 |  (0.0%)
                 |  (0.0%)
[102.83, 111.40) |  (0.0%)
                 |  (0.0%)
                 |  (0.0%)
[111.40, 119.97) |  (0.0%)
                 |  (0.0%)
                 |  (0.0%)
[119.97, 128.54] | . (0.0%)
                 |  (0.0%)
                 |  (0.0%)

Legend: █ population  ▒ adjusted  ▐ sample
Bar lengths are proportional to weighted frequency within each dataset.

ASCII Plots Tutorial¶

Setup¶

1. All covariates at a glance¶

2. Category spacing¶

3. Focusing on a single variable¶

4. Grouped histogram (`ascii_plot_hist`)¶

5. Comparative histogram (`ascii_comparative_hist`)¶

Three-way comparative histogram¶

6. Grouped histograms via `comparative=False`¶

ASCII Plots Tutorial¶

Setup¶

1. All covariates at a glance¶

2. Category spacing¶

3. Focusing on a single variable¶

4. Grouped histogram (ascii_plot_hist)¶

5. Comparative histogram (ascii_comparative_hist)¶

Three-way comparative histogram¶

6. Grouped histograms via comparative=False¶

4. Grouped histogram (`ascii_plot_hist`)¶

5. Comparative histogram (`ascii_comparative_hist`)¶

6. Grouped histograms via `comparative=False`¶