A comparison of compact, presumably young with extended, evolved radio active galactic nuclei

Radio sources with peaked spectra (peaked spectrum sources, PSS) and compact symmetric objects (CSO) are powerful, compact, and presumably young AGNs and therefore particularly suitable to study aspects of the AGN-host connection. We use a statistical approach to compare a PSS-CSO sample with a matching comparison sample of extended sources (ECS). We find significant differences between the two samples. In particular, we find that the ECS sample has a higher proportion of passive galaxies with a lower star formation activity. This applies to both sub-samples of QSOs or radio galaxies as well as to the entire sample. The star formation rates of the PSS-CSO host galaxies are typically in the range 0 to 5 M_sun/yr and the stellar masses are in the range 3x10^11 to 10^12 M_sun. Secondly, in agreement with previous results, we find a remarkably high proportion of PSS-CSO host galaxies with merger signatures. The merger fraction of the PSS-CSO sample is 0.61, which is significantly higher than that of the comparison sample (0.15). We suggest that this difference can be explained by assuming that the majority of the PSSs and CSOs cannot evolve to extended radio sources and are therefore not represented in our comparison sample.


Introduction
Radio-loud active galactic nuclei (AGNs) are typically associated with powerful bipolar radio jets (Tadhunter 2016;Hardcastle & Croston 2020;Saikia 2022).The jets transport energy and momentum from the accreting supermassive black hole (SMBH) into the surrounding interstellar medium of the host galaxy and the intra-cluster medium of a galaxy cluster.This so-called jet (or kinematic) feedback is thought to have an impact on the fate of the ambient gas and thus on the star formation (SF) history of galaxies (McNamara & Nulsen 2007;Fabian 2012;Morganti 2017;Venturi et al. 2021).
The projected size of the radio jets is usually larger than that of typical host galaxies and can reach a total extent of up to several megaparsecs.On the other side, there are also compact radio galaxies that are just as energetic as the most luminous extended ones, but much smaller and confined within their host galaxies.The gigahertz peaked spectrum (GPS) sources, compact steep spectrum (CSS) sources, and the high-frequency peakers (HFPs), collectively referred to as peaked spectrum sources (PSSs), belong to this class of radio AGNs.In the milliarcsecond resolution radio images taken with the very long baseline interferometry technique, these sources show a similar morphology to typical radio AGNs, albeit with a linear size extending only to a few kiloparsecs (Wilkinson et al. 1994;Stanghellini et al. 1997;Orienti et al. 2006).The flat-spectrum radio core is surrounded by symmetric steep-spectrum lobes on both sides, similar to the large-scale morphology of extended radio galaxies.The radio spectra of PSSs deviate from the canonical straight power law and are characterised by a concave (inverted) shape with steep slopes on either side of the peak.The peak frequencies (in the rest frame) are around 1 GHz for GPS, below 500 MHz for CSS, and above 5 GHz for HFP sources.Synchrotron self-absorption by relativistic electrons inside the radio-emitting source (Slish 1963) or free-free absorption by ionised plasma either inside or outside is responsible for the inverted shape of the radio spectrum of the PSSs (Kellermann 1966).A comprehensive review on the PSSs was given by O'Dea (1998) and O' Dea & Saikia (2021).
The origin and evolution of the PSSs are still a matter of debate.Kinematic ages derived from the motion of the hot spots indicate a young age (< 5000 years) for the radio activity (Owsianik & Conway 1998;Polatidis & Conway 2003;Gugliucci et al. 2005).This is also supported by the spectral age estimates from modelling the high-frequency spectral break (Murgia et al. 1999;Orienti & Dallacasa 2010).Statistical studies, however, indicate an overabundance of PSSs over the extended radio galaxies (Readhead et al. 1996;O'Dea & Baum 1997;An & Baan 2012).Slob et al. (2022) assumed that the PSSs selected at low frequencies (< 144 MHz) represent young radio galaxies.Their conclusion is based on the analysis of the luminosity function and radio source counts of PSSs, both of which are found to be scaled down by a significant factor.In an alternative evolution scenario, the PSSs are considered as 'old and frustrated' sources that are confined to the host galaxies due to an overdense interstellar medium (van Breugel et al. 1984;Bicknell et al. 1997;Dicken et al. 2012).The high detection rate of H i absorption amongst PSSs over extended radio galaxies may indicate that a high proportion of PSSs are embedded in a rich interstellar medium (Morganti & Oosterloo 2018).Furthermore, for some PSSs, the intermittent nature of AGN activity may lead to their truncated evolution (Reynolds & Begelman 1997;Kunert-Bajraszewska et al. 2006).
Concerning the present work, we briefly summarise the following findings on the host galaxies of compact radio sources from O'Dea & Saikia (2021): The typical hosts appear to be large elliptical galaxies with magnitudes around the Schechter luminosity, but there are also exceptions with significant disk components.The optical spectrum is dominated by an old stellar population, but often current or ongoing SF is also indicated, where the star formation rates (SFRs) are typically a few to tens M ⊙ yr −1 .The optical emission line spectra are very similar to those of the extended radio sources, both types can be divided into high-excitation radio galaxies (HERGs) and low-excitation radio galaxies (LERGs).Some well-studied hosts of compact radio sources are classified as galaxy mergers and a few systematic studies indicate that mergers and interactions are common.Their environment appears similar to that of powerful large radio sources and they are not predominantly in rich galaxy clusters.
A critical source of information for a systematic investigation of the host galaxies are optical spectra.Liao & Gu (2020) compiled a sample of 126 compact radio galaxies with spectroscopy available from the Sloan Digital Sky Survey (SDSS; York et al. 2000).Besides GPS, CSS, and HFP sources, this sample also includes Compact Symmetric Objects (CSOs), which have similar properties as GPS radio galaxies.Liao & Gu (2020) derived the SMBH masses and the Eddington ratios and found that they cover broad ranges with log M SMBH /M ⊙ ∼ 7 to 10 and log R Edd ∼ −4.9 to 0.4.Nascimento et al. (2022) investigated the optical and mid-infrared (MIR) properties of a sample of compact radio sources with SDSS spectra consisting of 58 CSS or GPS sources and 14 Megahertz-Peaked-Spectrum (MPS) sources at z ≤ 1 from various radio-selection catalogues publicly available in the literature.Based on stellar population synthesis, they concluded that their sample is dominated by intermediate to old stellar populations in a wide range of morphological galaxy types.The SFRs cover the range from zero to ∼ 20M ⊙ yr −1 , for most sources the SFR is < ∼ 5M ⊙ yr −1 .No strong correlation between the optical and radio properties of these sources was found.Recently, Gordon et al. (2023) investigated the relationship between SF and radio source size in a sample of CSS sources and concluded that the bulk of SF ceased several hundred Myr before the jet was triggered, where the broad range of SFRs is possibly a consequence of episodic SF that can result from galaxy-galaxy interactions.Using complete samples of CSOs, Kiehlmann et al. (2023) argued that their relative numbers and the distributions of redshifts and sizes show conclusively that most CSOs belong to a distinct population of jetted AGNs that do not evolve into larger radio sources and should be described as 'short-lived', rather than 'young'.Duggal et al. (2023) carried out a systematic search for UV signatures from AGN feedback in CSS radio galaxies based on new HST images.They claim that their observations are in line with recent SF activity 'likely triggered by current or an earlier episode of radio emission, or by a confined radio source that has frustrated growth due to a dense environment'.
The present work compares compact and presumably young radio galaxies from Liao & Gu (2020) with old, extended radio galaxies.Special attention is paid to the properties of the host galaxies.Because the Liao & Gu (2020) sample is heterogeneous, O'Dea & Saikia (2021) warned that it may not match well with other samples in the literature.For this reason, we have created a matched 'comparison sample' of extended radio galaxies and QSOs which we assume represent advanced stages of radio source evolution.We look for significant differences between the Liao & Gu (2020) sample and the comparison sample, where we focus on three main aspects: (i) spectral energy distributions (SEDs), in particular, composite spectra and multiband SEDs (MBSEDs) from the far ultraviolet (UV) to MIR and their implications for the stellar population of the host galaxies, (ii) SF activity, and (iii) morphological indicators of gravitational interactions and merging.Our study is based entirely on existing archival data.It complements the study of Nascimento et al. (2022) in which they compare the different types of compact sources and look for correlations between optical/MIR and radio properties, while our work focuses on the comparison of the sample of compact radio sources with the comparison sample of extended sources.Of course, their sample of 72 compact radio sources at z ≤ 1 with spectra in SDSS Data Release (DR) 12 (Alam et al. 2015) strongly overlaps with the sample of our study.The low-redshift (z ≤ 1) subsample of the Liao & Gu (2020) sample contains 94 sources, the total sample covers a larger redshift range up 3.6.We used the optical spectra from the SDSS DR16 (Ahumada et al. 2020).
This paper is structured as follows.In Sect.2, we describe our sample of compact radio sources and explain in detail how the comparison sample of extended radio sources was constructed.In Sect.3, we discuss three different diagnostic diagrams and compare the distributions of the spectral types from the two samples.In Sect.4, we use the SDSS composite spectra, the MBSEDs, and the MIR data to discuss the spectral energy distributions.The SFRs and stellar masses are discussed in Sect. 5.In Sect.6, we determine the merger fractions in the two samples and discuss possible reasons for the differences.The summary is presented in Sect.7. Additional data are provided in the Appendices.

PSS-CSO sample
Liao & Gu (2020) collected all the radio samples of PSSs and CSOs available in the literature and created a large sample of 250 CSS, 148 GPS, 116 HFP sources, and 80 CSOs.By crossidentification with the SDSS, a sample of 147 sources with spectroscopic counterparts was selected from this, which they called the 'optical sample'.After removing blazar-type sources, which contaminate this sample, their 'final optical sample' consists of 126 objects.Liao & Gu (2020) and Liao & Gu (2021) have described the sample in detail and listed several properties of the individual objects.The largest angular size (LAS) and the largest linear size (LLS) of the radio structure are given there for 109 sources (87%), all sources larger than 0.6 kpc are classified as CSS sources.With one exception, LLS = 56 kpc for J090933.49+425346.5, all listed sizes are smaller than 40 kpc.
In the present work, this PSS-CSO sample was slightly reduced for two reasons.Firstly, as will be explained in Sect.2.2 below, the comparison sample is selected among extended radio AGNs with LLS > 50 kpc in the Very Large Array Sky Survey (VLASS; Lacy et al. 2020).For consistency, we therefore also checked the extents of the PSSs and CSOs in the VLASS images.For eleven objects that were found to be extended or possibly double we measured the FR type and size (Table 1).All remaining objects do not appear to have been resolved in our visual inspection of the VLASS images.Depending on the FR type (Fanaroff & Riley 1974), the LAS was identified with the distance between the hot spots for FR II and with the extent of the 5σ contour for FR I and uncertain classifications, where σ is the r.m.s. of the local background.The three sources with LLS > 50 kpc are shown in Fig. 1.Traditionally the size limit has been put at 20 kpc for PSSs and CSOs (O'Dea & Saikia 2021).The PSS-CSO sample contains 4 sources with sizes larger than 50 kpc and seven sources with LLS between 20 and 50 kpc.For the present study, we have thus decided to use a strict size limit of 50 kpc to distinguish compact from extended sources.Therefore, we removed SDSS J090933.49+425346.5 and the three sources shown in Fig. 1.The case of J134536.94+382312.5 is not entirely clear.We cannot be sure that the two radio sources seen in the image are physically connected, but we cannot rule it out either.We decided to remove this object from the sample as a precaution.
Secondly, we removed the CSS source 4C +39.29, which was assigned to SDSS J101714.23+390121.1 at z = 0.211 by Liao & Gu (2020).We argue that the host galaxy of 4C +39.29 is SDSS J101714.10+390123.8 at z = 0.536, for which no SDSS spectrum is available.This interesting source is discussed in more detail in Appendix A.
We refer to the remaining sample of 121 sources as the 'PSS-CSO sample'.The visual inspection of the SDSS spectra of all sources revealed incorrect redshifts from the SDSS spectroscopic pipeline for SDSS J014109.16+135328.3 (z = 0.621 instead of 0.236 given by SDSS) and SDSS J080133.55+141442.8 (z = 1.196 instead of 0.246).We note that, in both cases, the correct redshift is given by Liao & Gu (2020).For the source SDSS J085408.44+021316.1 we prefer the value z = 0.400 from SDSS to the value 0.459 given by Liao & Gu (2020).The PSS-CSO table containing the properties derived here is published at CDS, an excerpt from the table (the first six lines) is given in Table C.1.With the exception of four sources outside the FIRST footprint area, the radio luminosity L 1.4GHz was calculated from the integrated 1.4 GHz flux density provided by FIRST (Helfand et al. 2015).For the remaining four sources, we took the flux densities from the NED 1 .For all other data we refer to Liao & Gu (2020).

Selection of the comparison sample
As mentioned above, the Liao & Gu (2020) sample was inhomogeneously selected and cannot be considered statistically complete and representative of the entire population.In particular, their optical sample is biased towards low redshifts and, as a consequence, towards low luminosities.For the present study, it is thus of key importance to select a control sample in a consistent way.The construction of such a 'matched' comparison sample consists of two steps.First, a basic sample must be defined from which the control sample is to be selected.Especially in the frame of the evolutionary scenario, it is meaningful to compare 1 https://ned.ipac.caltech.edu/the PSS-CSO sample with their evolved counterparts.Because the physical size of a radio AGN is expected to grow with time, extended radio galaxies can be considered later stages of evolution than compact sources, if a similar environment is assumed.Therefore, a possible way is to select the comparison galaxies from a basic sample of extended radio galaxies and QSOs with sizes larger than the upper limit for the PSS-CSO sources.For the present study we use a size limit of 50 kpc (Sect 2.1).
The main difficulty in creating such a basic sample of extended radio galaxies is the fact that only a small percentage of radio sources identified in SDSS show extended radio emission  2011)).In the middle column, the final ECS sample (green) is compared with the PSS-CSO sample (blue).Right column: histogram distributions of the PSSs and CSOs (blue) and the ECSs (green).(Ivezić et al. 2002) and not all of them have SDSS spectra.As a result of an extensive literature search, we found several suitable data collections based on radio data from the Faint Images of the Radio Sky at Twenty-Centimeters (FIRST; Becker et al. 1995).Garon et al. (2019) extensively discussed a sample of 4304 extended radio galaxies from Radio Galaxy Zoo (RGZ).Their selection was restricted 'to sources with at least a 0.65 consensus in the radio component classification from RGZ to remove sources with ambiguous structure, as well as non-physical associations between radio lobes and coincident galaxies'.The host galaxies were identified via cross-matching with the images from the Wide-Field Infrared Survey Explorer (WISE; Wright et al. 2010), redshift data were taken from SDSS.Here, we extracted the subsample of the 1743 sources with spectroscopic redshifts.Kuźmicz & Jamrozy (2021) presented a sample of 174 newly discovered giant radio quasars (GRQs; their Table 1), which are defined as having a projected linear size greater than 700 kpc, and 367 smaller, lobe-dominated radio quasars found during their search for GRQs (their Table 2).This search was based on the cross-match of spectroscopically identified quasars and quasar candidates from SDSS with the NRAO Very Large Array Sky Survey (NVSS; Condon et al. 1998) and the FIRST survey.Another list of extended radio galaxies with SDSS spectroscopy was provided by Kozieł-Wierzbowska & Stasińska (2011), who created a sample of 401 FR II radio galaxies (no quasars) that have optical counterparts with SDSS spectra.Furthermore, we included the sample of 63 broad-line AGNs from the SDSS with extended emission in FIRST from Rafter et al. (2011).
All these data cover the redshift range z > 1 only insufficiently.Therefore, we added a subsample of quasars from the 'Catalog of Quasar Properties from SDSS DR7' (Shen et al. 2011).To include radio properties, Shen et al. (2011) matched the SDSS DR7 quasar catalogue with the FIRST catalogue with a matching radius of 30 ′′ (corresponding to ∼ 250 kpc at z = 2) and flagged those quasars which have multiple FIRST sources within 30 ′′ as lobe-dominated (FIRST = 2).We extracted a list of 2105 lobe-dominated quasars with z < 4 from that catalogue.Unlike the other data sources described above, the Shen et al. (2011) catalogue does not contain information of the LAS of the FIRST source.In the first step, we identified the LAS with the largest distance between two radio components from the FIRST catalogue within an aperture of 500 kpc diameter centred on the respective target, where only sources with a probability of being a side lobe less than 0.1 were considered.The final sizes were measured more precisely on the VLASS images when selecting the comparison sample (see below).
The combination of the above-mentioned five compilations results in a total of 3930 sources covering a wide range of radio power and size, with the vast majority having LLS > 50 kpc in the literature.As discussed above, we use a size limit of 50 kpc and define the sample of 3875 FIRST-SDSS sources with LLS > 50 kpc as the basic sample for selecting our comparison sample.The left column of Fig. 2 illustrates the distributions of z, radio luminosity L 1.4GHz at 1.4 GHz (rest frame), and absolute i band magnitude M i .The latter two were k-corrected following, for example, Kennefick & Bursick (2008), where we assumed F ν ∝ ν α with a mean spectral index α o = −0.5 in the optical and α r = −0.7 in the radio.This sample of extended radio galaxies with SDSS spectroscopy populates a region of parameter space that is similar but not identical to that of the PSS-CSO sample.In particular, it extends towards fainter 1.4 GHz luminosities, illustrating the well-known fact that PSSs and CSOs are among the most luminous radio AGNs.We note that all radio sources from this sample are more luminous than the threshold L 1.4GHz = 10 23 W Hz −1 above which the radio luminosity distribution of radio galaxies is dominated by AGNs (Sadler et al. 2002).
In the second step, the comparison sample was selected.Liao & Gu (2020) explicitly excluded relativistically beamed sources from their sample.To obtain a comparison sample that is also free from such sources, we have reduced the basic sample only to objects with steep radio spectra.In addition, we also checked the radio morphology and excluded compact, core-dominated sources in a later step (see below).We used the SPECFIND V3.0 catalogue (Stein et al. 2021) and found that 978 sources from our basic sample have steep radio spectra with α r < −0.5 measured at two or more frequencies ν > 1 GHz.This is the parent sample from which the comparison sample was finally drawn.The comparison sample was selected so that the distributions of the main physical parameters are as similar as possible to those of the PSS-CSO sample.Here, we consider the redshift z, the radio luminosity L 1.4GHz , the i band absolute magnitude M i , and the SDSS spectral class as the most relevant properties.In particular, this means that for a PSS-CSO with a host-dominated spectrum, the host galaxy of the corresponding source in the comparison sample has a similar mass.In practice, the apparent SDSS i band magnitude and the radio loudness R i are used after z is fixed.2That is, we selected for each PSS-CSO k with (z k , i k , R i,k ) a counterpart from the basic sample of extended sources with (z, i, R i ) within (z k ± ∆z, i k ± ∆i, R i,k ± ∆R i ), where the interval widths ∆z, ∆i, and ∆R i were chosen as small as possible.The resulting comparison sample thus has the same size as the PSS-CSO sample, that is, it consists of 121 sources.We henceforth refer to these sources as 'extended comparison sources' (ECSs).
Finally, we used the VLASS-SDSS overlays to determine the FR type and the LAS of the selected ECSs to ensure that they are indeed larger than 50 kpc.The LAS was measured in the same way as for the PSSs and CSOs (Sect.2.1).Compact, coredominated and unclear or suspicious radio sources were removed from the selection and replaced by other sources from the parent sample in an iterative process.In the final sample, 112 ECSs were classified as FR II and 9 as FR I, where for both FR types the classification is uncertain due to more complex structures for ∼10% of the sources.A mosaic of the VLASS-SDSS overlays is shown in Figure D.1.Figure 3 shows the histogram distributions of the radio slope α r (left) and the LLS (right).The full table of the ECS sample is published in electronic form at CDS, an excerpt is shown in Table C.2.In the middle column of Fig. 2, the ECS positions in the parameter space z − M i − log L 1.4 GHz are compared with those of the PSSs and CSOs.The histogram distributions of log L 1.4GHz , M i , and z are shown in the right column.It is obvious that both samples populate almost the same area of the parameter space.A small but notable difference is the lack of ECSs with log L 1.4GHz > 28.4.On the other hand, adopting a threshold R i = 1 to distinguish radio-quiet from radio-loud sources, 100% of the ECS and 98% of the PSS-CSO sample are radio-loud.The two sources J103719.33+433515.3 (R i = 0.61) and J105731.17+405646.1 (R i = 0.01), which have the lowest redshifts (z = 0.025) and the lowest 1.4 GHz luminosity, are not radio-loud according to our criterion.As can be seen in Fig. 2, the parameter space around these two sources is almost empty.This means that the selection intervals ∆z, ∆i, and ∆R i had to be extended considerably.Both selected ECS counterparts are radio-loud.We performed the two-sided two-sample Kolmogorov-Smirnov (KS) test with the null hypothesis H 0 that For the PSS-CSO sample, the sizes are taken from Liao & Gu (2020).The dashed lines depict evolutionary tracks from An & Baan (2012) for sources of high radio-power (red) and low radio-power (blue).
the histogram distributions in the panels of the right column of Fig. 2 are the same for the two samples against the alternative hypothesis H A that they are different.H 0 is discarded in favour of H A if the test statistic exceeds a critical value that depends on the significance level α.The test result says that there is no reason to discard H 0 for all three quantities at a significance level α = 0.05 (i.e. the probability of mistakenly not rejecting H 0 is < 0.05).We also note here that the proportion of the SDSS spectral class GALAXY is very similar for the two samples: 43 ± 5 % for the ECSs and 45 ± 5 % for the PSS-CSO sample.
Fig. 4 shows the PSS-CSO and the ECS samples in the power-size diagram.The LLSs of the PSSs and CSOs are from Liao & Gu (2020).The size distributions of the two samples do not overlap.We over-plotted two evolutionary pathways from An & Baan (2012) based on parametrised models.It is obvious that the positions of the selected ECSs are not inconsistent with the assumption that they represent later evolutionary stages of sources similar to those from the PSS-CSO sample.

Diagnostic emission line diagrams
The BPT diagrams of diagnostic line ratios (Baldwin et al. 1981), in particular the [O iii] 5007Å/Hβ versus [N ii] 6585Å/Hα diagram, are commonly used to classify emission line galaxies.(Hereafter, we refer to these lines as [O iii] and [N ii].)For line data from SDSS spectra, the diagram applies to sources at z < ∼ 0.35.The top panel of Fig. 5 shows the [N ii]-based BPT diagram for the galaxies (SDSS spectral class GALAXY) with z < 0.35 and a signal-to-noise ratio S/N > 3 for all four lines.The line ratios were computed from the line fluxes, we did not try to correct for intrinsic reddening.The fluxes and the equivalent width (EWs) are available from the SDSS table galSpecLine for the majority of the sources and were measured by us from the SDSS spectra for the rest.The mean ratios are listed in Table 2.The line ratios plotted in the BPT diagram can be used to distinguish between high-excitation radio galaxies (HERGs) and low-excitation radio galaxies (LERGs).With the criterion of Buttiglione et al. (2010) we find 23 LERGs and 4 HERGs among the PSS-CSO galaxies with S/N > 3, to be compared with 12 LERGs and 6 HERGs for the ECS sample.The ECS sample thus appears to contain a higher proportion of HERGs (Table 2).However, the ECS sample also contains a higher proportion of galaxies with low-S/N spectra that are not included in the diagram and are most likely LERGs (see below).All HERGs are classified as Seyferts in the BPT diagram., in combination with the [O iii]/Hβ line ratio and can be applied up to z ∼ 0.9.The basic physics behind it is the difference between AGNs and SF galaxies in the kinematics of the ionised gas in combination with the correlation between gas kinematics, stellar mass, and metallicity.(The [N ii]/Hα ratio is known to correlate with the metallicity in SF galaxies.)The KEx diagram for the galaxies from the PSS-CSO and the ECS The mean value is 25% higher for the former than for the latter (Table 2).We applied the one-sided T test for two independent means to test H 0 : at a significance level of α = 0.05.The result is that H 0 must be clearly rejected in favour of H A (p = 0.01).
In both the BPT diagram and the KEx diagram, the two samples show similar distributions of the line ratios.The KS test shows that there is no reason to reject the null hypothesis that the PSS-CSO and the ECS samples come from the same distribution at a significance level α = 0.05.However, the selection constraint S/N > 3 excludes a significant proportion of galaxies in both samples.In the BPT diagram, 17% of the PSSs and CSOs are missing because they have only weak emission lines, especially weak Hβ lines.For the ECS sample, however, this proportion is about twice as high (39%).This difference is significant according to the two-sided Z test (p = 0.005).
The bottom panel of Fig. 5 shows the WHAN diagram (Cid Fernandes et al. 2011), which uses only the EW of the Hα line in combination with the line ratio [N ii]/Hα.It was developed as a robust and economic classification scheme to cope with the large population of weak-line galaxies in SDSS.Following Cid Fernandes et al. (2011), five different regions can be distinguished: pure star-forming galaxies (SF), strong AGNs (sAGN; e.g.Seyferts), weak AGNs (wAGN; e.g.LINERS), retired galaxies (RG; i.e. galaxies that have stopped SF and are ionised by evolved low-mass stars), and passive, basically lineless galaxies.The majority of the ECS galaxies, 67%, are classified as retired or passive, compared to 14% in the PSS-CSO sample.This difference is highly significant and the KS test yields a significant difference between the EW(Hα) distributions of the two samples (p < 10 −5 ).
Next, we calculated the number of objects in the regions indicated by the demarcation lines of the three diagnostic diagrams.The resulting histogram distributions are shown in Fig. 6.To find out whether these differences are significant or just by chance, we used Pearson's chi-square test for two independent samples.There is no reason to reject the null hypothesis of no difference between the PSS-CSO and the ECS samples at α = 0.05 for the [N ii]-based BPT diagram and the KEx diagram.However, the WHAN diagram reveals a highly significant difference (p = 4 • 10 −5 ) with a significantly larger proportion of weak-line galaxies in the ECS sample.

SDSS composite spectra
Composite spectra provide insight into average spectral properties of AGN samples (e.g.Francis et al. 1991;Brotherton et al. 2001;Vanden Berk et al. 2001;Reichard et al. 2003;Pol & Wadadekar 2017;Ren et al. 2022).We followed the approach described in detail by Vanden Berk et al. (2001).The spectra were downloaded from SDSS DR16 and inspected individually to check the redshifts and spectral classification (see also Sect.2.1).Next, the spectra were corrected for foreground extinction using E(B − V) from Schlafly & Finkbeiner (2011) provided by IRSA's Galactic Dust Reddening and Extinction Service 3 .The corrected spectra were rebinned to a common wavelength scale in their restframe.Afterwards, the corrected spectra were normalised, i.e. the spectra were ordered by redshift, the flux density of the lowest-z spectrum was arbitrarily scaled and the other spectra were scaled in order of redshift to the average of the flux density in the common wavelength region of the mean spectrum of all the lower-z spectra.Finally, the median flux density and the standard deviation of all scaled spectra were determined in each wavelength bin.
It is useful to consider the composite spectra of quasars and galaxies separately.Therefore, we subdivided the PSS-CSO sample into two sub-samples differentiated by spectral class (G = GALAXY and Q = QSO), i.e.PSS-CSO-Q and PSS-CSO-G.In the same way, the ECS sample is subdivided into the two subsamples ECS-Q and ECS-G.The final arithmetic median composite rest-frame spectra are shown in Fig. 7 with the two PSS-CSO composites at the top and the ECS composites at the bottom.For comparison, the composite spectrum of radio-loud quasars from the FIRST Bright QSO Survey (FBQS; Brotherton et al. 2001) is overplotted (magenta).All composite spectra have been scaled such that the mean flux density has the value of 1 in the QSO continuum window 3030 − 3090 Å, but the spectra for the galaxy subsamples were shifted upwards by 2.5 flux units for clarity.The continua of the four composites were fitted by model spectra (see Sect. 4.1.2),the best-fitting model spectra are also plotted (green).At first glance, the composites for the same spectral class are quite similar.The more subtle differences are discussed below.

Emission lines and line ratios
The arithmetic median preserves the relative fluxes along the spectra, the composites can thus be used to measure diagnostic line ratios.We measured the (arbitrarily scaled) line fluxes 3 https://irsa.ipac.caltech.edu/applications/DUST/docs/background.html from our composite spectra after subtraction of a modelled continuum (Sect.4.1.2).The [N ii]-Hα complex was de-blended using a four-component Gaussian fit with a broad Hα component and three narrow components for Hα, [N ii] λ 6549Å, and [N ii] λ 6585Å.For the measurement of the Hβ line, a twocomponent Gaussian fit was used for the narrow and the broad line components.Only the narrow Balmer components were used for the line ratios [N ii]/Hα and [O iii]/Hβ.The results (Table 3) can be compared with the diagnostic diagrams in Sect.3.For all four subsamples, the line ratios from the composite spectra are clearly in the region of strong AGNs in the BPT and the KEx diagram.In accordance with Table 2, the ECS-G sample has a higher line ratio [O iii]/Hβ than the PSS-CSO-G sample. 4n the WHAN diagram, three subsamples are also in the region of strong AGNs, while the ECS-G subsample is close to the border between weak AGNs and retired galaxies, which reflects the weak Hα line in the composite spectrum.
All four composites in Fig. 7 show relatively strong forbidden lines Liao & Gu (2020) found a significant correlation between the [O iii] line luminosity and the 5 GHz luminosity in their PSS-CSO sample.A correlation of the relative strength of [O iii], radio loudness and radio luminosity is known also from other studies (Andika et al. 2020;Wang et al. 2022).With this in mind, strong lines are to be expected because the radio sources in our samples are radio-loud and belong to the upper end of the radio luminosity distribution at their redshift (Sect.2.2).Radio jets may play an important role in ionising the narrow line region (NLR) gas and the [O iii] and [O ii] lines could be significantly amplified by shocks caused by strong jet interaction with the line emitting gas clouds (e.g.Best et al. 2000;Labiano 2008;Kalfountzou et al. 2012;Mullaney et al. 2013;Andika et al. 2020).The higher [O iii]/Hβ ratio of the ECS-G sample compared with the PSS-CSO-G sample is consistent with the idea that the [O iii] line strength is enhanced significantly by the jet expansion through the host galaxy (Labiano 2008).In this context, we also note that the blends of Fe ii lines and Balmer continuum between ∼ 2000 and 4000 Å (the '3000 Å bump') are relatively weak in both QSO composites of the present study, in agreement with the known anti-correlation between the [O iii] line and the strength of the Fe ii emission (Boroson & Green 1992).
In Sect.3, we found that the mean [O iii] line dispersion for the PSS-CSO galaxies is 25% larger than for the ECS galaxies, but these galaxy samples are biased against spectra with weak Hβ lines (and thus high [O iii]/Hβ ratios).In the PSS-CSO-G composite spectrum, the FWHM of the [O iii] line is also larger than that of the ECS-G composite, but the difference is now only ∼ 5% (Table 3).
The host galaxies of radio-loud QSOs are known to have elevated [O ii] with respect to hosts from radio-quiet QSOs.The [O ii] line is stronger in the PSS-CSO composites, in agreement with the fact that its emission-line strength correlates inversely with the radio source size (Best et al. 2000).The line ratio [O ii]/[O iii] is sensitive to the ionisation parameter (Penston et al. 1990;Kewley & Dopita 2002;Shirazi et al. 2014).The [O ii] line is often taken as a SF indicator, but it can also be emitted by low-density gas ionised by the AGN.On the other hand, the [Ne v] λ 3425Å line is unambiguously related to AGN activity, either via direct photo-ionisation from the AGN or ionisation from AGN-driven shocks.The [O ii]/[Ne v] line ratio can thus be taken as an indicator of whether there is an excess of [O ii] emission that can be attributed to radio jets acting within the host galaxies or to SF in the host galaxies (Maddox 2018).This excess is higher in the PSS-CSO than in the ECS samples (Table 3).

Continuum and absorption lines
The composite spectra of the galaxy subsamples are dominated by the light from an old stellar population.There is, however, a discernible difference between the PSS-CSO-G and the ECS-G composite: the former has a flatter slope between ∼ 3000 and 4500 Å, indicating an additional blue component.We fitted the corresponding composites by model spectra consisting of two stellar components represented by the template spectra of a 13 Gyr old E galaxy (Ell 13) and an Sc galaxy from Polletta et al. (2007), where their relative proportion is a free parameter.Another free parameter is the internal reddening E(B − V) of the starlight assuming the reddening law for Milky Way dust from Pei (1992).In reality, the stellar populations of the host galaxies may show a much broader variety and we cannot assume that such a simplified model perfectly reflects the observed spectrum in all its details (e.g.Schmitt et al. 1999).Nevertheless, we find that the model provides a reasonable fit to the continuum in the wavelengths range λ ≈ 3000 to 7000 Å.In the bestfit model for the PSS-CSO-G sample, the Sc component contributes ∼ 40 ± 3 % of the starlight at 3000 − 3200 Å, compared to only ∼ 7 ± 3 % for the ECS-G sample.The internal reddening is slightly stronger in the PSS-CSO-G model, E(B − V) = 0.07 compared to 0.03 for ECS-G.The differences between the observed composites and the model spectra are shown in Fig. 8.We note again the stronger [O ii] line in the PSS-CSO-G sample, in accordance with Sect.4.1.1 and Table 2.
In an alternative model, we described the blue component by a power-law F λ ∝ λ α λ for the AGN continuum with α λ = −0.5 for radio-loud AGNs (Brotherton et al. 2001).Here, the fraction of the AGN continuum is a free parameter.This model provides a similarly good match when the AGNs contribute ∼ 20% to the flux at 3000 − 3200 Å in the PSS-CSO-G sample, but negligibly little in the ECS-G sample.A moderate internal reddening of E(B − V) < ∼ 0.05 cannot be excluded but does not significantly improve the fits.
Adding the blue component has the effect of reducing the discontinuity in the model spectrum at ∼ 4000 Å.For a single stellar population, the strength of this discontinuity increases with age, but it depends also on metallicity (e.g.Bruzual 1983;Vazdekis et al. 2015) and, due to the mass-metallicity relation (Tremonti et al. 2004), on stellar mass (Haines et al. 2017).Because of the large stellar masses of the host galaxies in our samples (Table 2 and Sect.5), it may be that the 4000 Å discontinuity is stronger than in the Ell13 template spectrum, which would increase the necessary contribution from the blue component.However, this effect should be similar for both subsamples.
To summarise, the mean stellar populations of the hosts in the PSS-CSO-G and ECS-G subsamples appear to be slightly different.The flatter slope of the PSS-CSO-G composite at ∼ 3000 − 4500 Å suggests a stronger contribution from a blue component, which could be either a younger stellar population or AGN continuum.If PSSs and CSOs are young, the former would suggest a link between triggering radio activity and star formation.The latter could also be understood as an evolutionary effect: Let us assume that a percentage of ∼ 20% of the AGNs in the PSS-CSO-G sample are in an accretion mode with a standard accretion disk.If the lifetime of such an AGN phase (i.e. the accretion disk) is smaller than the growth time of the extended radio structure, we can assume that the percentage of such AGNs in the ECS sample is much smaller.
Because the QSOs in our samples are radio loud, the QSO composites can be compared with the composite spectrum of the about 400 radio-loud quasars from the FBQS.Compared to the latter, our QSO composites have a slightly redder (i.e.flatter in Fig. 7) UV continuum.The continuum slope can be influenced by several physical properties, such as intrinsic reddening, inclination of the accretion disk, accretion rate, SMBH mass and spin (Francis et al. 1992;Richards et al. 2002;Reichard et al. 2003;Shankar et al. 2016), or by observational effects (e.g.Harris et al. 2016;Milaković et al. 2021).Here, we assume that reddening by dust in the host galaxy is the most plausible cause, although we cannot exclude that spectrum-to-spectrum variations in our small samples play a role.We de-reddened our composite spectra using the SMC reddening curve (Pei 1992) and found a good agreement of the UV continuum of both spectra with that of the FBQS composite if we assume an additional intrinsic reddening of ∆E(B − V) = 0.04.We note that Fig. 7 shows the de-reddened spectra.
The change of the slope of the QSO composites at λ > ∼ 4000 Å can be attributed to the stellar light from the host galaxies, as indicated by the easily recognisable stellar absorption lines Ca ii H and K, Mg i at 5190 Å, and Na ii at 5893 Å. Evidently, the composite spectra can be described as a combination of an emission-line spectrum, a featureless continuum, and an absorption-line galaxy spectrum.We modelled the continuum plus stellar light in a similar way as described above for the galaxy composites and found an acceptable fit up to ∼ 6000 Å if we assume that the AGN continuum accounts for 87 ± 3 % in the PSS-CSO-Q and 93 ± 3 % in the ECS-Q composite at 3000 to 3200 Å.The uncertainty is a few percent, so the difference is probably not significant.For both QSO subsamples, the stellar continuum is well described by the Ell 13 template, the contribution from the Sc template is nearly negligible (a few per cent).At wavelengths > ∼ 6000 Å, the models underestimate the observed continuum flux.Such a trend of a greater contribution from starlight with increasing wavelength, which is also seen in other QSO composites, could be caused by emission from hot dust (Vanden Berk et al. 2001, and references therein) and by the high mass and correspondingly high metallicity of the host galaxies (Zhu et al. 2010).
Some particular 'features' in QSO spectra have sometimes been associated with early evolutionary phases of AGNs, such as weak broad emission lines (e.g.Hryniewicz et al. 2010;Meusinger & Balafkan 2014;Kumar et al. 2023) or broad absorption lines (BALs; e.g.Priddey et al. 2007;Bruni et al. 2015;Zeilig-Hess et al. 2020).For example, iron low-ionisation BALs (FeLoBALs) were considered to represent an early-stage of merger-driven accretion and a transition between an obscured AGN and an ordinary optical QSO (e.g.Voit et al. 1993;Farrah et al. 2010;Wethers et al. 2021). 6BAL properties (the 'balnicity') and radio emission may be linked to the same underlying process (Morabito et al. 2019).Strong iron emission bands provide another type of unusual spectral feature.FeLoBAL QSOs often exhibit impressive Fe ii and Fe iii emission bands in the UV (Hall et al. 2002;Meusinger et al. 2012).Strong optical Fe ii emission is one of the classification characteristics of the rare type of Narrow Line Seyfert 1 (NLS1) galaxies (Goodrich 1989;Véron-Cetty et al. 2001).NLS1s harbour relatively lowmass SMBHs (M • < ∼ 108 M ⊙ ) that accrete at high Eddington ratios and are interpreted as young or rejuvenated AGNs (Komossa et al. 2006;Mathur et al. 2012).Radio-loud NLS1s show similarities with CSS sources (e.g.Yuan et al. 2008;Caccianiga et al. 2014;Berton et al. 2017).Liao & Gu (2020) identified the CSS source SDSS J133108.29+303032.9 as a NLS1, Järvelä et al. (2022) found five new NLS1s that exhibit properties consistent with the CSS classification.We see in our PSS-CSO composite spectrum neither signs of conspicuously weak broad emission lines nor strong Fe multiplets in emission or absorption.In addition, we note that there is no indication of a substantial unobscured post-starburst stellar population of an age of less ∼ 1 Gyr, which would be indicated by the Balmer absorption lines of A-and F-type stars, as has been observed in a few AGN galaxies (Liu et al. 2007;De Propris & Melnick 2014).

Multi-band spectral energy distributions
We used photometric data from major sky surveys to derive the multi-band spectral energy distributions (MBSEDs) from the far UV to the MIR.Apart from the SDSS five-band photometry, we used data from the Galaxy Evolution Explorer (Galex; Morrissey et al. 2007) survey in the UV, the Two Micron All-Sky Survey (2MASS; Skrutskie et al. 2006), the UKIRT Infrared Deep Sky Survey (UKIDSS; Lawrence et al. 2007;Hambly et al. 2008) in the near-infrared (NIR), and the AllWISE Data Release (Cutri et al. 2021) from the Wide-Field Infrared Survey Explorer (WISE; Wright et al. 2010) in the MIR.The magnitudes were extracted from the catalogues using the VizieR Service at CDS7 and the NASA/IPAC Infrared Science Archive (IRSA) 8 .On average, an object from our PSS-CSO sample has measurements in 11 different bands, 95% are measured in at least 8 bands and 68% in at least 10 bands.The magnitudes were corrected for Galactic foreground extinction using E(B − V) from Schlafly & Finkbeiner (2011) and converted into fluxes in the rest frame.
The observed MBSEDs were compared with galaxy template spectra from the SWIRE library (Lonsdale et al. 2003;Polletta et al. 2007), which consists of 25 SEDs covering the wavelength range from 0.1 to 1000 µm.It contains all major galaxy types, including ellipticals of different ages, early-type and late-type spirals, starburst (SB) galaxies, type 1 AGNs, type 2 AGNs, and composite types SB+AGN (Table 4).The spectra for the ellipticals, the spirals and the SB galaxies were generated with a spectral evolution code.The SB templates correspond to the observed SEDs of well-studied SB galaxies.Templates of Seyfert 1.8 and 2 galaxies were obtained by combining models, broadband photometric data and infrared spectra of a random sample of 28 Seyfert galaxies.Another six AGN templates represent optically selected QSOs with different infrared-to-optical flux ratios (QSO1, TQSO1, and BQSO1) and two versions of type 2 QSOs (QSO2 and Torus).The templates for composite types are empirical SEDs of objects that contain a powerful SB component, mainly responsible for their large infrared luminosities, and an AGN component that contributes to the MIR luminosities.We fitted all template spectra from Table 4 to the observed MBSEDs and determined the best-fitting template using the least chi-square method.Figure 9 shows nine examples of resulting MBSEDs in the rest-frame.The fluxes from the photometric data are shown as black squares with error bars.In most cases, the error bars are smaller than the symbol size.The best-fitting template is shown in red.Also plotted is the SDSS spectrum corrected for Galactic foreground reddening, transformed into the rest image and scaled to the MBSED in the optical domain (SDSS bands g, r, and i).To facilitate the use of the classification scheme, we have grouped the different SED templates into six groups (last column of Table 4).The meaning of the group number code is t SED = 1 for E and S0 galaxies ('early types'), 2 for spiral galaxies, 3 for SBs, 4 for SB-AGN composites (or transition objects), 5 for AGNs of type 1.8 to 2, and 6 for type 1 QSOs.The histograms in Fig. 10 show the distributions of these types in the PSS-CSO (blue) and the ECS (green) sample, respectively, with both samples subdivided again into the two subsamples G and Q.We also note that the histogram distributions from the second-best fit do not differ much from those in Fig. 10.As expected, there is a clear and strong difference between the G and Q subsamples, both for the PSS-CSO sample and the ECS sample.The vast majority of objects assigned to spectral class QSOs based on SDSS spectra, i.e. subsample Q, also have MBSEDs of type 1 QSOs.On the other hand, most of the objects from the subsample G are found to show MBSEDs of normal galaxies (early type or spiral).The proportion of early-type galaxies is higher in the ECS-G sample than in the PSS-CSO-G sample, but overall Fig. 10 shows a similar distribution of types in both samples.For a quantitative analysis, we performed Pearson's chi-square test for two independent samples to compare the distributions of types in the corresponding subsamples.The null hypothesis H 0 states that the subsamples PSS-CSO-G and ECS-G have the same distributions of types, and likewise the subsamples PSS-CSO-Q and ECS-Q.H 0 was tested against the alternative hypothesis, H A , that the two samples have different distributions.The results indicate that there is no reason to reject H 0 in favour of H A at α = 0.05.
Next, we focus on the proportions f sf of SF galaxies.For this purpose, we have summarised the templates in a coarser scheme, distinguishing only between types with active SF (t SED = 2 − 4) on the one hand and all other types (i.e.either dominated by an old stellar population or by an AGN) on the other.For this analysis, we included only sources with fluxes in at least eight wavelength bands, where at least two data points must be in the NIR or MIR.The resulting samples consist of 115 PSSs and CSOs and 116 ECSs, respectively.The fraction of SF galaxies is f PSS sf = 0.46 ± 0.05 for the PSS-CSO sample and f ECS sf = 0.28 ± 0.04 for the ECS sample.We applied the one-tailed two-sample Z test of proportions where the null hypothesis, H 0 : an error probability α.The result is that H 0 must be rejected in favour of H A at the 95% confidence level (α = 0.05), i.e. we can assume that the proportion of SF galaxies in the PSS-CSO sample is larger than that in the comparison sample (p = 0.006).On the other hand, the two-sided Z-test for the proportions of AGNs (including AGN-SF composites) shows that there is no evidence of different AGN proportions in the PSS-CSO and ECS samples.
The formation of a spheroidal component seems to be important for the history of SF in galaxies.Late-type disk-dominated spiral galaxies (Sbc, Sc, Sd) tend to actively form stars, while E and S0 are usually quiescent.In terms of both morphology and SF indicators, Sa galaxies lie between the passive galaxies and SF galaxies (Bendo et al. 2002;Hameed & Devereux 2005;Vika et al. 2015;González Delgado et al. 2016;Wang et al. 2018).This is also evident in our two samples.A total of 21 sources were classified as Sa.For 10 of them, the MIR colours (Sect.4.3) correspond to spiral galaxies with moderate SFR, while 11 are passive galaxies, similar to E and S0.Therefore, we have also performed the above analysis for an alternative classification scheme in which the Sa galaxies were classified as early.In this case, too, there is a clear difference between the two samples ( f PSS sf = 0.36 ± 0.05, f ECS sf = 0.23 ± 0.04, p = 0.014).The results from the analysis of the MBSEDs agree well with those derived from the composite spectra.In particular, they support the interpretation of the differences between the composite spectra as a consequence of different contributions from younger stellar components (Sect.4.1.2).There is a stronger association with younger star formation in the PSS-CSO sample.

WISE fluxes and colours
Observational data in the MIR spectral range are particularly useful for studying the relationship between the AGN emission and the contribution from SF in the host galaxy.We used the WISE data in the four bands W1, W2, W3, and W4 at the central wavelengths 3.4, 4.6, 12, and 22 µm, respectively (Wright et al. 2010).Jet-dominated AGNs and SF galaxies differ in the ratios of their MIR flux to the radio flux.To quantify the MIR-toradio ratio, Caccianiga et al. (2015) introduced the parameter q22 = log (F 22 µm /F 1.4 GHz ), where F 22 µm is the flux density in the W4 band and F 1.4 GHz is the 1.4 GHz flux density.Major star formation contribution to the radio emission can be expected in sources with q22 > 1 (Caccianiga et al. 2015).We computed q22 with k-corrections using the individual spectral slopes between the 12 and 22 µm (observed frame) flux densities for F 22 µm and the radio slope α r = −0.7 for F 1.4 GHz .In both samples, the q22 values are distributed over a large range, but all sources have q22 < ∼ 1 and, except for two low-z systems, q22 < 0 (Fig. 11).SF activity is not a dominant source in the radio band, the ob- served radio flux is most likely dominated by synchrotron emission from the relativistic jet.However, it must be taken into account that the plotted samples are highly incomplete.Only 33% of the PSS-CSO-G subsample and 11% of the ECS-G subsample are detected in the W4 band and meet the requirement S/N > 3 for F 22 µm .In particular, there is a bias against normal E, S0, and S galaxies (t SED = 1 or 2), as can be seen in Fig. 11.With some cautions (e.g.Gürkan et al. 2014), the WISE colours W1 − W2 and W2 − W3 provide a useful tool for selecting AGNs and detecting SF activity in galaxy samples (e.g.Stern et al. 2012;Jarrett et al. 2017;Cluver et al. 2017).In the W1−W2 vs. W2 − W3 diagram, galaxies with little hot dust emission occupy a narrow sequence at W1 − W2 ≈ −0.2 to + 0.5 mag, where the colour index W2 − W3 is a good indicator of the SF activity.Galaxies with high SFR, including luminous infrared galaxies (LIRGs) and ultra-luminous infrared galaxies (ULIRGs) are found at the right-hand side of this sequence at W2 − W3 > 3.5 mag ('SF region').Galaxies with little or absent SF populate the left-hand side at W2−W3 < 2 mag ('passive region').The galaxy population in the intermediate area is attributed to normal spiral disks with moderate SFR.AGNs with hot dust populate the colour space above the threshold W1 − W2 = 0.8 mag ('AGN region').
Figure 12 shows the W1 − W2 vs. W2 − W3 diagram (Vega magnitudes) for the PSS-CSO sample (top) and the ECS sample (middle), where only sources with reliable signal-to-noise ratios are plotted (S/N > 5 for the W1 and the W2 band, S/N > 3 for W3).In the passive region, all PSSs and CSOs and the majority of the ECSs have MBSEDs of early-type galaxies.In the AGN region, on the other hand, the vast majority of the sources belong to the QSO subsamples with MBSEDs classified as AGNs or composites.In both two samples, several objects with AGNtype MBSEDs have W1 − W2 values below the AGN demarcation.This is most likely due to a 'dilution' of the AGN radiation by stellar light in the bands W1 and W2 (Donley et al. 2012).The proportion of sources in the AGN region is similar in both samples (53 ± 5 % for PSS-CSO and 59 ± 5 % for ECS).In the non-AGN region, however, the distribution along the W2 − W3 axis is different: For the PSS-CSO sample, we find 9 ± 3 % in the passive and 38 ± 5 % in the intermediate plus SF region, compared to 24 ± 5 % and 17 ± 4 % for the ECS sample.The bottom panel of Fig. 12 shows the distribution of W2 − W3 for the galaxies (subsamples PSS-CSO-G and ECS-G) with W1 − W2 < 0.8 and z < 0.6.The redshift limit was chosen because the strong polycyclic aromatic hydrocarbon (PAH) features around ∼ 11 µm, which are usually considered indicators for massive young stars, are no longer covered by the W3 band at higher redshifts.With only one exception, WISE measurements are available for all PSSs, CSOs, and ECSs with z < 0.6.It can be seen that the PSS-CSO-G sample has a larger contribution from SF activity.We applied again the one-tailed two-sample Z test of proportions to test the null hypothesis H 0 : and found that H 0 can be rejected in favour of H A at a high significance level (p = 0.003).The Z test for all sources at z < 0.6, i.e. without restricting S/N, gives the same result (p = 0.01).
The WISE colour W3 − W4 is particularly sensitive to the strength of SF because a high SFR leads to strong emission by a warm dust component, which contributes more to the W4 band than to the W3 band.The colour-colour diagram W1 − W2 vs. W3 − W4 is shown in Fig. 13 for the sources with S/N > 5 in the bands W1 and W2, and S/N > 3 in W3 and W4.As in Fig. 12, the 'AGN cloud' at W1 − W2 > 0.8 is clearly seen.In the non-AGN region (W1 − W2 < 0.8), the sources cover a wider interval of W3 − W4 than the AGN cloud.At first glance, the distribution of the PSSs and CSOs along the W3 − W4 axis is similar to those in the colour-magnitude diagrams presented by Nikutta et al. (2014) for huge samples of QSOs, AGNs, and SF galaxies, respectively.They found the following median values µ of the W3 − W4 distribution: 2.45 ± 0.19 for 14 795 QSOs, 2.54 ± 0.29 for 4 509 AGN galaxies, and 2.19 ± 0.33 for their sample of 38 092 SF galaxies.Here we derived the median values µ(W3 − W4) = 2.55 ± 0.05 for the PSS-CSO and 2.53 ± 0.05 for the ECS sample.These values are similar to those given by Nikutta et al. (2014) for their samples of AGN galaxies and QSOs.The bottom panel of Fig. 13 shows the histogram distribution of W3 − W4 for the galaxies at z < 0.6 in the PSS-CSO and the ECS samples.We note that reliable measurements of W4 are only available for a small number of galaxies.
The interpretation of the W3 − W4 distribution in the non-AGN region is not entirely clear.When only the galaxies with W1 − W2 < 0.8 and z < 0.6 are considered, the resuting median values are µ(W3 − W4) = 2.34 ± 0.16 for the PSS-CSO subsample and 2.42 ± 0.08 for the (very small) ECS subsample.The blue PSSs and CSOs with W3 − W4 < 2 all have MBSEDs classified as early-type galaxies (t SED = 1).On the other side, the very red sources with W3 − W4 > 3 were classified as spiral or starburst galaxy.The reddest galaxy in the top panel of Fig. 13 is SDSS J075756.71+395936.1 (z = 0.066) with W3 − W4 = 3.66.Based on its MBSED, it was classified as an Sb galaxy, its SFR of ∼ 16, M ⊙ yr −1 (Chang et al. 2015) is the highest in our PSS-CSO sample (see Sect. 5 and Table 5).Investigating a sample of flat-spectrum NLS1s, Caccianiga et al. (2015) argued that red colours W3 − W4 > 2.5 cannot be explained by AGN template spectra alone, but require a significant SF component.However, other effects may also contribute to a red W3 − W4 colour: warm dust components heated by the AGN (polar dust and possible warm dust related to the dusty torus) or the suppression of PAHs by the AGN (Järvelä et al. 2022, and references therein).

Star formation rates and stellar masses
SFR values from simple model fits are unreliable for galaxies where the fluxes are likely to be significantly affected by the AGN component.It has been known for a long time that the 4000 Å discontinuity in the spectra of galaxies correlates with the ratio of the present SFR to the past-averaged SFR and can thus be used to measure SF histories (e.g.Bruzual 1983;Kauffmann et al. 2003b;Gallazzi et al. 2014;Haines et al. 2017;Borghi et al. 2022), where it must be noted that there is also a correlation with metallicity (Vazdekis et al. 2015).Brinchmann et al. (2004) suggested to use the 4000 Å break index D4000n (Balogh et al. 1999) to estimate the in-fibre SFR in composite and AGN galaxies.This index expresses the ratio of the average flux densities F ν in the narrow bands 4000 -4100 Å and 3850 -3950 Å.The values of D4000n are available from the MPA-JHU catalogues (SDSS table galSpecIndx) for ∼ 80% of the galaxies (class = GALAXY) in our samples.For the other galaxies, we measured D4000n from the SDSS spectra.Using reliable data only (S/N > 3), the mean values of D4000n are 1.70 ± 0.04 for the PSS-CSO and 1.88 ± 0.02 for the ECS sample (Table 2).The larger value for the latter is consistent with the stronger contribution of the old stellar population seen in the composite spectrum (Sect.4.1).
Robust data on SFRs and stellar masses for more than 8 10 5 galaxies of the full SDSS spectroscopic galaxy sample have been provided by Chang et al. (2015).Based on the combination of SDSS and WISE photometry, the authors created SEDs that cover the wavelength range 0.4 -22 µm.They employed MAGPHYS for SED modelling that consistently treats stellar emission along with absorption and re-emission by interstellar dust.The differences between their data and the MPA-JHU data is based on the addition of the MIR fluxes, the use of updated Galactic extinction corrections, and different dust attenuation laws.Cross-correlation with the Chang et al. (2015) catalogue yields 38 matches for our PSS-CSO-G sample and 40 matches for the ECS-G sample.Following the recommendation of Chang et al. (2015), we focus on objects with flag = 1, i.e. galaxies with reliable aperture corrections, good WISE photometry and good-quality SED fits.This reduces the samples to 19 (PSS-CSO) and 27 (ECS) galaxies, all at z < 0.2.Among them, there is only one galaxy with t SED > 2, namely the ECS galaxy SDSS J094124.02+394441.8, which has been classified as a starburst galaxy (t SED = 3).More than 90% of the galaxies in both samples have SFRs < ∼ 5M ⊙ yr −1 .Larger SFRs are found only for SDSS J075756.71+395936.1 (10 M ⊙ yr −1 ) from the PSS/CSS Filled squares signify sSFR data from good model fits (FLAG=1).The dashed line indicates the boundary between the SF sequence (above) and passive galaxies (below).The dotted line is a power-law approximation of the mean values from Brinchmann et al. (2004, their Fig. 11).The spectral types t SED from the MBSED fit are colour coded as in Fig. 12. Bottom: Histogram distributions of the D4000n index for the PSS-CSO (blue) and ECS (green) galaxies with S/N > 3.
The ratio of the present to the past-averaged SFR is expressed by the specific star formation rate sSFR = SFR/M * , where M * is the stellar mass.The sSFR versus D4000n dia-  2015) for the PSS-CSO and the ECS samples.The colour coding of the symbols is the same as in Fig. 14, filled squares signify galaxies with good model fits.The cyan curves are local density contours maps for ∼ 9 • 10 4 SDSS galaxies.The black lines indicate the SF sequence (solid) and the 3σ scatter (dashed), the blue lines are the corresponding extrapolations to log M * /M ⊙ = 12.Bottom: Histogram distributions of the SFR for the PSS-CSO (blue) and ECS (green) galaxies with FLAG = 1.gram for ∼ 10 5 objects from the main SDSS galaxy sample with redshifts z < 0.2 shows two clouds with the maximum population densities at D4000n ∼ 1.3 and 1.9, respectively (Kauffmann et al. 2003b;Brinchmann et al. 2004).The first cloud comprises SF galaxies, the second one consists of old elliptical galaxies.Figure 14 shows the sSFR versus D4000n diagrams for our sam-ples.The dotted line marks the mean relation for the SF sequence and the dashed line indicates the border between the SF and the passive region, both taken from Brinchmann et al. (2004, their Fig. 11).As expected, the population below the demarcation line is strongly dominated by galaxies classified as early type.The diagrams look different for the two samples with more mainsequence galaxies in the PSS-CSO sample: 37 ± 11 % of the PSS-CSO galaxies with flag=1 lie above the demarcation line, but only 11 ± 6 % of the ECS galaxies (42 ± 8 % and 8 ± 4 % respectively for all galaxies).
The bottom panel of Fig. 14 shows the histogram distributions of the D4000n index.There is a strong peak at D4000n ∼ 1.9 in the comparison sample, reflecting again the dominance of old stellar populations.This peak is also present in the PSS-CSO sample, but there is a stronger peak at ∼ 1.6, which reflects the clump at the end of the SF sequence seen in the top panel.According to the two-sample KS test, the difference between both distributions is highly significant (p = 3 • 10 −5 ).
Figure 15 displays the SFR vs. stellar mass diagrams.For comparative purposes, we overplotted the distribution of the data from Chang et al. (2015) for ∼ 9 • 10 4 galaxies from the SDSS spectroscopic sample with z < 0.2 by equally spaced local density contours. 9Also plotted is a power law approximation of the SF sequence and the 3σ limits with σ = 0.39 dex, both from Chang et al. (2015).Both the PSS-CSO and the ECS samples span a relatively narrow range of stellar masses at the high-mass end of the galaxy distribution.The derived masses are at the lowmass end of the mass distribution of the brightest cluster galaxies (Dalal et al. 2021), but are typical of luminous radio galaxies.On the other hand, the SFRs cover a wide range from close to the SF sequence down to almost zero.The diagrams again indicate a lower proportion of SF galaxies in the ECS sample: 42 ± 10 % of the PSS-CSO galaxies with flag=1 lie above the lower limit of the extrapolated SF sequence, but only 22 ± 8 % of the ECS galaxies (45 ± 8 % and 18 ± 6 % for all galaxies).The histogram distributions of the SFR in the two samples are displayed in the bottom panel of Fig. 15 for the sources with good quality SFR data.According to the two-sample KS test, the difference between the two distributions is significant (p = 0.03) when we include all galaxies, but not for the rather small samples of sources with flag=1.A similar result is found for the sSFR distributions.The mean values of D4000n, log M * , log SFR, and log sSFR for the two samples are listed in Table 2.

Optical morphology
This section addresses the issue of interactions between galaxies and the merging of galaxies as one of the main drivers of core activity.Extended tidal structures, shells, rings, strong lopsidedness, or simply close galaxy pairs are usually taken as signposts of gravitational perturbation induced by a close encounter or merger.Finding such structures in distant galaxies is a challenge.Detecting extended low-surface brightness features requires deep observations and discovering peculiar structures in the inner regions of galaxies requires high spatial resolution.We have taken two approaches to achieve this goal.First, we analysed archival images from the Hubble Space Telescope (HST) 9 Following Chang et al. (2015, their Fig. 11), we used only the 86 923 galaxies above the (redshift-dependent) mass completeness limit for the contour map and applied the standard 1/V max weighting (Schmidt 1968).The contours were computed for a grid size of (∆ log M * , ∆ log SFR) = (0.05, 0.1) and are spaced by a factor of 1.5, starting at 0.66.
for a subsample of 19 sources.Second, we performed a statistical analysis of the subsamples of the low-z galaxies on images from major galaxy surveys.We started with the combination of the images from SDSS and PanSTARRS-1 (PS1, Flewelling et al. 2020) for the sources up to z = 0.2, and then we also analysed the images from the DESI Legacy Surveys (Dey et al. 2019) up to z = 0.3.
If the data set is not too large, the human visual system is still the most efficient and reliable for recognising complex patterns.To determine the fraction of mergers or otherwise morphologically distorted galaxies, a classification system has to be defined.Tidal structures are very diverse, dependent on the properties of the involved galaxies and the merger parameters (e.g.Duc & Renaud 2013;Ren et al. 2020).With regard to the subjectivity of an inspection by eye we prefer here the simple classification scheme that was applied to post-SB galaxies by Meusinger et al. (2017).It classifies galaxy morphology into one of four types that are assigned to the numerical distortion flag t m (index 'm' for merger): 0 = no apparent peculiarities, 1 = weak indications of a structure that seems to be related to a neighbour galaxy (Messier 51 type), 2 = weak lopsidedness, tidal structures, such as streamers, tidal arms, fans, or shells, 3 = the same characteristics as for t m = 2, but very pronounced.Furthermore, t m = −1 was provided if an evaluation is not possible, but this did not apply in any case.
A neighbouring galaxy (in the projection) is only considered to indicate an interaction if the two galaxies are either embedded in a common halo or connected by a light bridge.A lower value of t m can, of course, be the result of factors that make it difficult to detect low-surface brightness structures, such as a bright AGN (e.g.J080133.55+141442.8 in Fig. 16) or blending of the host by a nearby bright object.In general, it is difficult to distinguish minor mergers from old and faded major mergers.A value of t m = 2 does therefore not necessarily mean that the distortions are intrinsically weaker than for t m = 3.In the following, we consider t m ≥ 2 as indicative of strong tidal interactions and mergers.

HST images
The high spatial resolution of the HST is a great advantage for the detection of merger signatures, especially in the inner regions of the galaxies and at higher redshifts.We searched the HST Legacy Archive10 for images of the PSS-CSO hosts from our sample.We found images of 18 PSS-CSO galaxies from different observation programmes.(In addition, the HST image of the discarded source SDSS J101714.23+390121.1 is shown and discussed separately in Appendix A.) Although this subsample is small and too inhomogeneous for the statistical analysis, it gives an impression of the frequency of nearby neighbours and peculiar morphologies.
Figure 16 shows the HST images, which were obtained with different cameras and through different filters.If several images from the same camera are available, they were stacked to increase the signal-to-noise ratio.For the two most distant objects, SDSS J080447.96+101523.7 (z = 1.968) and SDSS J213638.58+004154.2 (z = 1.941), the host galaxies are not visible.The same applies to J133037.69+250910.9 (z = 1.055).11All three objects are spectroscopically classified as quasars.The HST image of J131739.20+411545.6 (z = 0.066) strengthens the case for the existence of PSSs and CSOs in galaxies with significant disk components.The edge-on view makes it difficult to detect possible morphological distortions in this galaxy.All other host galaxies appear distorted or have nearby neighbours (J092405.30+141021.4 and J100800.04+073016.6), which indicates a high merger fraction of PSS-CSO galaxies.

Images of low-z galaxies from major surveys
We applied the above classification scheme to the subsamples of the low-z host galaxies.According to our experience from previous studies (e.g.Meusinger et al. 2017), it is difficult to recognise tidal features from galaxy-galaxy interactions on SDSS images at redshifts larger than about 0.2.Therefore, we restricted this analysis to z < 0.2, which results in a list of altogether 67 galaxies: 33 from the PSS-CSO and 34 from the ECS sample.
One disadvantage of our method is, of course, the inevitable subjective character of the classification by eye.Reliable conclusions about the relative frequency of distorted morphologies of PSS-CSO galaxies are only meaningful in relation to the comparison sample.In order to reduce the risk of a subjective bias towards over-or underestimation of peculiarities in the target sample, it is important to ensure an unbiased assessment of the images of both samples in a 'blind' approach.To do so, we stored the images from both samples in the same archive and selected with stellar origin.On the other hand, J133037.69+250910.9 has AGNtypical MIR colours [3.6] − [4.5] = 0.78 and [5.8] − [8.0] = 1.13.them randomly for the assessment, whereby the examiner could not know which sample they belonged to.Simultaneous inspection of the SDSS and the PS1 i-band image12 significantly reduces the risk of false detection of weak features due to noise or artefacts.A peculiar structure is only considered recognised if it is visible on both images.The images were evaluated in three independent runs.In each run, a merger flag t m was assigned to each galaxy.For 26 PSS-CSO galaxies (81%), identical values were estimated in all three runs.
Based on these results, relative merger fractions f m = N m /N tot were computed, where N m is the number of mergers and N tot is the total number of galaxies in the sample with t m ≥ 0. Using the merger criterion t m ≥ 2, we find 16 mergers in the PSS-CSO but only 4 in the ECS sample.
Deeper optical images were recently provided by the DESI Legacy Imaging Surveys.For our study, using deeper images not only has the advantage of improving the confidence in detecting peculiar morphological structures, but also that the redshift range of the low-z sample can be expanded and thus the statistics are improved.In the second step, we therefore analysed the image cutouts from the DESI Legacy Surveys DR1013 , which completed three imaging projects on different telescopes in the g, r, and z bands and incorporates additional data in the i band.The procedure is the same as before, the z range has been expanded up to z = 0.3.This new low-z sample now consists of 43 galaxies from the PSS-CSO sample (Table 5) and 41 ECS galax-ies.Now we found 26 mergers in the PSS-CSO sample and only six in the ECS sample.The results confirm the previous finding of a significantly higher merger fraction for the PSS-CSO sample compared to for the ECS sample.The mean peculiarity index is t m = 1.47 ± 0.19 for the former and t m = 0.45 ± 0.14 for the latter sample.In the following we only refer to the results from the Legacy Surveys images.The mean t m values from the three runs are listed in the last column of Table 5 along with other properties of the sources from this low-z subsample. 14 Examples of merger galaxies are shown and briefly described in Appendix B. The resulting merger fraction f PSS m = 0.61 ± 0.07 for the PSS-CSO sample is in accordance with the range of 0.4 -0.7 from previous systematic studies (O'Dea & Saikia 2021, and references therein).We note however that the numbers are not directly comparable to each other because of the differences in the observational material, the evaluation methods and the merger criteria.The merger fraction of the ECS sample is much lower, f ECS m = 0.15 ± 0.06.According to the one-tailed two-sample Z test, the difference is highly significant (p < 10 −5 ).  1Because this study is focused on the host galaxies, we list the SDSS names in column 2, not the names of the radio sources.The latter can be found in Liao & Gu (2020).
It is well known that gravitational interactions and merging can elevate the SFR in galaxies.This is also seen in our data.The histograms in Figs. 17 and 18 show the distributions of the number of mergers across the different MBSED types and the merger fraction per type, respectively.62 ± 9 % of the merger galaxies in the PSS-CSO low-z sample show SEDs of star forming galaxies (t SED = 2, 3 or 4).For the non-merger galaxies, this proportion is smaller, 47±12 %.All four PSS-CSO host galaxies from Table 5 with SFR > 10 M ⊙ yr −1 are mergers (Table 5), although the reliability flag is set to 1 for only one them (SDSS J075756.71+395936.1).Conversely, not all disturbed galaxies with t m ≥ 2 have substantial SF, which is expected particularly for dry mergers.

Discussion of the different merger fractions
The merger-AGN connection is the subject of a long-running debate.Both theoretical arguments and numerical simulations suggest that 'wet' galaxy mergers (i.e. with at least one gas-rich component) are capable of efficiently triggering streams of cold gas towards the innermost regions of galaxies, thus in principle creating the conditions for triggering nuclear activity by feeding the central SMBH (e.g.Toomre & Toomre 1972;Sanders & Mirabel 1996;Kauffmann et al. 2000;Di Matteo et al. 2005;Hopkins et al. 2008;Holt 2009;Blecha et al. 2018;Weigel et al. 2018;McAlpine et al. 2020;Zhang et al. 2021).However, despite intensive efforts, the observational evidence is inconsistent.Several studies do not point to a close connection between AGNs and (observable evidence for) recent galaxy mergers (Malkan et al. 1998;McLure et al. 1999;Grogin et al. 2005;Georgakakis et al. 2009;Cisternas et al. 2011;Schawinski et al. 2011;Hewlett et al. 2017;Villforth et al. 2017;Lambrides et al. 2021;Sharma et al. 2021).In contrast, the view that the phenomenon of luminous AGNs is essentially related to interactions between galaxies is supported by other studies (Hutchings & Campbell 1983;Gehren et al. 1984;Keel et al. 1985;Heckman et al. 1986;Stanghellini et al. 1993;Bennert et al. 2008;Karouzos et al. 2010;Ramos Almeida et al. 2011;Bessiere et al. 2012;Villar-Martín et al. 2012;Sabater et al. 2013;Chiaberge et al. 2015;Weston et al. 2017;Goulding et al. 2018;Gao et al. 2020).In particular, galaxy mergers may play a role in triggering powerful radio AGNs (Ramos Almeida et al. 2012;Chiaberge et al. 2015).These seemingly contradictory results could be partly due to differences in the depth of the images used to classify mergers (Bennert et al. 2008;Bessiere et al. 2012) and due to different techniques of selecting the AGN sample (e.g.Ellison et al. 2019, and references therein).Furthermore, strong AGN variability could have an impact if the variability time scale is much shorter than the characteristic lifetime of merger relics (Hickox et al. 2014;Goulding et al. 2018;French et al. 2023).
Another controversial connection is that between the triggering of an AGN and the quenching of SF in the host galaxy.Because of the strong energetic and mechanical feedback, AGNs are assumed to play a decisive role in this process (Fabian 2012;Heckman & Best 2014).However, only a few post-SB galaxies show signs of AGNs in the form of strong AGN emission lines or bright emission in the MIR, X-ray or radio domain (Nielsen et al. 2012;Meusinger et al. 2017;Greene et al. 2020).Recently, Wu et al. (2023) reported a high occurrence rate of AGN-driven [O iii] 5007Å emission derived from ultra-deep spectra of a small sample of post-starburst galaxies, suggesting a higher abundance of AGNs than previously thought.It is not our aim to evaluate the merger fractions from Sect.6.2 in the context of the debate on the merger-AGN-SF connection.However, we think it is important to understand the difference between the rates for the PSS-CSO and the comparison sample.If the PSS-CSO sample represents young radio galaxies in which a fraction f PSS m was triggered by mergers, it seems reasonable to explain the lower fraction in the comparison sample by evolutionary or environmental differences in the observability of the tidal structures.In the following, we examine in more detail the extent to which these two explanations could apply to our samples.

Evolutionary effects
A simple evolutionary scenario is based on the fact that the tidal structures disappear and become invisible after a certain time.If a significant proportion of young radio galaxies is associated with galaxy mergers and the radio sources are still present for a considerable period after the tidal structures are no longer detectable, a sample of young radio galaxies can be expected to have a higher percentage of mergers than a comparison sample of extended sources.The main requirements for this scenario are that (i) the comparison sample represents the more evolved stages of the PSSs and CSOs and (ii) the radio source lifetime is comparable to or longer than the lifetime of the observable morphological merger signatures.
Observations suggest that the merger-AGN association is particularly clear for luminous AGNs obscured by dust (Urrutia et al. 2008;Satyapal et al. 2014;Kocevski et al. 2015;Fan et al. 2016;Weston et al. 2017;Ricci et al. 2017;Goulding et al. 2018;Steinborn et al. 2018;Ellison et al. 2019), which are predominantly found in the late stages of mergers.Patil et al. (2020) studied radio galaxies with extremely red MIR-optical colour ratios and found that these sources are 'consistent with a population of newly triggered, young jets caught in a unique evolutionary stage in which they still reside within the dense gas' and might become GPS or CSS sources.Based on the analysis of cosmological hydrodynamical simulations, McAlpine et al. (2020) investigated the connection between galaxy-galaxy mergers and enhanced SMBH growth.They concluded that, at low redshift, the enhancement of AGNs peaks at ∼ 0.3 Gyr after coalescence.In the late stage of mergers, tidal structures disappear quickly.Barnes (2016) computed the evolution of the tidal fraction f tid , i.e. the fraction of disk particles in the tidal tails, in numerical simulations of major mergers of disk/bulge/halo systems for a set of different merger parameters.From their Fig.14, we derive a mean decline rate of ∆ f tid / f tid = −1.5 ± 0.13 per Gyr in the late stage of the mergers, where we adopted the value ∆t p12 = 0.5 Gyr for the mean time between the first and the second pericentric passage (Mihos & Hernquist 1996).We assume that the tidal fraction can be taken as a proxy of the observability of the tidal features.The typical lifetimes of FR II galaxies are usually assumed to be ∼ 10 6 − 10 8 yr (Allen et al. 2006;O'Dea et al. 2009;Bird et al. 2008;Antognini et al. 2012;Garofalo et al. 2018).Thus, if we assume a mean age of < ∼ 0.05 Gyr for the evolved radio galaxies in the comparison sample, the merger fraction of the comparison sample is expected to be only less than 8% smaller than that of the PSS-CSO sample, which is far from sufficient to account for the observed difference between f PSS m = 0.61 and f ECS m = 0.15.The above evolution scenario is based on the assumption that all PSSs and CSOs evolve into extended radio galaxies represented by the ECS sample.However, if not all PSSs and CSOs evolve (O'Dea & Saikia 2021, and references therein), the merger fraction of evolved radio sources may be lower than that of the young sources due to another effect: If a significant fraction of the merger-triggered PSSs and CSOs get stuck in the dense interstellar medium of the merger core and suffers the 'dying type 3 source' fate described by An & Baan (2012), the counterparts of these merger galaxies are missing from the comparison sample.Let us assume that the proportion f If we set f y m = 0.61 ± 0.07, i.e. equal to the observed merger fraction of the PSS-CSO sample, we find that the low observed merger fraction of the comparison sample, f ECS m = 0.15 ± 0.06, can be reproduced if R 2 = 7.5 ± 1.5.This means that 88 ± 3 % of the merger-triggered sources, or at least ∼ 54 ± 2 % of all young radio sources, do not evolve to extended sources represented by the ECS sample.

Cluster environment
Current evidence suggests that both the persistence of tidal structures and the properties of AGNs depend on the environment of the galaxies (e.g.Mihos 2004;Sabater et al. 2013;Marshall et al. 2018).In particular, tidal structures tend to be weaker and disappear more rapidly in the dense environment of galaxy cluster cores.As a result, the merger fraction of a sample of cluster galaxies would be smaller than that of field galaxies.O'Dea & Saikia (2021) summarised previous studies on small samples of powerful GPSs and CSSs.They concluded that their environment appears to be similar to that of powerful large radio sources and suggested that follow-up studies be carried out on larger samples.
We have used the following large catalogues of galaxy clusters to check the cluster membership.The GMBCG Galaxy Cluster Catalog (Hao et al. 2010) is a catalogue of galaxy clusters selected from SDSS DR7 by identifying the red-sequence plus the brightest cluster galaxies.It contains 55 424 rich galaxy clusters in the redshift range 0.1 < z < 0.55.Wen et al. (2012, WHL) identified 132 684 clusters with 0.05 < ∼ z < ∼ 0.75 on the basis of photometric redshifts of galaxies from SDSS DR8.Wen & Han (2015) published an updated version of the WHL catalogue, using the SDSS DR12 spectroscopic data and additionally a list of 25 419 complementary clusters at high redshifts around bright galaxies.For about 89% of the identified clusters in the SDSS spectroscopic survey region, spectroscopic redshifts are available in the SDSS DR12.The cluster redshifts cover the range 0.05 < z < 0.75.Banerjee et al. (2018) presented a galaxy cluster catalogue constructed from the SDSS DR9, which comprises 46 479 galaxy clusters at z < ∼ 0.64.Complementary to the above data sources, the galaxy cluster catalogues of Burenin (2017), Wen et al. (2018), andKirkpatrick et al. (2021) were included in our search.The extension of the Planck galaxy cluster catalogue by Burenin (2017) comprises ∼ 3 000 galaxy clusters detected in the Planck all-sky Compton parameter maps and identified using data from the WISE and SDSS surveys.Wen et al. (2018) presented a compilation of ∼ 47 600 clusters identified from photometric data of 2MASS, WISE, SuperCOSMOS, and detection by ROSAT and XMM-Newton.Finally, the catalogue of Kirkpatrick et al. (2021) was included, which is the largest compilation of visually confirmed X-ray selected spectroscopic galaxy clusters.The latter lists 2 740 galaxy clusters spectroscopically confirmed using SDSS DR16 spectroscopic data for 33 340 individual galaxy members.In all the catalogues used, the redshift distributions of the clusters drop significantly at z ∼ 0.6.
We consider a galaxy as belonging to a cluster if (a) the projected distance from the catalogued cluster position is d c ≤ 1 Mpc at the redshift of the cluster and (b) the radial component of the relative velocity with respect to the cluster is v rel ≤ 10 3 km s −1 .The latter was determined from the relation v rel /c = (z g − z cl )/(1 + z cl ) (Harrison 1974), where z g and z cl are the redshifts of the galaxy and the cluster, respectively.We identified host clusters in all used catalogues, with the sole exception of the X-ray cluster catalogue by Kirkpatrick et al. (2021).If a nearby cluster was identified in two or more catalogues, we used the following priority list: (1) Wen & Han (2015), (2) Banerjee et al. (2018), (3) Burenin ( 2017), (4) Hao et al. (2010), (5) Wen et al. (2018).The catalogue by Wen & Han (2015) contains by far the most sources from our samples.The order of the other catalogues was chosen arbitrarily.
The criterion of cluster membership is fulfilled by 15 sources from the PSS-CSO and 13 from the ECS sample.This corresponds to 19 ± 4 % (PSS-CSO) and 17 ± 4 % (ECS) cluster members for the redshift range z < 0.6.The difference is not significant according to the two-sided Z-test (p = 0.67).More revealing is the proportion of sources located in the cluster core regions, where the strongest effects are to be expected.Of the PSSs and CSOs, 15 ± 4% (12/78) are found within d c = 0.2 Mpc, and a similar percentage of 12 ± 4 % (9/78) is found for the ECS sample.With only two exceptions, all identified clusters are listed in the Wen & Han (2015) catalogue, where the estimated cluster radius r 500 is also given.The mean values are r 500 = 0.76 ± 0.04 Mpc for the PSS-CSO and 0.72 ± 0.09 Mpc for the ECS sample.These results suggest that the cluster environment does not play a decisive role in the difference between the merger fractions of the two samples.
Finally, we note that the merger fraction among cluster galaxies is significantly lower than in general.Among the PSS-CSOs in clusters, we find four mergers ( f m = 0.27±0.11),among the ECS none at all.The mean merger index is t m = 0.64 ± 1.08 (PSS-CSO) and 0.15 ± 0.38 (ECS), respectively.

Summary and conclusions
We compared a sample of compact and presumably young extragalactic radio sources with a sample of evolved, extended radio galaxies and QSOs.To construct the former, we used the sample from (Liao & Gu 2020) with available SDSS spectra, here referred to as the 'PSS-CSO sample'.The extended comparison sources (ECSs) were selected from a larger sample of extended radio galaxies and QSOs with steep radio spectra.The selection was made in such a way that each object from the PSS-CSO sample has one counterpart in the ECS sample with a similar redshift, 1.4 GHz luminosity, optical luminosity, and SDSS spectral class, where the linear size, measured in VLASS, must be greater than 50 kpc.
This study focuses in particular on spectra, SEDs and morphological properties and is based on archival data in the spectral range from the far UV to MIR.We divided both samples into the subsamples of galaxies and QSOs and generally compared the corresponding subsamples with each other using statistical tests.The main results are the following.
(i) We compared the distributions of the radio galaxies in three different diagnostic diagrams.There are no significant differences in the [N ii]-based BPT diagram.In the KEx diagram, the PSS-CSO sample shows a higher mean line dispersion of the [O iii] line.Both diagrams suffer from a significant bias because a substantial portion of galaxies have weak or missing Hβ lines.The more complete WHAN diagram, on the other hand, shows a significantly higher percentage of passive and retired galaxies in the ECS compared to the PSS-CSO sample.
(ii) In the SDSS composite spectra of the galaxy subsamples, the continuum is sufficiently well described by a mixture of old early-type galaxies and SF galaxies, with a significantly larger contribution from SF galaxies in the PSS-CSO sample.Such a difference is not visible between the continua of the QSO composites.All four composite spectra show relatively strong forbidden emission lines [O iii] and [O ii] in combination with weak Fe blends, as expected for luminous radio galaxies.The PSS-CSO composites show a larger line ratio [O ii]/[Ne v] which can be attributed to SF or to the interaction of the radio jets with the dense interstellar medium of the host galaxy.
(iii) We classified the individual MBSEDs using a set of template spectra.44 ± 7 % of the galaxies from the ECS-G sample are best matched by templates of early-type galaxies, compared to only 31 ± 6 % for the PSS-CSO-G sample.The latter is largely in agreement with Nascimento et al. (2022).The proportion of SF galaxies is significantly larger for PSSs and CSOs than for ECSs, both in the entire sample and in the two subsamples of galaxies and QSOs.The colour-colour diagram based on the first three WISE bands also shows a significantly lower proportion of SF galaxies in the ECS compared to the PSS-CSO sample.
(iv) The distributions of the D4000n index are different for PSS-CSO and ECS galaxies.The mean values are D4000n = 1.70 ± 0.04 for the PSS-CSO-G and 1.88 ± 0.02 for the ECS-G sample.The larger value for the latter is consistent with the stronger contribution of old stellar populations and is consistent with the previous points.The distributions of the SFR and the sSFR from Chang et al. (2015) also differ for the two samples in such a way that they indicate a higher proportion of SF galaxies in the PSS-CSO sample.In the SFR -stellar mass diagram, 42 ± 10 % of the PSS-CSO galaxies with available SFR data lie on the main sequence of SF galaxies, compared to only 22 ± 8 % for the ECS-G sample.For the vast majority, the SFRs and stellar masses of PSS-CSO-G host galaxies cover the ranges from ∼ 0 to 5 M ⊙ yr −1 and 3 • 10 11 to 10 12 M ⊙ , respectively, in good agreement with Nascimento et al. (2022).
(v) We analysed archival HST images of 18 PSSs and CSOs from our sample.In all cases where the host galaxy can be studied, we found indications of gravitational interactions and mergers.Motivated by this high merger fraction, we used the DESI Legacy Surveys images to analyse the morphologies of all objects with z ≤ 0.3 from both samples.We found a higher proportion of mergers in the PSS-CSO (0.61 ± 0.07) compared with the ECS sample (0.15 ± 0.06).This highly significant difference (p < 10 −5 ) can be explained by assuming that 61% of the PSSs and CSOs were triggered by mergers, but 88±3 % of the mergertriggered or 54±2 % of all young radio sources do not evolve and are thus not represented in the comparison sample of extended radio sources.This interpretation is in line with the recent finding by Kiehlmann et al. (2023) that most CSOs belong to a distinct class of jetted AGNs.
(vi) To look for possible differences in the environments of the PSSs-CSOs and the ECSs, we matched the positions and redshifts of the sources with z < 0.6 from both samples with several large galaxy cluster catalogues.The results do not suggest a significant difference.
(vii) We provide further evidence that the host galaxy of the CSS source 4C +39.29 is most likely not SDSS J101714.23+390121.1 at z = 0.211 but SDSS J101714.10+390123.8 at z = 0.526 as suggested already by Gandhi et al. (2006).The observations are consistent with an ongoing jet-cloud interaction in that source.
We conclude that our PSS-CSO sample consists of both young radio sources that evolve into extended radio structures and frustrated or hindered sources that do not.Galaxy-galaxy interactions and mergers appear to play an important role in triggering the radio AGN.These results need to be verified with a larger PSS-CSO sample.

Fig. 1 .
Fig. 1.VLASS contours of the three sources from the Liao & Gu (2020) sample with VLASS sizes larger than 50 kpc.The ten red contour lines are logarithmically spaced between ∼ 5σ of the local noise and the maximum flux of the source.The green dashed lines are negative contours at −5σ to −3σ.The background images are from the SDSS i band.The vertical bar at the bottom indicates a length of 50 kpc at the redshift of the source.

Fig. 2 .
Fig. 2. Comparison of the PSSs and CSOs with extended radio galaxies and quasars in the z − L 1.4GHz − M i parameter space.Left and middle column: Projections of the distributions of the PSS-CSO sample (blue filled squares) and the extended comparison galaxies and quasars.The panels on the left-hand side show the different contributions to the basic sample from which the comparison sample is drawn (small symbols: red plus signs -Shen et al. (2011), black plus signs -Garon et al. (2019), cyan filled squares -Rafter et al. (2011), green plus signs -Kuźmicz & Jamrozy (2021), magenta filled squares -Kozieł-Wierzbowska & Stasińska (2011)).In the middle column, the final ECS sample (green) is compared with the PSS-CSO sample (blue).Right column: histogram distributions of the PSSs and CSOs (blue) and the ECSs (green).

Fig. 3 .
Fig. 3. Frequency distribution of the spectral slope α r (left) and the LLS (right) for the ECS sample.

Fig. 4 .
Fig. 4. Power -linear size diagram for the PSS-CSO (blue) and the ECS (green) samples.Left arrows indicate upper limits for the LLS.For the PSS-CSO sample, the sizes are taken from Liao & Gu (2020).The dashed lines depict evolutionary tracks fromAn & Baan (2012) for sources of high radio-power (red) and low radio-power (blue).

Fig. 5 .
Fig. 5. Diagnostic diagrams for the galaxies (SDSS spectral class = GALAXY) from the PSS-CSO (blue) and the ECS (green) samples with S/N ≥ 3 for used lines.Top: [N ii] based BPT diagram for z < 0.35 with the upper limit for SF galaxies from Kewley et al. (2006) (dashed), the lower limit for AGNs from Kauffmann et al. (2003a) (dotted), and the separation between AGNs and LINERs from Schawinski et al. (2007) (dashed).Middle: KEx diagram with demarcation lines from Zhang & Hao (2018).Bottom: WHAN diagram with demarcation lines from Cid Fernandes et al. (2011).

Fig. 6 .
Fig. 6.Distributions of the types from the diagnostic diagrams in Fig. 5 for the galaxies from the PSS-CSO (blue) and the ECS (green) samples: [N ii]-based BPT diagram (top), KEx diagram (middle), and WHAN diagram (bottom).The vertical bars indicate the standard errors of the proportions.

Fig. 7 .
Fig. 7. Composite spectra for the PSS-CSO (top) and the ECS (bottom) sample.The QSO subsamples are shown in blue, the galaxy samples in red.The former were corrected for slight internal reddening using the SMC dust model to match the composite spectrum of the FBQS radio-loud quasars from Brotherton et al. (2001) (magenta).The spectra for the galaxy subsamples were shifted upwards by 2.5 flux units for clarity.The green spectra are best-fit models for the continuum (see text).The positions of the strongest emission lines are indicated by dotted vertical lines and labelled by ion in the lower panel.

Fig. 8 .
Fig. 8. Residual spectra (composite minus model) for the PSS-CSO-G (a) and the ECS-G (b) subsample.The model spectra were arbitrarily scaled in such a way that the mean flux in the interval 4200-4800 Å is the same as in the corresponding observed composite spectrum.

Fig. 9 .
Fig. 9. MBSEDs (in the rest frame) for a selection of nine sources from the PSS-CSO sample (black symbols with error bars).Overplotted are the SDSS spectrum (green) and the best-fitting template (red).The number and name of the best-fitting template are given in the top right corner of each panel.

Fig. 10 .
Fig. 10.Histogram distributions of type t SED for the G (top) and the Q subsample (bottom) of the PSS-CSO (blue) and the ECS (green) sample.The vertical bars indicate the standard errors of the proportions.

Fig. 11 .
Fig. 11.MIR-to-radio flux density ratio q22 vs. redshift for the PSS-CSO (top) and the ECS (bottom) sample; only sources with reliable type classification from the MBSED fitting and with S/N > 3 for the W4 flux density are plotted.Filled squares stand for the QSO subsample, open squares for the G subsample.The colours indicate types from MBSED fitting (see inset).

Fig. 12 .
Fig. 12. WISE W1 − W2 vs. W2 − W3 diagram for the PSS-CSO (top) and the ECS (middle) sample.The symbols and colours have the same meaning as in Fig. 11.The dotted vertical demarcation lines are from Jarrett et al. (2017), the horizontal line marks the AGN threshold from Stern et al. (2012).Bottom: Histogram distributions of W2 − W3 for the PSS-CSO-G (blue) and ECS-G (green) galaxies with W1 − W2 < 0.8.Only sources with S/N > 5 in the bands W1 and W2 and with S/N > 3 in the band W3 are plotted.

Fig. 13 .
Fig. 13.The same as in Fig. 12, but for W1 − W2 vs. W3 − W4.Only sources with S/N > 5 in the bands W1 and W2 and with S/N > 3 in the bands W3 and W4 are plotted.

Fig. 14 .
Fig. 14.D4000n index and sSFR.Top and middle: sSFR from Chang et al. (2015) versus D4000n for the PSS-CSO and the ECS samples.Filled squares signify sSFR data from good model fits (FLAG=1).The dashed line indicates the boundary between the SF sequence (above) and passive galaxies (below).The dotted line is a power-law approximation of the mean values fromBrinchmann et al. (2004, their Fig.11).The spectral types t SED from the MBSED fit are colour coded as in Fig.12.Bottom: Histogram distributions of the D4000n index for the PSS-CSO (blue) and ECS (green) galaxies with S/N > 3.

Fig. 15 .
Fig. 15.Stellar mass and SFR.Top and middle: SFR versus stellar mass fromChang et al. (2015) for the PSS-CSO and the ECS samples.The colour coding of the symbols is the same as in Fig.14, filled squares signify galaxies with good model fits.The cyan curves are local density contours maps for ∼ 9 • 10 4 SDSS galaxies.The black lines indicate the SF sequence (solid) and the 3σ scatter (dashed), the blue lines are the corresponding extrapolations to log M * /M ⊙ = 12.Bottom: Histogram distributions of the SFR for the PSS-CSO (blue) and ECS (green) galaxies with FLAG = 1.

Fig. 16 .
Fig. 16.Image cutouts from the HST Legacy Archive (WFPC2, ACS, WFC3/UVIS with various filters) for 18 PSSs and CSOs (always in the image centre).The image size is adapted to the size of the ambient structure, the bar indicates 10 kpc at the distance of the source (N up, E left).Article number, page 18 of 39

Fig. 18 .
Fig. 18.Merger fraction per MBSED type t SED for PSS-CSO (blue) and ECS galaxies (green) with z < 0.3.The vertical bars indicate the standard errors of the proportions.
episodic sources or 'impostors' which do not evolve.The ratio of the merger fraction to the non-merger fraction for the evolved sources is R 1 = f e m thus the merger fraction of the ECS sample results in f ECS m

Fig. B. 2 .
Fig. B.2.As in Fig. B.1, but for the ECS host galaxies with z < 0.3 and t m ≥ 2.0.

Table 1 .
Sizes of sources from the Liao & Gu (2020) sample found to be extended in the VLASS.
Notes.The angular sizes are as measured from the VLASS images.A colon in the second column indicates uncertain classification.

Table 3 .
Diagnostic line ratios measured from the composite spectra.
sample log [O ii]/[Ne v] log [O ii]/[O iii] log [O iii]/Hβ log [N ii]/Hα FWHM [OIII] (The ratio [O ii]/[O iii] for the ECS-Q subsample is close to that for the SDSS quasar composite spectrum (Vanden Berk et al. 2001).The value for the PSS-CSO-Q subsample is twice as high and thus indicates a lower ionisation.This is again in agreement with the interpretation of the strength of the [O iii] line.The same trend is indicated by the comparison of the PSS-CSO-G and ECS-G composites.For the PSS-CSO-G subsample the line ratio [O ii]/[O iii] is slightly larger than 1, as typical of LINERs.